看球学日语:gohan
日本对澳大利亚那场,听到最多的是gohan...两个字.
解说员,赛后的队员记者会.大家都在说.
我就奇怪了为什么揪着"御饭"(也就是大米饭)不放呢?
难道是在德国受到了不公平的待遇,没给吃饱饭?这个照理说,是中国队经常得到的待遇啊.原来是连带着亚洲球队全都被歧视...
嗯...存疑
直到今天,又听到gohan,才突然天雷地火电闪雷鸣地明白.人家!那个!应该是:
后半!!!后半场的简称.
服了我自己...
In Progression, Before Mature
日本对澳大利亚那场,听到最多的是gohan...两个字.
解说员,赛后的队员记者会.大家都在说.
我就奇怪了为什么揪着"御饭"(也就是大米饭)不放呢?
难道是在德国受到了不公平的待遇,没给吃饱饭?这个照理说,是中国队经常得到的待遇啊.原来是连带着亚洲球队全都被歧视...
嗯...存疑
直到今天,又听到gohan,才突然天雷地火电闪雷鸣地明白.人家!那个!应该是:
后半!!!后半场的简称.
服了我自己...
0
comments
Posted by
midi
@
1:04 AM
for
William W. Cohen, Pradeep Ravikumar, S. E. F. (2003). A comparison of string metrics for matching names and records. the Workshop on Data Cleaning and Object Consoliation.
and
http://secondstring.sourceforge.net
1.edit distance: (the differences in position matters)
0
comments
Posted by
midi
@
2:39 PM
1. record linkage:
http://en.wikipedia.org/wiki/Record_linkage
Record linkage also known as deduplication, refers to the task of finding entries that refer to the same entity in two or more files. Record linkage is an appropriate technique when you have to join data sets that do not have a unique database key in common. A data set that have been through Record linkage is said to be linked.
2.Blocking methods
are used in record linkage systems to reduce the number of candidate record comparison pairs to a feasible number whilst still maintaining linkage accuracy.
Blocking methods partition the data sets into blocks or clusters of records which share a blocking attribute or are otherwise similar with respect to a defined criterion.
e.g. from [ref2.]
standard traditional blocking
0
comments
Posted by
midi
@
6:09 PM
抽空写两句。
前两天跟DN说起来google怎样全方位掌控着个人隐私的问题。比如他就不用google的canlendar。而我就觉得无所谓。
你用了google的“免费”服务,总是要付出点什么的。(世界就要这么运转,哪能天上掉馅饼不是。)我付出的,就是我输入的关键字被它跟踪,刻画我这个人的爱好;我的邮件上下文都在gmail中;我的日程表为google内部工作人员可见。这就是我与google 签订的隐含协议:出卖我自己,买来些服务。
你用hotmail,用yahoo,用baidu,都是这样。但我只跟google做这样的买卖。因为它说,dont be evil,而且也在这么做。
美国政府向yahoo, AOL,微软, google要求搜索数据,只有google拒绝。大家就可以理解google是如何的难得了。
在前者的合同条款中(也许写在那个谁也不会读的服务条约中,也许没有),“你”的信息被完全卖给了全世界,它们可以为所欲为。而在google的合同中,只是和google在做交易。两相比较一下哪个买卖更划算一目了然。
也正是如此,当google.cn开始为了进入中国提供经过过滤的信息时,人们才那么的焦虑以及不安。因为,在原则问题上退一步还是一百步,没有区别。换句话说,同样是我的个人资料交出去,买到的信息不是本来的样子。已然隐隐让人觉得有些亏本。况且,能在提供信息上让步,就未必不会再信息保密上让步。大家都不是傻的,看看itwire上给的统计数据“... Brin told Reuters that only 1% of Chinese users accessed Google.cn with the rest going to Google.com.”
更新的报道是“Google创始人考虑抛弃谷歌,策略与理念冲突”当然我不知道这个声明背后是什么原因。可能是因为作了让步的google仍然时不时受到GFW的干扰屏蔽,所以放些话来造势和谈判?或者像连岳说的:“Google也许知道了没有"半人半奴"这种选项”。我不要求google一贯正确,但很欣慰看到他一直坚持自己的原则。
等到他放弃原则的那天,就是我开始放弃google的那天。
希望那天不会到来。
0
comments
Posted by
midi
@
1:41 PM

实验室的法国佬原来是一个魔方高手。据说在四○多秒之内就可以拼好六面!!
让他demo了一下,zmazing!运指如飞。我很想录下来他拼魔方时候的状态放到blog上……
小时候我有过一个魔方啦。我总是先拼一面,然后想拼另一面,似乎从来没有把两面都拼好过。(印象里有一个小姨夫,拼出过两面。不知道是不是记错了)所以最后那个魔方的结局是,被我一块一块的掰下来,再重新安上,做成一个完美的六面。以后就以此为乐了,魔方被我找到了新玩法!0(^_^)V
我要不要告诉那个法国人,我能在四十秒的时间中把魔方拆了再装好?
今天才知道魔方的正确思路:先拼一面,然后拼紧接拼好那个底面的一条变,然后是之上的第二条边,然后是最后一条(连同上底面)。打个比方,就好象我们编一个笼子那样。原来我一直以为魔方是一个面一个面的拼的!
今天试了一下,可以很顺利的拼好底面和第一条边,过几天去攻克下一条。网上有很多教程来得,不过我决定先死磕死磕再看。
又刺激了我的购物欲:去tokyo hands,买个魔方。:)
0
comments
Posted by
midi
@
11:38 PM

今天发现donews的很多blog侧拦上都都出现了一个扎针小人图。原因在这里。
0
comments
Posted by
midi
@
2:37 PM
0. rdfs:member in RDF vocabulary
rdfs:member:"is an instance of rdf:Property that is a super-property of all the container membership properties i.e. each container membership property has an rdfs:subPropertyOf relationship to the property rdfs:member."
ReferTo: RDF Vocabulary Description Language 1.0: RDF Schema
1. rdfs:member with protege
Recently, I am trying to build a domain ontoloy with protege version 3.2 alpha. To reserve the full expressive capability, the project is set to be
owl full. Then I
Protege list rdfs:member as the individule of rdf:Property in the "individuals tab", not as property in the "properties tab". Now I am curious about the reason. Might I send an email to their mailinglist? @_-?
2. A quick reference to OWL (Lite, DL, Full) construct.
Since it seems one can't use rdfs:member (or it's instance RDF:_#), I have to wondering whether it is a legal member of owl full or not. And the next question is, which rdf vocabularies are reversed in owl(ful, dl and lite).
ref:
A brief summary:
0
comments
Posted by
midi
@
8:42 PM
try this:
很多事情,理论上知道跟亲身体验还真是两回事。
比如我们知道伊斯兰文都是从右向左写的。可从来没想过,他们的网页会是怎样。
上面的url是我从msn space的访问历史里面挖出来的一个。震惊啊震惊~
全部的右对齐。别扭的是是每行还是从左起。。。
很好奇他们是否国内的网站都是右对齐的……如果是的话,他们访问别的国家的网站岂不是很不习惯?
0
comments
Posted by
midi
@
5:53 PM
predefined ontologies which might be useful
1. FOAF
2. event ontology from UMBC ebiquity group
3. Dublin Core
related work
1. Stojanovic, L., Staab, S and Studer, R. (2001) ELearning Based on the Semantic Web. Proceedings WebNet2001 - World Conference on the WWW and Internet, Orlando, Florida, USA, 2001.
2. Journal of Educational Technology & Society
Special Issue on "Ontologies and the Semantic Web for E-learning"
0
comments
Posted by
midi
@
5:35 PM