11/26/2005

Movies



--------------
Harry Potter
--------------
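Every day, walking through the subway, I see these gorgeous posters. The little Harry on them (faint, that also seems to be the name of Yu Chengqing and Yi Nengjing's son) is finally no longer a baby-faced kid; he is growing up and starting to look like a bit of a pretty boy. I am quite satisfied with that. So I got the idea of going to see it.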

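If I need a slightly sentimental excuse, it is that I thought again of the people I used to watch movies with in the big lecture hall. The four kids from 243 ran off together to see the previous Harry (I forget which one it was). Honestly, the movie was boring, really boring.
Or buying tickets as a group to see The Lord of the Rings. Honghong and I took ten student IDs and went to queue very early. It turned out there were eight or nine people ahead of us, each holding a dozen or so student IDs; and all of them, all of them, from the computer science department... And then the girls in the hall gasped, theatrically yet sincerely, at the beautiful, adorable, peerless, number-one-under-heaven, invincible elf prince, Orlando Bloom.
(picture)
Or going with Xiaogang to see Lao She's 正红旗下 (Beneath the Red Banner).
You can't really call it being moved, or nostalgia; the memories simply stayed behind, without leaving a trace of how.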

I'll find a chance to go to the cinema and see Harry Potter.
------------------------------------------
《猛龙》 (Dragon Squad) and 《龙城岁月》
------------------------------------------
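A couple of days ago I watched 《猛龙》 and 《龙城岁月》.
About the former, all I can say is that the days when one or two (Hong Kong) stars could carry the box office are hard to see again. Is it because the old ones got old and the dead ones died?
Dragon Squad is, put simply, a beautifully shot action movie in which an international good-guy team slugs it out with an international bad-guy team. The good-guy team has my favorites: the little Jiang Wen! (Xia Yu?) Vanness Wu! (since leaving F4 he really does act quite well! those eye corners slanting upwards) Huang Shengyi! (qie.... tomato...) There are three more, sorry; I don't like them, so they are omitted. The bad-guy team has Maggie Q, the mixed-race one. Hmm. Somehow she looks a little like Shu Qi.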

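That reminds me of Jiang Wen. How to describe Jiang Wen? I think it is precisely because of Jiang Wen that so many girls and young wives throw away their youth and passion to quietly become the other woman; and that middle-aged men have hope to go on living.
A man with such ordinary looks (even lower than average) who, without lifting a hand, without even glancing at you, can put a spell on you.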

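《龙城岁月》 is a film that has to be praised by name. The sets, the lighting, the acting and the music. Try to imagine what a Hong Kong version of Pulp Fiction would look like; that is roughly 《龙城岁月》.
The time seems to be set 15 years ago? For someone of Midi's age, that is neither far nor near, and too many things about Hong Kong come to mind. Sometimes I get confused: I can't tell whether I am watching a movie shot 15 years ago, or a movie about 15 years ago.
Hmm. I just baidu'd it (praising my own taste by name once more. hoho), and supposedly the film's violent scenes were considered excessive, and so on. Maybe what I saw was a cut version. The violent scenes I saw were all set to very fitting music (sorry, limited vocabulary, all I can say is "very fitting"), and the rhythm of the actors' movements and of the images was handled very well. While watching, the words "aesthetics of violence" came to mind for the first time. Personally I think it is better than Sin City! Sin City uses black and white, which makes for easy visual impact, but it doesn't sink in as deeply.
Leung Ka-fai (梁家辉) and Simon Yam (任达华), one still and one in motion, are both commendable. Every time you watch Leung Ka-fai act, it feels a bit over the top, but when you think back afterwards, his way of playing it seems the most fitting after all: a little neurotic and self-absorbed. Simon Yam takes the "inner-energy school" (气宗) route in this film; like Jiang Wen, he finishes off the enemy without making a single move.
The plot is just so-so. The ending is even more idiotic. A core triad member actually tells the police that unchecked human greed is a truly terrifying thing... I faint.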

THE SEMANTIC WEB: AN INTERVIEW WITH TIM BERNERS-LEE -by Andrew Updegrove

---------------------------------------
Brief Content of the Interview
---------------------------------------

Here is Andrew Updegrove's interview with Tim Berners-Lee.

The topics include:
"Questions and Answers: Our interview was intended to help those that do not yet have the Semantic Web in focus gain an understanding of what the Semantic Web will (and will not) be, what we can look forward to using it for, and how it is likely to become real. Our questions were divided into the following six categories:

  • Vision: Why build a Semantic Web now, rather than add other capabilities at this point in time, and what new capabilities will the Semantic Web have?
  • Status: Who is already committed to create the Semantic Web, and how do we get the rest on board?
  • Critics: What has worked well and what has not proceeded so smoothly in developing Semantic Web standards, and what are the things that critics don’t “get” about the Semantic Web?
  • Business Reality: What are the biggest challenges to bringing the Semantic Web into being?
  • Infrastructure: Will other standards be needed in the future to take full advantage of the Semantic Web, and who will develop them?
  • Users: What will it be like to use the Semantic Web?

"

---------------------------------------
Important Points
---------------------------------------

1. In the Semantic Web, it is not necessary for every page to have corresponding metadata (encoded in RDF/OWL). That is, there are pages for people only, metadata for machines only (which people can of course read and use directly), and pages accompanied by their metadata.

2. In recent years, W3C has mostly contributed an infrastructure for describing knowledge in a machine-processable way in distributed environments, not methods for creating metadata. It is like defining the TCP/IP protocol stack rather than building applications on top of TCP/IP.

"CSB: What are the limitations of the Semantic Web – what will it enable someone to do, and what will it not permit us to do?
TBL: The goal of the Semantic Web initiative is to create a universal medium for the exchange of data where data can be shared and processed by automated tools as well as by people. The Semantic Web is designed to smoothly interconnect personal information management, enterprise application integration, and the global sharing of commercial, scientific and cultural data. We are talking about data here, not human documents.
The Semantic Web is not about the meaning of English documents. It’s not about marking up existing HTML documents to let a computer understand what they say. It’s not about the artificial intelligence areas of machine learning or natural language understanding -- they use the word semantics with a different meaning.
It is about the data which currently is in relational databases, XML documents, spreadsheets, and proprietary format data files, and all of which would be useful to have access to as one huge database.

"

---------------------------------------
Still No Killer App?
---------------------------------------

When asked about the killer app of the SW, we respond: the SW itself is a killer app. But of course that is not a good answer.

In fact, it is hard for us to find a good application. Why?

1. If we set the scenario to be interoperation among several enterprises, we lose the feature of distribution, which is the most significant difference between SW/RDF/OWL and XML.

2. If we try to find a scenario that benefits the entire web, we will intuitively start by looking for the usefulness of the SW to ordinary people, which almost immediately amounts to making use of metadata connected with web pages.

Many researchers are working on this now, but the results are not so inspiring. I think it is because the problem is complicated and still far from being widely usable. (I think I will look into it later.)

Fortunately, the work from web2.0 will help.

The most important point, I think, is that the SW is mainly about machines, not about connecting web pages to metadata. So the SW will not make that connecting work any easier, except that it gives us a language for metadata/knowledge (which is of course important). On the contrary, it is the connecting that will make the SW more convincing.

11/21/2005

Folksonomy & Semantic Web, tagging and Google Base

------------------------------------
Finally, I am here talking about folksonomy, ontology and tagging again, because the topic I was ready to explore turned out to be published at a European conference in Dec, 2005.

The professor I work with said it is striking news for us... The idea, the experiment and the method are all quite similar to what we were going to do.

The lesson is, you should be quick!

Here you can find the paper, if you are interested.

After moaning in pain to my friends for several days, I finally calmed down and thought it all over again. And here comes the initial work.
------------------------------------

Many discussions have arisen on Folksonomy and the Semantic Web,
e.g.
Folktologies -- Beyond the Folksonomy vs. Ontology Distinction ;
http://ecolab.ruc.edu.cn/blog/zhangsr.php?itemid=132 (In Chinese);
http://spaces.msn.com/members/folksonomy/;
among which the statements from wikipedia interest me the most.

Take tagging, possibly the most popular form of folksonomy, as an example. The wide spread and growth of tagging has proved, rather than undermined, the fundamental idea of the Semantic Web: adding machine-readable metadata will make a better web for human beings.

Thus, the most valuable part of folksonomy lies in showing us a way to add metadata to the web. (Regrettably, machines are still far from satisfactory at this task.)

Tagging is just a starting point. By tagging, we get "metadata soup".
What's next? It is what Google Base has done: Google Base gets people involved in creating metadata and knowledge!
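
To make the "metadata soup" a bit more concrete, here is a minimal Python sketch, assuming each act of tagging can be recorded as a (user, resource, tag) triple. The users, URLs and tags are made up for illustration, and `tags_by_resource` is a hypothetical helper, not part of any tagging service's API.

```python
from collections import Counter, defaultdict

# Hypothetical tagging records: (user, resource URL, tag).
ASSIGNMENTS = [
    ("alice", "http://example.org/rdf-intro", "RDF"),
    ("bob", "http://example.org/rdf-intro", "rdf"),
    ("bob", "http://example.org/rdf-intro", "semanticweb"),
    ("carol", "http://example.org/tagging", "folksonomy"),
]


def tags_by_resource(assignments):
    """Count, per resource, how often each (lowercased) tag was applied."""
    counts = defaultdict(Counter)
    for _user, resource, tag in assignments:
        counts[resource][tag.lower()] += 1
    return counts


soup = tags_by_resource(ASSIGNMENTS)
for resource, tag_counts in sorted(soup.items()):
    print(resource, dict(tag_counts))
```

Even this flat counting already gives a machine something to work with: the tags people agree on most for a resource behave like crude, human-supplied metadata.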

11/20/2005

An alarm before dawn



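While sleeping in a daze, I was woken by an alarm. It was a long process of waking up, not waking all at once and springing out of bed: I dreamed of an alarm, then gradually realized and confirmed that the sound was real. Could it be a fire or an earthquake? But I'm so sleepy; since there is only the alarm and no sound of people moving, it should be OK; I don't want to move... Ugh, but it keeps ringing, what a nuisance. Get up, get up.... I don't want to move; let me first work out where the sound is coming from. Outside the window? The corridor? Outside the window? The corridor? Outside the window....?
Forget it, get up!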

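5:37 in the morning.
First I looked at what was going on outside the window. The sky was still pitch dark, there was not a soul on the road, and most of the windows of the building opposite were dark; one or two scattered ones still had lights on, the light seeping through thick curtains. A typical winter early morning. Which made the alarm sound all the more piercing.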

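I decided to go out and take a look. I had never used the peephole before, but subconsciously I first used it to check the situation outside. I couldn't see anything. I went out; the corridor was dead silent too. It turned out that only the alarm next door was ringing. A red warning light was flashing. I patrolled around and confirmed that only the alarm next door was ringing. While I was patrolling, the bell stopped. On the way back to my room, I felt afraid. Why was I the only one there? One possibility: only I had been woken up. The other: everyone had already evacuated (hmm, of course that would be far too Hollywood). I wanted to talk it over with someone, but in the middle of the night... disturbing someone's sleep is the most immoral thing of all.
In passing, I stepped out of the event once more and said to the self who was walking: see, I told you, living alone in a foreign land is hard. And the self who was walking said: all right, all right, stop nagging, I know.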

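But why would the one next door go off? Murder? Fire? Heh, people who can't watch horror movies all have overactive imaginations and then scare themselves to death. I am one of them. In the end I decided to call the management office and ask.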

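The dorm phone seems only ever able to receive calls, not make them. I thought, never mind, if it really doesn't work I'll use my mobile. I rummaged through everything to dig out the booklet handed out when I moved in and found the number, then started composing sentences and consulting the dictionary. I looked up "alarm bell" and "ring". As I looked them up I thought, this is really ridiculous; if a real disaster came, I wonder whether I would have time to consult a dictionary.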

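I tried the internal line first, and it actually connected! Then I said: "Excuse me, I am from D311. Just now the alarm bell above the door of D312 rang, and now it is fine. I am very worried, I don't know what?" That is my current level of Japanese; the huge gap between my Chinese and my Japanese really leaves me speechless. Hmm. The other side said a whole stream of gurgle-gurgle to me. I caught a few keywords: earlier, open, and "stop" in Japanese pronunciation. I figured there was probably nothing wrong. Finally I confirmed: "Is it fine now?" After getting an affirmative answer, I said thank you and excuse me, and hung up.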

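It took quite a while to fall asleep again. Before sleeping I decided that today I would ask why the phone can only call the office, and write a blog post.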

11/17/2005

The photo album on blogspot and lc's post about research

Today I installed a flickr photo album on my google blog and deleted the old map (that map was really pointless).
I put the photos from the Nikko (日光) trip in it. Not bad. ^_^

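Then I saw the post "There's no tommorrow" on lc's blog.
Talk about one stone raising a thousand ripples~ All the overseas students who lurk, do no research, and hang out on forums and blogs every day
surfaced one after another to say they feel the same. hoho... midi included, of course.

Hmm. Must mend my ways, must mend my ways.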

11/16/2005

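Repost: a salute to mathematicians and physicists

Over at 见招拆招 I saw "Reading Notes on Fermat's Last Theorem (Revised Edition)".

Taking myself into the jianghu of mathematics.

Fermat's Last Theorem: A Riddle That Confounded the World's Wise for 358 Years (《费马大定理》), by Simon Singh (UK), translated by Xue Mi (薛密), published by Shanghai Translation Publishing House (上海译文出版社)

"This is a brilliantly written book. The process of cracking Fermat's Last Theorem and a concise history of mathematics are organically blended together by the author, Simon Singh. But in a fit of madness, with great interest and patience, I took it apart and rearranged it in the style of Reader's Digest. As I typed it into the computer word by word, a great melancholy surged in my heart. I only hope that some youth, at a decisive moment like the one I once faced, gets to read this story."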

www.rdfabout.net

A website that explains RDF and its functions to the outside world.
As the author, Joshua Tauberer, said:

" The problem I was trying to fix is that, as
far as I've seen, we need to do a much better job of explaining to the outside world just why RDF is so simple and useful. We're not going to
reach a Semantic Web any time soon unless more people know about and understand RDF. "


The article "Quick Intro to RDF" is so well written: clear and to the point. I would recommend this article to all beginners of SW and RDF.

-------------------
Citing again,
-------------------

  • "RDF isn't strictly an XML format, it's not just about metadata, it has little to do with RSS, and it's not as complicated as you think."
  • "RDF is a method designed for expressing knowledge in a decentralized world and is the foundation of the Semantic Web, in which computer applications make use of distributed, structured information spread throughout the Web."
  • "Everything at all mentioned in RDF means something, whether a reference to something concrete in the world, an abstract concept, or a fact. Standards built on RDF describe logical inferences between facts and how to search for facts in a large database of RDF knowledge"
  • "RDF applications can put together RDF files posted by different people around the Internet and easily learn from them new things that no single document asserted. It does this in two ways, first by linking documents together by the common vocabularies they use, and second by allowing any document to use any vocabulary"
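
The last quoted point, learning from several RDF files something that no single document asserted, can be illustrated with a minimal Python sketch. It assumes each document is reduced to a set of (subject, predicate, object) triples; the `ex:` identifiers and both helper functions are hypothetical, and plain sets stand in for a real RDF library. Only `foaf:knows` and `foaf:name` are borrowed from an existing shared vocabulary (FOAF).

```python
# Two "documents" posted by different people, each a set of triples.
DOC_A = {("ex:alice", "foaf:knows", "ex:bob")}
DOC_B = {("ex:bob", "foaf:name", "Bob")}


def merge(*graphs):
    """Merge graphs by set union; shared identifiers link the documents."""
    merged = set()
    for graph in graphs:
        merged |= graph
    return merged


def names_known_by(graph, person):
    """Names of everyone `person` knows, according to `graph`."""
    known = {o for s, p, o in graph if s == person and p == "foaf:knows"}
    return sorted(o for s, p, o in graph if s in known and p == "foaf:name")


print(names_known_by(DOC_A, "ex:alice"))                # no names in A alone
print(names_known_by(merge(DOC_A, DOC_B), "ex:alice"))  # answerable after merging
```

Neither document alone says that Alice knows someone named Bob; the answer only appears once the two sets are merged, because both use the same identifier for Bob and the same vocabulary.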

11/15/2005

A handsome guy smoking, and a preview of a travelogue

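In the evening I received three photos from Xiaogang, taken while they were playing football at the floodlit court.
In Xiaogang's words: I scored a free kick at the floodlit court, so my professional football career is finally a little more complete.
The three photos are: a wide shot of everyone playing, a solo shot of Xiaogang making a V sign, and one of Xiaogang lighting a cigarette.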

Midi said: I still like the one where he is lighting a cigarette best.
Why?
Because he's handsome! (need you ask)

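Which proves, once again:
1. Girls watch football to watch the handsome guys.
2. Whenever Midi sees a handsome guy who looks good smoking, she starts drooling and can hardly keep from pouncing. Midi has discussed this with Dongzi, hmm. To each their own, as they say. (Of course an actual addiction is no good, because heavy smokers end up smelling bad all over.)
3. Xiaogang is handsome; and he looks good smoking too. ^_^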

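Haha. A certain someone should not get cocky.

Gossip done, now to hype some news.
Namely, Midi went on a trip to Nikko ("日光") yesterday and today (no wait, it's past midnight now, so it should be the day before yesterday and yesterday). Haha, once I have the photos I can write the travelogue. Please stay tuned for follow-up coverage. Keep clicking on this blog~~~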

11/11/2005

A little at a loss

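In Japanese class today, the teacher was going over vocabulary: island, peninsula, continent, and so on.
She started giving examples. Japan is an island country, India is a peninsula, "the country of Taiwan" is an island country...
At which point classmate midi said "excuse me" and gave a reminder about the Taiwan question.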

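The teacher did not really respond. While wiping the blackboard, she murmured in Japanese that people see it differently, and then that this is a political issue.
When she had finished, Midi quietly said sumimasen to apologize for interrupting her class.

But when it came up again later, the teacher once more used kuni (the Japanese word for country). I did not want to start an argument, so Midi did not jump out again.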

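In truth Midi sincerely aspires to be a gentle and kind person, and so rarely argues with people. But, shameless as it is to say, as you come to know more you form your own judgments, and it becomes hard to simply agree with others; usually, though, I only disagree inwardly and respect what each person says. So on the one hand, jumping out to remind the teacher was practically a split-second reflex; yet afterwards I felt that it put me a good deal further from my goal of being gentle and kind.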

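There are four Chinese students in the class, and only I jumped out; it even makes me wonder whether I was being a busybody.
Quite troubled; so I miss Dongzi all the more. :-)
Dongzi is a very direct person: like is like, dislike is dislike, without so much "on the one hand, on the other hand". Heaven bless Dongzi, may the exams be over soon~~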

11/08/2005

Three letters.

Can't input Chinese because I am working on an Apple with only English and Japanese input methods.

It's a desktop Apple. Of course it is cool, but I am not familiar with it yet. The keyboard is a little bit small (especially as it is in Japanese layout) and the UI is not familiar. O.K. O.K. anyway, I am old and out. I will keep working on MS Windows until there are reasons other than fashion, being "in", or evil MS. @¥@

I wrote three letters during my weekend; now I feel relieved and comfortable.

Wanghao has mentioned that Floyd has a theory that one will be happy and peaceful if and only if one gets along well with (in Chinese we use the word gao3ding4) the 10 closest people around oneself. I agreed with him, and again, this theory has now been proved true.

Recently there is a theory with the name "6-dimension space" (better known as six degrees of separation). In short it argues that you can get to know anyone through a chain of at most 6 people. But why should I bother to know people who have nothing to do with me? Suppose I could get to know Bush this way; could I persuade him to stop the war? Ridiculous. en.. maybe I am one of the people labeled inactive. ok, I am lazy and content with no more than 20 people around me.

1st letter, to Del.icio.us: to explain my experiments to the staff, and just to see if they are merciful enough to permit my use of their data. So far I have only got the confirmation letter, not a response. So I started my program again, this time with only one thread. This thread had survived for 20 hours by the time I left my dorm. My heart is bleeding, because the frequent I/O makes my hard disk noisy....
2nd letter, to Prof. Xu, my advisor at Peking Univ. I haven't been in contact with him for a long time.
3rd letter, finished at about 2:00am this morning, to my advisor here at Tokyo Univ. Because I had trapped myself: I am so eager to work on my previous topic, but the professor assigned me a new one. I am not opposed to it; I just need time and concentration. One job at a time. :D (In fact, the professor talked with me after the mail. He is so kind ^_^ and now I feel relieved. No need for worrying and wavering; I can go on with my research. hoho)

So the world is so beautiful today~~ lalalala...

11/04/2005

Three remarks

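Yesterday I went shopping with starsea and we had a great chat. We were standing at the door of a drugstore when starsea asked me: in the new Times ranking, Peking University is placed ahead of the University of Tokyo; what are your thoughts?
Midi said: I'll just pretend I never heard this news.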

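Today I ran into a former member of the lab and we updated each other on how things are going. I said I had written a crawler to grab del.icio.us data. will said: you write a crawler YOURSELF? Midi said: That's not SO difficult. A female student who graduated from the computer science department of Peking University, People's Republic of China, is still a computer science graduate, all right?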

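In the afternoon I ran into Wang Hao at a meeting and said: you bought a new computer, congratulations. Unexpectedly he said: no, I didn't buy one. I nearly fainted. Then who was the person Midi chatted with for ages on msn that evening? (It later turned out to be a classmate from Mitaka. Faint)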

11/03/2005

My IP finally got banned by del.icio.us

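I have been collecting experimental data from del.icio.us.
Very cautiously, I started five threads for the crawl.
But just when my program was finally running beautifully, I found that my IP had been banned by del.icio.us...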
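
For comparison with the five-thread run, here is a minimal sketch of a gentler crawl loop: one request at a time with a fixed pause in between. It is not the crawler from this post; `fetch` is a hypothetical callable supplied by the caller, the delay value is an arbitrary assumption, and what rate del.icio.us actually tolerates is not known here.

```python
import time


def polite_crawl(urls, fetch, delay_seconds=1.0, sleep=time.sleep):
    """Fetch URLs sequentially, sleeping between consecutive requests.

    `fetch` is any callable that takes a URL and returns its content.
    `sleep` is injectable so the loop can be exercised without real waiting.
    """
    results = {}
    for index, url in enumerate(urls):
        if index > 0:
            sleep(delay_seconds)  # pause before every request except the first
        results[url] = fetch(url)
    return results


# Usage with a stand-in fetcher (no network access) and a recorded sleep.
pauses = []
pages = polite_crawl(
    ["user/a", "user/b", "user/c"],
    fetch=lambda url: url.upper(),
    delay_seconds=2.0,
    sleep=pauses.append,
)
print(pages)
print(pauses)
```

A single slow loop like this collects data far more slowly than five threads, but it keeps the load on the remote site predictable.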


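I renewed my IP and climbed back onto del.icio.us to read its privacy statement and its copyright notice. Now I am hesitating over whether to contact them and see if we can sign an agreement, and then use their service in a more legitimate way?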

sigh.... worrying.
On the first data run I collected a bit over 30,000 records from about 2,000 users. Maybe I can start by analyzing that data?

Painful, painful.