Ten Years

Ten years to forge a single sword!
-------------------------------------------------
Operating System Research / Technique

Sunday, September 24, 2006

(HotOS'03 Note) Why Can't I Find My Files?

Why can’t I Find My Files? New methods for automating attribute assignment
Craig A.N. Soules
HotOS 2003

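This paper discusses the problem of assigning attributes in attribute-based file management and retrieval systems. For attribute-based naming systems, the key challenge is assigning useful, meaningful attributes to files. The two usual approaches are user input and content analysis. User input is tedious, while content analysis is complex and also runs into problems with proprietary file formats and data types. The authors therefore discuss context analysis, which covers two methods: access-based context analysis and inter-file context analysis. The idea is borrowed from Web search: Web search engines, however, have found greater success obtaining attributes by combining content analysis with context analysis.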

Access-based context analysis
a) Application assistance, e.g., the subject line of an email describes its attachments
b) Existing user input, e.g., a file's path says something about the file's contents
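
To make item b) concrete, here is a minimal sketch of turning a file's path into keyword attributes. It is only my own illustration, not the paper's method: the helper name `attributes_from_path`, the tokenization rule (split on anything that is not a letter or digit), and the sample path are all assumptions.

```python
import re
from pathlib import PurePosixPath


def attributes_from_path(path: str) -> set:
    """Derive lowercase keyword attributes from directory names and the file stem.

    Hypothetical helper: the user already "described" the file when choosing
    where to save it, so each path component is treated as a candidate attribute.
    """
    p = PurePosixPath(path)
    # Use every directory component plus the file name without its extension.
    components = list(p.parts[:-1]) + [p.stem]
    attributes = set()
    for component in components:
        # Split on any run of characters that is not a letter or a digit.
        for token in re.split(r"[^0-9A-Za-z]+", component):
            if token:
                attributes.add(token.lower())
    return attributes


if __name__ == "__main__":
    print(sorted(attributes_from_path("/home/alice/papers/HotOS_2003/attribute-naming.pdf")))
```

A real system would presumably weight or filter these tokens (for example, dropping generic names such as "home"), which this sketch does not attempt.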

Inter-file context analysis
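a) User access patterns, e.g., files that a user opens one after another are often related
b) Inter-file content analysis, e.g., much as some systems use hash functions to avoid storing duplicate blocks, file contents can be compared to find related files. (Remember compare-by-hash?) This is especially useful for multiple versions of the same file.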
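
Both ideas above can be sketched in a few lines. This is a rough illustration under my own assumptions, not what the paper implements: the access log is a time-sorted list of `(timestamp, path)` pairs, "opened in succession" means within a `window` of seconds, chunks are fixed-size (real systems often use content-defined chunk boundaries), and similarity is the Jaccard overlap of SHA-1 chunk hashes. The function names are hypothetical.

```python
import hashlib


def related_by_access(log, window=30.0):
    """Pair up different files accessed back to back within `window` seconds.

    `log` is assumed to be a list of (timestamp_seconds, path), sorted by time.
    """
    pairs = set()
    for (t1, f1), (t2, f2) in zip(log, log[1:]):
        if f1 != f2 and t2 - t1 <= window:
            pairs.add(frozenset((f1, f2)))
    return pairs


def chunk_hashes(data: bytes, chunk_size: int = 4096) -> set:
    """Hash each fixed-size chunk, in the spirit of compare-by-hash."""
    return {
        hashlib.sha1(data[i:i + chunk_size]).hexdigest()
        for i in range(0, len(data), chunk_size)
    }


def content_similarity(a: bytes, b: bytes, chunk_size: int = 4096) -> float:
    """Jaccard overlap of chunk hashes: 1.0 for identical chunk sets, 0.0 for disjoint."""
    ha, hb = chunk_hashes(a, chunk_size), chunk_hashes(b, chunk_size)
    if not ha and not hb:
        return 1.0  # two empty files are treated as identical
    return len(ha & hb) / len(ha | hb)


if __name__ == "__main__":
    log = [(0, "paper.tex"), (5, "paper.bib"), (500, "kernel.c")]
    print(related_by_access(log))
    v1 = b"A" * 8 + b"B" * 8 + b"C" * 8
    v2 = b"A" * 8 + b"B" * 8 + b"D" * 8
    print(content_similarity(v1, v2, chunk_size=8))
```

Two versions of a file that share most chunks get a high score, so attributes assigned to one version could plausibly be propagated to the other. Note that with fixed-size chunks an insertion near the start of a file shifts every later chunk, which is one reason this is only a sketch.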

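PS: While reading this, I actually found myself more interested in Google's search technology. I feel like I'm on my way to becoming a die-hard GFan, haha.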
