搜索 Keyword Rules & Tips
1. 遵守中国大陆相关法律法规
2. 提倡分享有借鉴意义的搜索经历
3. Think with Search(Keyword)

谣传:Google 创始人是受到李彦宏演讲的启发申请了 Page Rank

查看: 3816|回复: 1
1
撩月 发表于 2018-1-8 10:40:32

民间广为流传的说法是:Google 的创始人佩奇和布林是在1998年的国际互联网大会上听到了李彦宏的演讲,这才受到启发申请了自己的 Page Rank 算法专利。

然而事实上,第七届国际互联网大会是在1998年4月14~18日召开的;而page rank专利则申请于1998年1月——看来佩奇和布林还发明了时间机器呢,就是不知道他们为什么不为时间机器申请个专利。

其实,当时佩奇和布林向李彦宏询问的,是“搜索引擎如何实现商用化”问题——有点社会经验恐怕就会微笑了:这明显是试探潜在竞争对手有没有什么商业计划呢。


李彦宏的专利内容United States Patent: 5920859

A search engine for retrieving documents pertinent to a query indexes documents in accordance with hyperlinks pointing to those documents. The indexer traverses the hypertext database and finds hypertext information including the address of the document the hyperlinks point to and the anchor text of each hyperlink. The information is stored in an inverted index file, which may also be used to calculate document link vectors for each hyperlink pointing to a particular document. When a query is entered, the search engine finds all document vectors for documents having the query terms in their anchor text. A query vector is also calculated, and the dot product of the query vector and each document link vector is calculated. The dot products relating to a particular document are summed to determine the relevance ranking for each document.

原理简述:根据指向同一篇文档的链接数目为文档排序;然后在搜索时返回排序更靠前的。这很容易理解,就好像学术文档一样,越重要越核心的,被引用次数就越多。



Page Rank 专利内容United States Patent: 6285999

A method assigns importance ranks to nodes in a linked database, such as any database of documents containing citations, the world wide web or any other hypermedia database. The rank assigned to a document is calculated from the ranks of documents citing it. In addition, the rank of a document is calculated from a constant representing the probability that a browser through the database will randomly jump to the document. The method is particularly useful in enhancing the performance of search engine results for hypermedia databases, such as the world wide web, whose documents have a large variation in quality.

原理简述:先给链接数据库里的链接估算“重要度级别”;然后利用链接本身的重要程度,估计它所指向文章的质量——这也很容易理解,被爱因斯坦引用的文章,肯定比被我引用的可靠的太多。同样的,一个网站越可靠、越严肃,它所链接的文章质量就越高:反过来说也对,你尽管和别人交换链接吧,越和垃圾网站交换链接,你的估值就越低。
布拉格 发表于 2018-1-8 11:48:16
两者创立公司的动机也有差异吧...李彦宏就是瞅准了商机,回国给搜狐那些门户网站提供站内搜索技术服务。
您需要登录后才可以回帖 登录 | 立即注册

本版积分规则

虫部落 陕ICP备14001577号-1川公网安备 51019002003015号联系我们FAQ关于虫部落免责声明虫部落生存法则蛙先知 - AI 玩家社区 🚧

Build with for "make search easier" Copyright © 2013-2024. Powered by Discuz! GMT+8, 2024-3-29 03:38

快速回复 返回顶部 返回列表