- How to find out that Darwish's Probabilistic Structured Query is better than Pirkola's theory through the graph in slides?
- How to understand the relationship between CLIA, MT and summarization?
Monday, March 31, 2014
Week 12 muddiestPoint
Tuesday, March 25, 2014
Week 12 reading note
Cross-Language information retrieval addresses the problem of finding information in one language in response to queries expressed in another. CLIR is sometimes called” trans lingual information retrieval”. There are three major challenges in translation-based CLIR: what to translate, how to obtain translation knowledge, and how to apply the translation knowledge.
There are also some natural language processing techniques like text processing progress.
The evaluation of an interactive CLIR system can be modelled by examining how well a CLIR system can support: (2) query formulation and translations, and (2) document selection and examination.
Probably the most noticeable achievement in CLIR is that cross-language document ranking can often achieve near 100%, or even higher. Of the retrieval effectiveness of monolingual document ranking.
Week 11 muddiestPoint
In PageRank, if the number of out-link is too large, this page is almost equal to the page has no out-link, how to distinguish these two types of pages?
Monday, March 17, 2014
Week 11 reading note(IIR)
After reading chapter 19, I know the early web search engines. Early attempts at making web information “discoverable” fell into two broad categories: (1) full-text index search engines such as Altavista, Excite and Infoseek and (2) taxonomies populated with web pages in categories, such as Yahoo! The web search engine also comes out a new field called search engine marketing. For advertisers, understanding how search engines do this ranking and how to allocate marketing campaign budgets to different keywords and to different sponsored search engines has become a profession known as search engine marketing (SEM).
How do search engines differentiate themselves and grow their traffic? Here Google identified two principles that helped it grow at the expense of its competitors: (1) a focus on relevance, specifically precision rather than recall in the first few results; (2) a user experience that is lightweight, meaning that both the search query page and the search results page are uncluttered and almost entirely textual, with very few graphical elements. The effect of the first was simply to save users time in locating the information they sought. The effect of the second is to provide a user experience that is extremely responsive, or at any rate not bottleneck by the time to load the search query or results page.
Subscribe to:
Comments (Atom)