Taxonomy for search engines

Taxonomy for search engines refers to classification methods that improve relevance in vertical search. Taxonomies of entities are tree structures whose nodes are labelled with entities likely to occur in a web search query. Searches use these trees to match keywords from search a query to keywords from answers (or snippets).

Taxonomies, thesauri and concept hierarchies are crucial components for many applications of information retrieval, natural language processing and knowledge management. Building, tuning and managing taxonomies and ontologies are costly since a lot of manual operations are required. A number of studies proposed the automated building of taxonomies based on linguistic resources and/or statistical machine learning ^[1]

Web mining is one approach to build a search engine taxonomy. The taxonomy construction process starts from seed entities, and mines available source domains for new entities associated with these seed entities. The process forms new entities by applying machine learning to current web search results for existing entities to identify commonalities between them. These commonality expressions then form parameters of existing entities, and turn into new entities at the next learning iteration.^[2]

References

↑ Vicient C. , Sánchez D., Moreno A.. An automatic approach for ontology-based feature extraction from heterogeneous textual resources. Engineering Applications of Artificial Intelligence. 2013;26(3):1092–1106. doi:10.1016/j.engappai.2012.08.002.
↑ Galitsky B. Transfer learning of syntactic structures for building taxonomies for search engines. Engineering Applications of Artificial Intelligence. 2013;26(10):2504–2515. doi:10.1016/j.engappai.2013.08.010.

Taxonomy for search engines

References

See also