Measuring semantic similarity between named entities by searching the Web directory

Liu Jiahui*, Lawrence A Birnbaum

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

19 Citations (Scopus)

Abstract

The importance of named entities in information retrieval and knowledge management has recently brought interest in characterizing semantic relationships between entities. In this paper, we propose a method for measuring semantic similarity, an important type of semantic relationship, between entities. The method is based on Google Directory, a search interface to the Open Directory Project. Via the search engine, we can locate the web pages relevant to an entity and automatically create a profile of the entity according to the directory assignments of its web pages, which capture various features of the entity. Using their profiles, the semantic similarity between entities can be measured in different dimensions. We apply the semantic similarity measurement to two knowledge acquisition tasks: thesaurus construction of entities and fine grained categorization of entities. Our experiments demonstrate that the proposed method works effectively in these two tasks.

Original languageEnglish (US)
Title of host publicationProceedings of the IEEE/WIC/ACM International Conference on Web Intelligence, WI 2007
Pages461-465
Number of pages5
DOIs
StatePublished - Dec 1 2007
EventIEEE/WIC/ACM International Conference on Web Intelligence, WI 2007 - Silicon Valley, CA, United States
Duration: Nov 2 2007Nov 5 2007

Publication series

NameProceedings of the IEEE/WIC/ACM International Conference on Web Intelligence, WI 2007

Other

OtherIEEE/WIC/ACM International Conference on Web Intelligence, WI 2007
CountryUnited States
CitySilicon Valley, CA
Period11/2/0711/5/07

Fingerprint

Semantics
Websites
Thesauri
Knowledge acquisition
Knowledge management
Search engines
Information retrieval
Experiments

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications

Cite this

Jiahui, L., & Birnbaum, L. A. (2007). Measuring semantic similarity between named entities by searching the Web directory. In Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence, WI 2007 (pp. 461-465). [4427135] (Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence, WI 2007). https://doi.org/10.1109/WI.2007.75
Jiahui, Liu ; Birnbaum, Lawrence A. / Measuring semantic similarity between named entities by searching the Web directory. Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence, WI 2007. 2007. pp. 461-465 (Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence, WI 2007).
@inproceedings{ab72bbe77ca4405781af6cc11fd46d47,
title = "Measuring semantic similarity between named entities by searching the Web directory",
abstract = "The importance of named entities in information retrieval and knowledge management has recently brought interest in characterizing semantic relationships between entities. In this paper, we propose a method for measuring semantic similarity, an important type of semantic relationship, between entities. The method is based on Google Directory, a search interface to the Open Directory Project. Via the search engine, we can locate the web pages relevant to an entity and automatically create a profile of the entity according to the directory assignments of its web pages, which capture various features of the entity. Using their profiles, the semantic similarity between entities can be measured in different dimensions. We apply the semantic similarity measurement to two knowledge acquisition tasks: thesaurus construction of entities and fine grained categorization of entities. Our experiments demonstrate that the proposed method works effectively in these two tasks.",
author = "Liu Jiahui and Birnbaum, {Lawrence A}",
year = "2007",
month = "12",
day = "1",
doi = "10.1109/WI.2007.75",
language = "English (US)",
isbn = "0769530265",
series = "Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence, WI 2007",
pages = "461--465",
booktitle = "Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence, WI 2007",

}

Jiahui, L & Birnbaum, LA 2007, Measuring semantic similarity between named entities by searching the Web directory. in Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence, WI 2007., 4427135, Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence, WI 2007, pp. 461-465, IEEE/WIC/ACM International Conference on Web Intelligence, WI 2007, Silicon Valley, CA, United States, 11/2/07. https://doi.org/10.1109/WI.2007.75

Measuring semantic similarity between named entities by searching the Web directory. / Jiahui, Liu; Birnbaum, Lawrence A.

Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence, WI 2007. 2007. p. 461-465 4427135 (Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence, WI 2007).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - Measuring semantic similarity between named entities by searching the Web directory

AU - Jiahui, Liu

AU - Birnbaum, Lawrence A

PY - 2007/12/1

Y1 - 2007/12/1

N2 - The importance of named entities in information retrieval and knowledge management has recently brought interest in characterizing semantic relationships between entities. In this paper, we propose a method for measuring semantic similarity, an important type of semantic relationship, between entities. The method is based on Google Directory, a search interface to the Open Directory Project. Via the search engine, we can locate the web pages relevant to an entity and automatically create a profile of the entity according to the directory assignments of its web pages, which capture various features of the entity. Using their profiles, the semantic similarity between entities can be measured in different dimensions. We apply the semantic similarity measurement to two knowledge acquisition tasks: thesaurus construction of entities and fine grained categorization of entities. Our experiments demonstrate that the proposed method works effectively in these two tasks.

AB - The importance of named entities in information retrieval and knowledge management has recently brought interest in characterizing semantic relationships between entities. In this paper, we propose a method for measuring semantic similarity, an important type of semantic relationship, between entities. The method is based on Google Directory, a search interface to the Open Directory Project. Via the search engine, we can locate the web pages relevant to an entity and automatically create a profile of the entity according to the directory assignments of its web pages, which capture various features of the entity. Using their profiles, the semantic similarity between entities can be measured in different dimensions. We apply the semantic similarity measurement to two knowledge acquisition tasks: thesaurus construction of entities and fine grained categorization of entities. Our experiments demonstrate that the proposed method works effectively in these two tasks.

UR - http://www.scopus.com/inward/record.url?scp=48349136698&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=48349136698&partnerID=8YFLogxK

U2 - 10.1109/WI.2007.75

DO - 10.1109/WI.2007.75

M3 - Conference contribution

AN - SCOPUS:48349136698

SN - 0769530265

SN - 9780769530260

T3 - Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence, WI 2007

SP - 461

EP - 465

BT - Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence, WI 2007

ER -

Jiahui L, Birnbaum LA. Measuring semantic similarity between named entities by searching the Web directory. In Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence, WI 2007. 2007. p. 461-465. 4427135. (Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence, WI 2007). https://doi.org/10.1109/WI.2007.75