GIANT: Scalable Creation of a Web-scale Ontology

Liu, Bang; Guo, Weidong; Niu, Di; Luo, Jinwen; Wang, Chaoyue; Wen, Zhen; Xu, Yu

doi:10.1145/3318464.3386145

Computer Science > Computation and Language

arXiv:2004.02118 (cs)

[Submitted on 5 Apr 2020]

Title:GIANT: Scalable Creation of a Web-scale Ontology

Authors:Bang Liu, Weidong Guo, Di Niu, Jinwen Luo, Chaoyue Wang, Zhen Wen, Yu Xu

View PDF

Abstract:Understanding what online users may pay attention to is key to content recommendation and search services. These services will benefit from a highly structured and web-scale ontology of entities, concepts, events, topics and categories. While existing knowledge bases and taxonomies embody a large volume of entities and categories, we argue that they fail to discover properly grained concepts, events and topics in the language style of online population. Neither is a logically structured ontology maintained among these notions. In this paper, we present GIANT, a mechanism to construct a user-centered, web-scale, structured ontology, containing a large number of natural language phrases conforming to user attentions at various granularities, mined from a vast volume of web documents and search click graphs. Various types of edges are also constructed to maintain a hierarchy in the ontology. We present our graph-neural-network-based techniques used in GIANT, and evaluate the proposed methods as compared to a variety of baselines. GIANT has produced the Attention Ontology, which has been deployed in various Tencent applications involving over a billion users. Online A/B testing performed on Tencent QQ Browser shows that Attention Ontology can significantly improve click-through rates in news recommendation.

Comments:	Accepted as full paper by SIGMOD 2020
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2004.02118 [cs.CL]
	(or arXiv:2004.02118v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2004.02118
Related DOI:	https://doi.org/10.1145/3318464.3386145

Submission history

From: Bang Liu [view email]
[v1] Sun, 5 Apr 2020 07:51:23 UTC (1,339 KB)

Computer Science > Computation and Language

Title:GIANT: Scalable Creation of a Web-scale Ontology

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:GIANT: Scalable Creation of a Web-scale Ontology

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators