Improving Distantly-supervised Entity Typing with Compact Latent Space Clustering

Chen, Bo; Gu, Xiaotao; Hu, Yufeng; Tang, Siliang; Hu, Guoping; Zhuang, Yueting; Ren, Xiang

Computer Science > Computation and Language

arXiv:1904.06475 (cs)

[Submitted on 13 Apr 2019]

Abstract: Recently, distant supervision has achieved great success in Fine-grained Entity Typing (FET). Although it efficiently reduces manual labeling effort, it also brings the challenge of dealing with false entity type labels, because distant supervision assigns labels in a context-agnostic manner. Existing work alleviates this issue with partial-label loss but usually suffers from confirmation bias: the classifier fits a pseudo data distribution that it produced itself. In this work, we propose to regularize distantly supervised models with Compact Latent Space Clustering (CLSC) to bypass this problem while still making effective use of noisy data. Our proposed method first dynamically constructs a similarity graph over different entity mentions, then infers the labels of noisy instances via label propagation. Based on the inferred labels, mention embeddings are updated to encourage entity mentions with close semantics to form a compact cluster in the embedding space, thus leading to better classification performance. Extensive experiments on standard benchmarks show that our CLSC model consistently outperforms state-of-the-art distantly supervised entity typing systems by a significant margin.
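The graph-and-propagation step described in the abstract can be illustrated with a generic sketch. This is not the authors' implementation: the cosine similarity, the exponential kernel, the `temperature` value, the dense graph, and the function names `cosine` and `propagate_labels` are all assumptions chosen for illustration, and the embedding-update (clustering) step is omitted.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def propagate_labels(embeddings, labels, num_types, steps=20, temperature=0.1):
    """Generic label propagation over a dense similarity graph (illustrative only).

    labels[i] is a type index for a trusted mention, or None for a noisy one.
    Returns one predicted type index per mention.
    """
    n = len(embeddings)
    # Edge weights: exponential kernel over cosine similarity, no self-loops.
    # (The kernel and temperature are assumptions, not taken from the paper.)
    weights = [
        [0.0 if i == j else math.exp(cosine(embeddings[i], embeddings[j]) / temperature)
         for j in range(n)]
        for i in range(n)
    ]
    # Row-normalise the weights into transition probabilities.
    trans = []
    for row in weights:
        total = sum(row) or 1.0  # guard for a single-node graph
        trans.append([w / total for w in row])

    def initial(label):
        # Trusted mentions start one-hot; noisy mentions start uniform.
        if label is None:
            return [1.0 / num_types] * num_types
        return [1.0 if t == label else 0.0 for t in range(num_types)]

    dist = [initial(label) for label in labels]
    for _ in range(steps):
        spread = [
            [sum(trans[i][j] * dist[j][t] for j in range(n)) for t in range(num_types)]
            for i in range(n)
        ]
        # Clamp trusted mentions back to their given label after every step.
        dist = [initial(labels[i]) if labels[i] is not None else spread[i]
                for i in range(n)]
    return [max(range(num_types), key=lambda t: d[t]) for d in dist]


# Toy data: two trusted mentions and two noisy ones in a 2-d embedding space.
toy_embeddings = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
toy_labels = [0, None, 1, None]
predicted = propagate_labels(toy_embeddings, toy_labels, num_types=2)
```

In this toy setup each noisy mention sits close to exactly one trusted mention, so it inherits that neighbor's type. The sketch only shows how labels can flow along similarity edges; how the graph is built and how the inferred labels shape the embedding space are details of the paper itself.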

Comments:	accepted by NAACL-HLT 2019
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:1904.06475 [cs.CL]
	(or arXiv:1904.06475v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1904.06475

Submission history

From: Bo Chen [view email]
[v1] Sat, 13 Apr 2019 03:52:56 UTC (4,248 KB)
