On Disambiguating Authors: Collaboration Network Reconstruction in a Bottom-up Manner

Li, Na; Zhu, Renyu; Zhou, Xiaoxu; He, Xiangnan; Cai, Wenyuan; Gao, Ming; Zhou, Aoying

Abstract:Author disambiguation arises when different authors share the same name, which is a critical task in digital libraries, such as DBLP, CiteULike, CiteSeerX, etc. While the state-of-the-art methods have developed various paper embedding-based methods performing in a top-down manner, they primarily focus on the ego-network of a target name and overlook the low-quality collaborative relations existed in the ego-network. Thus, these methods can be suboptimal for disambiguating authors.
In this paper, we model the author disambiguation as a collaboration network reconstruction problem, and propose an incremental and unsupervised author disambiguation method, namely IUAD, which performs in a bottom-up manner. Initially, we build a stable collaboration network based on stable collaborative relations. To further improve the recall, we build a probabilistic generative model to reconstruct the complete collaboration network. In addition, for newly published papers, we can incrementally judge who publish them via only computing the posterior probabilities. We have conducted extensive experiments on a large-scale DBLP dataset to evaluate IUAD. The experimental results demonstrate that IUAD not only achieves the promising performance, but also outperforms comparable baselines significantly. Codes are available at this https URL.

Subjects:	Information Retrieval (cs.IR)
Cite as:	arXiv:2011.14333 [cs.IR]
	(or arXiv:2011.14333v1 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2011.14333

Computer Science > Information Retrieval

Title:On Disambiguating Authors: Collaboration Network Reconstruction in a Bottom-up Manner

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators