Multi-Domain Transformer-Based Counterfactual Augmentation for Earnings Call Analysis

Yuan, Zixuan; Zhu, Yada; Zhang, Wei; Huang, Ziming; Ye, Guangnan; Xiong, Hui

Computer Science > Machine Learning

arXiv:2112.00963 (cs)

[Submitted on 2 Dec 2021 (v1), last revised 3 Dec 2021 (this version, v2)]

Title:Multi-Domain Transformer-Based Counterfactual Augmentation for Earnings Call Analysis

Authors:Zixuan Yuan, Yada Zhu, Wei Zhang, Ziming Huang, Guangnan Ye, Hui Xiong

View PDF

Abstract:Earnings call (EC), as a periodic teleconference of a publicly-traded company, has been extensively studied as an essential market indicator because of its high analytical value in corporate fundamentals. The recent emergence of deep learning techniques has shown great promise in creating automated pipelines to benefit the EC-supported financial applications. However, these methods presume all included contents to be informative without refining valuable semantics from long-text transcript and suffer from EC scarcity issue. Meanwhile, these black-box methods possess inherent difficulties in providing human-understandable explanations. To this end, in this paper, we propose a Multi-Domain Transformer-Based Counterfactual Augmentation, named MTCA, to address the above problems. Specifically, we first propose a transformer-based EC encoder to attentively quantify the task-inspired significance of critical EC content for market inference. Then, a multi-domain counterfactual learning framework is developed to evaluate the gradient-based variations after we perturb limited EC informative texts with plentiful cross-domain documents, enabling MTCA to perform unsupervised data augmentation. As a bonus, we discover a way to use non-training data as instance-based explanations for which we show the result with case studies. Extensive experiments on the real-world financial datasets demonstrate the effectiveness of interpretable MTCA for improving the volatility evaluation ability of the state-of-the-art by 14.2\% in accuracy.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computational Engineering, Finance, and Science (cs.CE); Information Retrieval (cs.IR)
Cite as:	arXiv:2112.00963 [cs.LG]
	(or arXiv:2112.00963v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2112.00963

Submission history

From: Zixuan Yuan [view email]
[v1] Thu, 2 Dec 2021 03:40:17 UTC (2,487 KB)
[v2] Fri, 3 Dec 2021 05:12:05 UTC (2,487 KB)

🚨2024-09-29: arxiv.org is experience DB issues. The announce tonight will be 3 hours later than usual.🚨

Computer Science > Machine Learning

Title:Multi-Domain Transformer-Based Counterfactual Augmentation for Earnings Call Analysis

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

🚨2024-09-29: arxiv.org is experience DB issues. The announce tonight will be 3 hours later than usual.🚨

Computer Science > Machine Learning

Title:Multi-Domain Transformer-Based Counterfactual Augmentation for Earnings Call Analysis

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators