A multi-domain sentiment classification model based on sample filtering and transfer learning

QU Zhaowei; ZHAO Yanjiao; WANG Xiaoru

doi:10.3969/j.issn.0253-2778.2019.01.002

JUSTC > 2019 > 49(1): 8-14. > DOI: 10.3969/j.issn.0253-2778.2019.01.002 CSTR: 32290.14.j.issn.0253-2778.2019.01.002

PDF (3370 KB)

Open Access JUSTC

A multi-domain sentiment classification model based on sample filtering and transfer learning

School of Computer Science and Technology, Beijing University of Posts and Telecommunications, Beijing 100876, China

Cite this: JUSTC, 2019, 49(1): 8-14

https://doi.org/10.3969/j.issn.0253-2778.2019.01.002

CSTR: 32290.14.j.issn.0253-2778.2019.01.002

More Information

Received Date: May 28, 2018
Revised Date: September 17, 2018
Published Date: January 30, 2019

Full text PDF

Abstract

Abstract

Most of the models for sentiment classification are trained and tested on a single dataset. However, the model parameters obtained by training on one dataset are not suitable for another dataset and the model is not generic. A multi-domain sentiment classification model (MDSC) was proposed. With sample filtering and transfer learning, the trained model can be applied to different datasets in multiple domains and the model is more applicable and expandable. Specifically, a document is first mapped to the domain distribution which is used as a bridge between domain classification and sentiment classification, and then sentiment classification is completed. In order to make the model more generic, representative data samples should be selected. MDSC constructs a domain-independent sentiment lexicon to filter sentences that belong to the same document and obtain a high-quality training dataset. At the same time, to improve the classification accuracy and reduce the training time, parameter-based transfer learning with neutral networks is used to obtain the document embeddings for classification. Extensive experiments on datasets containing 15 different domains show that the proposed model can achieve better performance compared with traditional models when applied to datasets in multiple domains.

Abstract

Most of the models for sentiment classification are trained and tested on a single dataset. However, the model parameters obtained by training on one dataset are not suitable for another dataset and the model is not generic. A multi-domain sentiment classification model (MDSC) was proposed. With sample filtering and transfer learning, the trained model can be applied to different datasets in multiple domains and the model is more applicable and expandable. Specifically, a document is first mapped to the domain distribution which is used as a bridge between domain classification and sentiment classification, and then sentiment classification is completed. In order to make the model more generic, representative data samples should be selected. MDSC constructs a domain-independent sentiment lexicon to filter sentences that belong to the same document and obtain a high-quality training dataset. At the same time, to improve the classification accuracy and reduce the training time, parameter-based transfer learning with neutral networks is used to obtain the document embeddings for classification. Extensive experiments on datasets containing 15 different domains show that the proposed model can achieve better performance compared with traditional models when applied to datasets in multiple domains.

FullText(HTML)

References (0)

Cited By

Track Citations

Get Citation

{{if article.articleBusiness.pdfLink && article.articleBusiness.pdfLink != ''}} {{else}} {{/if}}PDF

XML

[1]	Xiangrui Meng, Minggen He, Zhensheng Yuan. Pure state tomography with adaptive Pauli measurements[J]. JUSTC, 2022, 52(8): 1-1-1-6. DOI: 10.52396/JUSTC-2022-0037
[2]	Yi Yin, Weiming Zhang, Nenghai Yu, Kejiang Chen. Steganalysis of neural networks based on parameter statistical bias[J]. JUSTC, 2022, 52(1): 1-1-1-12. DOI: 10.52396/JUSTC-2021-0197
[3]	SUI Hongjian, SHANG Weiwei, LI Xiang, CONG Shuang. Robot control policy transfer based on progressive neural network[J]. JUSTC, 2019, 49(10): 812-819. DOI: 10.3969/j.issn.0253-2778.2019.10.006
[4]	YANG Ziwen, CHEN Lei, PU Jianyu. Recognizing emotions from abstract paintings using convolutional neural network with two-layer transfer learning scheme[J]. JUSTC, 2019, 49(1): 40-48. DOI: 10.3969/j.issn.0253-2778.2019.01.006
[5]	LONG Aoming, BI Xiuchun, ZHANG Shuguang. An arbitrage strategy model for ferrous metal futures based on LSTM neural network[J]. JUSTC, 2018, 48(2): 125-132. DOI: 10.3969/j.issn.0253-2778.2018.02.006
[6]	CHANG Xinzhuo, YANG Kaizhong, LI Xin, SHEN Hongxin, LI Hengnian. Localized atmospheric density prediction method based on NARX neural network[J]. JUSTC, 2017, 47(12): 1015. DOI: 10.3969/j.issn.0253-2778.2017.12.007
[7]	ZHANG Guangbin, ZHANG Runmei. Life span prediction of Huizhou architecture based on improved Elman neural network[J]. JUSTC, 2017, 47(10): 817-822. DOI: 10.3969/j.issn.0253-2778.2017.10.003
[8]	PAN Qingxian, DONG Hongbin, HAN Qilong, WANG Yingjie, DING Rui. A computing method for attribute importance based on BP neural network[J]. JUSTC, 2017, 47(1): 18-25. DOI: 10.3969/j.issn.0253-2778.2017.01.003
[9]	ZHANG Li, LU Xingning, LU Conglin, WANG bangjun, LI Fanzhang. National matriculation test prediction based on support vector machines[J]. JUSTC, 2017, 47(1): 1-9. DOI: 10.3969/j.issn.0253-2778.2017.01.001
[10]	ZHU Shunzhi, WANG Dahan, HE Yanan, WNAG Yan. Hospital outpatient visit analysis and forecasting using time series models[J]. JUSTC, 2015, 45(10): 795-803. DOI: 10.3969/j.issn.0253-2778.2015.10.001

TrendMD

Volume 49 Issue 1 PP. 8-14

Cover

Keywords

Article Metrics

Article views (222) PDF downloads (648)

A multi-domain sentiment classification model based on sample filtering and transfer learning

Abstract

Abstract

Related Articles

Catalog

Related Articles

TrendMD

Article Metrics

Authors

Browse

Contact Us

About

A multi-domain sentiment classification model based on sample filtering and transfer learning

Share

Tools

Abstract

Abstract

Related Articles

Catalog

Related Articles

TrendMD

Article Metrics

Authors

Browse

Contact Us

About

Export File

Citation

Format

Content