【摘 要】
:
Nowadays,research on stylistic features(SF)mainly focuses on two aspects: lexical elements and syntactic structures.The lexical elements act as the content of a sentence and the syntactic structures c
【机 构】
:
Tsinghua University,Beijing,China
【出 处】
:
第十七届全国计算语言学学术会议暨第六届基于自然标注大数据的自然语言处理国际学术研讨会(CCL 2018)
论文部分内容阅读
Nowadays,research on stylistic features(SF)mainly focuses on two aspects: lexical elements and syntactic structures.The lexical elements act as the content of a sentence and the syntactic structures constitute the framework of a sentence.How to combine both aspects and exploit their common advantages is a challenging issue.In this paper,we propose a Principal Stylistic Features Analysis method(PSFA)to combine these two parts,and then mine the relations between features.From a statistical analysis point of view,many interesting linguistic phenomena can be found.Through the PSFA method,we finally extract some representative features which cover different aspects of styles.To verify the performance of these selected features,classification experiments are conducted.The results show that the elements selected by the PSFA method provide a significantly higher classification accuracy than other advanced methods.
其他文献
Relation extraction is an important semantic processing task in natu-ral language processing.The state-of-the-art systems usually rely on elaborately designed features,which are usually time-consuming
The analysis and understanding of spoken texts is an impor-tant task in artificial intelligence and natural language processing.How-ever,there are many verbose expressions(such as mantras,nonsense,mod
A Bi-LSTM based encode/decode mechanism for named entity recognition was studied in this research.In the proposed mechanism,Bi-LSTM was used for encoding,an Attention method was used in the intermedia
Relation extraction is an important part of many information extrac-tion systems that mines structured facts from texts.Recently,deep learning has achieved good results in relation extraction.Attentio
Using sequence-to-sequence models for abstractive text sum-marization is generally plagued by three problems: inability to deal with out-of-vocabulary words,repetition in summaries and time-consuming
As an essential sub-task of frame-semantic parsing,Frame Identifica-tion(FI)is a fundamentally important research topic in shallow semantic pars-ing.However,most existing work is based on sophisticate
The evaluation of word embeddings has received a considerable amount of attention in recent years,but there have been some debates about whether intrinsic measures can predict the performance of downs
Task-oriented dialog systems usually face the challenge of querying knowledge base.However,it usually cannot be explicitly modeled due to the lack of annotation.In this paper,we introduce an explicit
At present,the research on Tibetan machine translation is mainly fo-cused on Tibetan-Chinese machine translation and the research on Chinese-Tibetan machine translation is almost blank.In this paper,t
Extracting term translation pairs is of great help for Chinese histori-cal classics translation since term translation is the most time-consuming and challenging part in the translation of historical