Data-driven multi-modal corpus construction
August 14, 2024 16:08 Source: "Chinese Social Sciences Journal" Issue 2955, August 14, 2024 Author: Chen Yajing

August 10,2024 Corpus Construction and Application Seminar was held in Beijing。

Participating scholars focused on cutting-edge issues in corpus linguistics,Let’s discuss data-driven language research、Innovative applications and future prospects。Proposed by Gu Yueguo, a researcher at the Institute of Languages, Chinese Academy of Social Sciences,Corpus linguistics should start from live experience,People-centered,Bet365 lotto review Corpus linguistics is not only a methodology,It is an important branch of linguistics,The ultimate goal is to understand people through the study of language。Based on this,He proposed two propositions for corpus construction: the first is the principle of linguistic facts,That is, natural spontaneous corpus should become the basis of the corpus;The second is the principle of man-made ultimate purpose,Emphasize that the construction of the corpus must serve specific research purposes。

With the continuous expansion of data scale,Corpus-based discourse analysis researcher,Faced with how to use Bet365 app download new computing technologies to process large-scale data、How to dig out the attitude meaning hidden under the surface of the proposition in the local context、A series of challenges such as how to analyze precise and delicate tissues。For this,Wei Naixing, a professor at the School of Foreign Languages, Beihang University, thinks,There is an urgent need to improve current data processing technology and improve language analysis tools to solve the above problems。At the same time,Intelligent analysis technology brings convenience but also has problems of randomness and arbitrariness caused by algorithm limitations,bet365 Play online games Linguists always need to pay attention to the reading of real texts,And further debug and intervene in intelligent technology based on specific research questions。

The conference is organized by the Corpus and Computational Linguistics Research Center of the Institute of Linguistics, Chinese Academy of Social Sciences、Co-sponsored by the China Foreign Languages ​​and Education Research Center of Beijing Foreign Studies University。

Editor: Cui Bohan
QR code icon 2.jpg
Highly recommended
Latest article
bet365 live casino games

Friendly links:

Website registration number: Beijing Public Network Security No. 11010502030146 bet365 Play online games Ministry of Industry and Information Technology:

All rights reserved by China Social Sciences Magazine. No reproduction or use without permission is allowed

Chief editor’s email: zzszbj@126.com Contact information of this website: 010-85886809 Address: Floor 11-12, Building 1, No. 15 Guanghua Road, Chaoyang District, Beijing Postal Code: 100026