Digital resource activation Oracle Study
July 20, 2023 08:31 Source: "Chinese Social Sciences", July 20, 2023, Issue 2695, Issue 2695

Oracle contains thick Chinese cultural genes and long -distance cultural root veins,It is an unable to renewable resource to explore Chinese civilization。Oracle multi -level focuses on the history of ancient Chinese social development from 3500 to 3000 years ago,It provides the first -hand authentic historical material for the construction of the Chinese civilization exploration source and the ancient Chinese history system。General Secretary Xi Jinping pointed out,China Civilization Source Project "" To strengthen overall planning and scientific layout,Adhere to multi -disciplinary、Multi -angle、Multi -layered、Comprehensive,History and history、Joint research on humanities science and natural science,Greeprait research and space scope and coverage area,Further answer the origin of Chinese civilization、Formed、Basic pictures of development、Internal mechanism and the evolution path of civilization in various regions "。Researcher of Oracle as a new era,Should be Chongwen Jian History、Use the world to use,Serving the country's major strategy、Demand for the development of national cultural development,calm down to keep the academic conscience、bottom line and scientific spirit,To build a disciplinary system for the construction of Chinese civilization、The academic system and discourse system can do something。

  bet365 best casino games

October 30, 2017,UNESCO announced Oracle selected "World Memory List"。This is a long -awaited selection,For a long time,Oracle continues to be affected by the country、Government、The attention and protection of society and individuals。Oracle is found since being discovered in 1899,Oracle bones unearthed in 124 have flowed to all over the world and have been collected by nearly 100 collectors。As a unique valuable historical material,From Wang Yirong、Liu Ye、Dong Zuobin,to Luo Zhenyu、Wang Guowei、Dong Zuobin、Guo Moruo ("Oracle Four Halls"),Then to Tanglan、Rong Geng、Ke Changji、Shang Chengyi ("Four Oracle") and Wang Xiang、Yuyu、Chen Mengjia、Hu Houxuan and other generations of former scholars go forward and work unremitting efforts,Oracle has aroused great attention and in -depth exploration of academic circles at home and abroad,Become an international manifestation。For Oracle scattered around the world,Comprehensive professional finishing work is the most critical。As early as 1984,Hu Houxuan once pointed out that the total number of Oracle unearthed about 150,000 tablets,Our statistics have exceeded 160,000 pieces。In the middle and late 20th century,Milestrokes, such as "Oracle Collection" and "Oracle Collection Supplementary Compilation", etc.,Created a good condition for promoting Oracle and Oracle Research。Hu Houxuan mentioned more than 40 years ago,Oracle Article Khan Niu Dongzong,Scattered on various publications,It is not easy to find information,Could it be a book about Oracle Research Documents。Unfortunately,,This wish did not realize until his death。Inheriting Hu Houxuan's legacy,early 21st century,Under the leadership of Researcher Song Zhenhao at the Institute of Ancient History of the Chinese Academy of Social Sciences,Finally compiled 40 volumes of "Oracle Literature Integration",Sisters known as the 13 volumes of "Oracle Collection",Research results for the former gathering,The latter general exchange research material,provides basic academic information Bet365 lotto review for oracle scholarship research。

Taking the Institute of Ancient History of the Chinese Academy of Social Sciences,I am writing "Three Editor of Oracle Collection",Collection "Oracle Collection" and "Oracle Collection Supplement" omissions and the Oracle of Oracle,Replenishment and replenishment of some of the public and private households,A total of more than 30,000 pieces of oracle bones,It will provide a new collection of large -scale armor records for the academic community; in recent years, I have compiled the travel blog、Jinbo、Shandong Bo、Chongqing Three Gorges Bo、Russian Winter Palace、Local Collection and other 15 batches of more than 20,000 pieces of oracle bone collections。According to statistics,There are 11 people with more than 1700 pieces of Oracle in public units in the country,There are 13 in 200-900 pieces,60-199 pieces have 12 pieces。Of these 36,12 have been sorted out or there are 12 people who are being sorted out,accounting for 33.3%; 5 people who are not in place,accounted for 13.9%; 19 companies to be sorted out,52.8%。So,We deeply feel that it is necessary to comprehensively start the holographic collation of Oracle from the national level,Implementation of Oracle Heritage Rescue Protection Measures,Comprehensively promote the scientific research of Oracle、Cultural Communication、Historical Education,Promote the in -depth exploration of the root of Chinese civilization。

 "Yin Qi Wenyuan" promotes the digitalization of Oracle

Oracle is fragile,Surface loose powder and damage are more common,Save、​​Show、Utilization is not easy。Because nearly 160,000 pieces of oracle bones are collected in museums at home and abroad、Library、Scientific research institutions、University and other at least 174 institutions,Can't re -concentrate on the "physical form" to study,And Oracle digital service resource construction,Especially with big data、Artificial Intelligence is a technical foundation Oracle Digital Project,The original information of the oracle bone and its bearing text can be preserved to the greatest extent。

The first task of the construction of Oracle digital service resource construction is to bring Oracle's material、Tools Letter、Research Document Digitalization,The core work is the construction of the database,Able to remove oracle bone texts or pictures,and can meet various retrieval needs。Multiple Oracle databases have been built at home and abroad,If the Hangda Ancient Books Database Expressing System developed by the Chinese University of Hong Kong,Including 7 major large -scale oracle bone books at home and abroad such as "Oracle Collection Interpretation" and "Tibetan Orthodile Collection in the UK"。The "Oracle World" database developed by the National Library of China Collection 5932 Oracle Photos、Top film 3177。other,"Academia Sinica" in Taiwan、East China Normal University、Institutes of Toyo Culture, the University of Tokyo, Japan have also developed several Oracle databases,Database developed by individuals such as the website of Guoxue Master website can also be used publicly。

Although the current Oracle digital service resource construction has achieved a lot of results,To a large extent, it is also convenient for Oracle research,But the integrity of Orthodox data as a whole、The degree of normative and correlation is not high,In particular, there are problems with poor multiple retrieval efficiency of users。

Oracle's hometown in Yinxu, Anyang, Henan Province。Some scholars said in the 1980s,Oracle is distributed around the world,I hope that one day those oracle bones that are flowing in foreign countries can return to their homeland。Let all the oracle bone cultural relics flowing bet365 Play online games back to the homeland may not be unrealistic,But through big data,All the digital Oracle texts are concentrated in Anyang,It is still possible to achieve。Out of such a vision,Oracle Information Treatment of Oracle Information Treatment of Anyang Teachers College with the Key Laboratory of the Ministry of Education and the Oracle and Shang History Research Center of the Chinese Academy of Social Sciences,"Yin Qi Wenyuan" was launched in October 2019,Use big data、Digitalization of Oracle Calcus、Intelligent,Covisons to build Oracle Big Data Platform。This platform includes "Sanku and One Platform",Record library、glyph library and literature library,and Oracle Knowledge Service Platform,There are 153 types of nail bones、Image 239289、Oracle 4000 Most Words、Academic theory 34234 species,and still constantly update。Signing through the information of multi -dimensional dimensions,Implementation and glyph、Glyphs and related reference books、Huanya、Literature and other multi -functional correlations,Solved the Oracle Bone Bone Bone Books due to the difficulty of input difficulty and the tedious labeling of the information、Difficulties in the large -scale sharing and promotion of literature resources。At the same time,Platform is not only open to the world for free,It also provides a dedicated public data set and various information resource integration services used by various artificial intelligence technology research institutes,The fourth phase of the R & D and construction is currently undergoing the fourth phase。

  Digital Wisdom Energy Oracle Protection and Heritage

With the continuous advancement of oracle scholarship research,A large amount of Oracle Knowledge Data was generated,Top films such as recorded、Photo、copies,Especially the three -dimensional oracle data that has appeared in recent years,Oracle -shaped head、Word、Alien shape,and a large number of Oracle Research Documents,These multi -dimensional、Multi -mode data is an important information for Oracle Research,It is also the basis for the data of Oracle information processing research in the new era。Here is the mainly recorded image of Oracle、Oracle shape、Digitalization of Oracle Research Documents、Intelligent applications to sort out,The purpose of promoting the organicity of Oracle Innovation and helping to achieve new breakthroughs in Oracle research。

Word detection and recognition in the recording of Oracle。Digitalized work on existing nail bones,The first thing to solve is the detection and recognition of oracle text,It is the basis for the automatic processing of the computer to process oracle image data。On the one hand,Digitalization of existing nail bones can improve the research efficiency of oracle scholarship experts,Especially the appropriate search technology (including search graph、Search for words、Searching for Figures) can improve the efficiency of scholars' query of the literature; on the other hand,,Using computer vision analysis technology,Test and recognize the oracle text in the image of the nail bone,Not only the process of accelerating the digitalization of Oracle literature,You can also study other ancient words、Oracle Culture Promotion and Communication will help。

Traditional Oracle text recognition method,Generally divided into feature extraction and feature classification。The purpose of the feature extraction is to obtain the unique features of Oracle text images,Characteristic classification judges which oracle word belongs to the extraordinary features。Common feature extraction methods are: non -changing feature transformation、bet365 Play online games The direction gradient diagram、Gabor、Local two -value mode, etc.。The most common feature classifier is to support vector machine。It can be seen from the traditional Oracle text detection method,Feature detection and feature recognition designs have a strong dependence on algorithm designers,Choose different features and identifier,The identification effect is very different。This is also a problem with traditional image recognition。

In recent years,The recognition technology based on deep neural network has achieved great development。This technology does not require manual selection features,The recognition results that can be available to the end -to -end。The more representative results are a layered -based Oracle text recognition method proposed by the Microsoft Asian Research Institute.。

Oracle Information Treatment of Oracle Information Treatment of Anyang Teachers College and the Key Laboratory of the Ministry of Education with the Oracle Top Data Set OBC306 released by South China University of Technology,The recognition rate has been improved for different convolutional neural networks (CNN); Liu Chenglin team proposed by Liu Chenglin team of the Institute of Automation of the Chinese Academy of Sciences,Use the copy of the copy of the copy to assist the shaped method of the fonts; Yang Zhengfeng, East China Normal University, etc.,The accuracy accuracy rate of identification is obtained at the self -built data set OBI100; West Jiaotong University Wang Qiufeng and others proposed a hybrid increase in Oracle text recognition; Wide method; Meng Lin University of Japan Liming Museum proposed a dynamic data increase method in the self -built data set OBI125。Compared with traditional Oracle text recognition technology,These methods have made very obvious progress。

Although the effects of various identification methods seem good,But most of them only selected 100-300 font categories with higher glyphs,and the identification object is a pioneer oracle word with a relatively large number of samples。So focus on consideration,Extreme uneven phenomenon of the distribution of Oracle data distribution. The Oracle recognition model caused by the low -word frequency category recognition performance is not good。For this,Let's join forces with Tencent Company,On the basis of "Yin Qi Wenyuan" Oracle data labeling and processing,Customized algorithm,Continuously enrich and improve the Oracle Model Library,As of now, it has established the world's largest Oracle single -word database with 1.43 million words,Improve Oracle recognition and test interpretation、Oracle on the efficiency of content extraction, etc.。

Oracle Coding and Input Method Application。Oracle is already a relatively mature text system,But because it has no standard strokes、Alien characters more、A large number of unprepared characters and pronunciation cannot be known,Computer input to realize Oracle faces a lot of challenges。Coding and glyphs of Oracle characters have always been the focus of Oracle research,It is also one of the key issues of the Oracle Digital Project。From the coding implementation plan of Oracle,The corresponding coding of modern Chinese characters,still use the PRIVATE USE AREA interval of the Unicode space for re -coding,Nothing can completely solve the vitamins in Oracle and the dynamic increase in the formation of the Oracle shape with research、Questions of changes。So,The problem that needs to be solved urgently is the basic glyph standard of the oracle character,and implement Oracle to enter the international Unicode encoding work,After passing the international standard review,Fixed its position in the unicode coding space,Construction for the Oracle Font Library、bet365 Play online games Input method and digital publication work lays the foundation。

Oracle Input Method is the basis for the digital editing of Oracle。For the current use,Not limited to personal computer use,More reflected in the text display based on the web page and the digital publishing business based on publishing editing。Oracle text and modern Chinese characters are very different,The input method of Oracle is facing great challenges。The current feasible solution is to prepare a concise and easy -to -use encoding table,Pinyin, which is more commonly used by experts and scholars、Code、Handwriting、Oracle input method of multiple dimensions such as visualization to analyze,Each has its own characteristics。However, we believe that with the digitalization of the Oracle, it continues to deepen,Unified encoding standards of Oracle Once established,Can be available、easy to use、enough Oracle input method will be perfect,It can also make Oracle really "live"。

Fragmentation and heterogeneity of Oracle Research Documents。Oracle literature is the most complicated literature in all literature,in layout、Text、Images and other aspects are extremely challenging。Current,"Yin Qi Wenyuan" research team has included more than 120 years of Oracle -related research documents 34234 articles,and achieved the title on the digital platform、Summary、Author、Record information of keywords and other questions to retrieve the literature and the corresponding PDF format document download and other functions,But the full text retrieval and image retrieval can not be achieved。

With the deepening of Oracle research,Only through the title of the article、Author or keywords to find an isolated article in the database can no longer meet the deepening of the needs of oracle scholarship。Intelligent retrieval dominated by knowledge map technology、Related push and other knowledge services are the demands of more oracle scholars at present,But the existing Oracle database is generally unable to extract the content information of the literature from the literature mainly scanned pictures,Need to process these scanned pictures deep -level processing。Specifically,The transformation of the literary scanning picture into the text、Picture、Non -structured data composed of heterogeneous data such as chart,and disassembled it into a fine particle size information unit based on the content as the unit,The XML document formed by the final formation of heterogeneous data。

Different from modern documents is,Oracle literature published before the founding of New China is affected by backward printing technology restrictions and the impact of writing rules for the New Culture Movement,Usually there is no unified typesetting method、Use word specifications and punctuation symbol use rules,This causes conventional fragmented tools to be directly applied to Oracle literature。other,Different picture data of articles illustrates in heterogeneous data of modern documents,A remote characters often appear in Oracle Literature、Dingzi characters and ancient text such as existing characters recognition technology that cannot be effectively identified is not yet effectively identified,These glyphs also need to be saved in the form of picture data。So,In the heterogeneous data structure of Oracle Literature,Picture data is much higher than modern literature。High -proportion of picture data compilation requirements also lead to a double increase in the difficulty of organizing oracle li literature。Current,The heterogeneous processing of Oracle Literature is basically manually entered,Use OCR tools to assist recognition in some articles that do not involve Oracle.bet365 best casino games ,But the overall progress of the literature is slow,Only a small part of the articles have achieved heterogeneity。

Digitalization of Oracle Literature provides a computer for the computer、Digital material for association and analysis,To achieve the convenience of oracle scholarship research、Intelligence laid the foundation,The use of artificial intelligence technology for Oracle literature has also become a future development trend。other,Digitalization technology of literature can also provide a series of intelligent services for researchers and Oracle enthusiasts Bet365 lotto review for oracle scholarship,If the picture handwritten oracle word recognition、Tushing characters associated information retrieval,Constantly expand the breadth and depth of oracle science research。

Targeting automatic heterogeneity processing with oracle bone literature,Use artificial intelligence technology to analyze document analysis and character recognition of literature pictures,According to the requirements of organizing,Identification of the heterogeneous data types of the contents of each part of the literature,extract it into text or pictures and other heterogeneous data,and store in the database in XML format。This organizational method is not only applicable to the collation of Oracle literature,It can also be promoted to all deep processing tasks involved in ancient text literature。In the heterogeneity of literature、On the basis of knowledgeized processing,A combination of the Oracle Bone Text Library and Record Library,Realize the connection between the three libraries,and provide intelligent retrieval service based on knowledge reasoning based on the content semantic information of the extracted content。

Oracle Full Information Data Model and Oracle Digital Revitalization。Systematic research by the Oracle Collection Agency and the research institution,We found that Oracle Digitalization faces two major problems: one is how to achieve the high -fidelity number reduction of the Oracle "physical"; the other is how to achieve the high -efficiency number search of the Oracle "text"。From April 2022,We combined with Tencent to form a co -creation team to explore the integration of artificial intelligence technology,Use "Micro Market Analysis" to make a three -dimensional model,Use "glyph match" to perform Oracle "search word、Search Figure with words ",Realizing the actual high -fidelity display of Oracle、High -efficiency query in text、High -quality correlation between physical and text。

To break the situation that is decentralized by Oracle data,The co -creation team forms "Oracle Full Information Data Model",Implement 3D modeling、Layings of traditional data such as high -quality data such as text association and copy of the copy, etc.。Under the operation of the synergy mechanism,We propose to integrate artificial intelligence,Break through the "microdes extraction" technology、Photography、Transcript Technology,Gao Baozhen Show the Details of Oracle,Multi -dimensional fusion of Oracle data at the same time,Formation of extension、Multi -layer information coordinates alignment cross -media format "Oracle Full Information Data Model",The "physical" rejuvenation of the Oracle "real realization,Some of the results have been displayed on the WeChat Mini Program, which was released on April 20, 2023,With the attention of the industry and praise。Another,We pass authority、Professional、Practical、Interesting、Common Oracle Digital Network Version,Let more general public understand Oracle、Perceive Oracle、Study on Oracle、Use Oracle,The road to the inheritance and spread of Oracle is unobstructed。

Oracle Auxiliary Examination Bet365 lotto review Examination Based on "Yin Qi Wenyuan 2.0"。With the advancement of artificial intelligence technology in the information age,Based on big data technology to help push Oracle test interpretation must be a new idea and method,Expansion of Oracle -related data,Create more data support,Use more mature artificial intelligence technology, especially deep learning for Oracle auxiliary test interpretation research。Current,Expansion of Oracle -related data,We mainly clean the underlying cleaning of Oracle data,Update the record library、Gravoleum、Literature Library、Library,Construction of "Yin Qi Wenyuan 2.0 Oracle Biography" model library,Provides "word search with the glyph -based matching series algorithm、Search for Data Toolbox,Build Oracle Knowledge Map,AI algorithm with "glyph match" and "human machine collaboration" mode help Oracle "breaks"。

Oracle digital service construction has greatly promoted the in -depth study of Oracle,Especially in recent years, the development of artificial intelligence new technology mainly based on deep learning technology and high attention at the national level,It indicates that Oracle Studies have a bright prospects under the empowerment of digital wisdom。Although still facing more technical problems and other challenges,But do we believe,Combined with new technology、New means,Carry out more interdisciplinary in -depth research,will definitely make oracle bone culture rejuvenating in modern society,In -depth promotion of the creative transformation and innovative development of ancient texts and other ancient texts。

(This article is the ancient text and the Chinese civilization inheritance development project planning project "Yin Qi Wenyuan -Oracle Data Data Platform" (G2812)、"Identification and Extract Technology of heterogeneous data in oracle literature" (G1806)、The Research on the Periodic Teaching Service Platform of Oracle inheritance and Innovation of Oracle Bone Culture and Research Innovation Fund Projects (2021rya05002) phased results)

(Author Unit: Key Laboratory of Oracle Information Treatment of Oracle Information Treatment of Anyang Teachers College; Institute of Ancient History of the Chinese Academy of Social Sciences、Zhengzhou University Chinese Character Civilization Research Center)

Editor in charge: Changchang
QR code icons 2.jpg
Key recommendation
The latest article
Graphics
bet365 live casino games

Friendship link:

Website filing number: Jinggong.com Anmi 11010502030146 Ministry of Industry and Information Technology:

All rights reserved by China Social Sciences Magazine shall not be reproduced and used without permission

General Editor Email: zzszbj@126.com This website contact information: 010-85886809 Address: 11-12, Building 1, Building 1, No. 15, Guanghua Road, Chaoyang District, Beijing: 100026