A best-effort approach to an infrastructure for Chinese Web related research
Weining QIAN , Aoying ZHOU , Minqi ZHOU
Front. Electr. Electron. Eng. ›› 2011, Vol. 6 ›› Issue (2) : 388 -396.
A best-effort approach to an infrastructure for Chinese Web related research
The design of the infrastructure for Chinese Web (CWI), a prototype system aimed at forum data analysis, is introduced. CWI takes a best effort approach. 1) It tries its best to extract or annotate semantics over the web data. 2) It provides flexible schemes for users to transform the web data into eXtensible Markup Language (XML) forms with more semantic annotations that are more friendly for further analytical tasks. 3) A distributed graph repository, called DISGR is used as backend for management of web data. The paper introduces the design issues, reports the progress of the implementation, and discusses the research issues that are under study.
Chinese Web infrastructure / semantic entity / graph data model / distributed storage
| [1] |
|
| [2] |
China Internet Network Information Center. The 24th Statistical Report on the Development of the Chinese Internet. CNNIC, 2009 |
| [3] |
|
| [4] |
|
| [5] |
|
| [6] |
|
| [7] |
|
| [8] |
|
| [9] |
|
| [10] |
|
| [11] |
|
| [12] |
|
| [13] |
|
| [14] |
|
| [15] |
|
| [16] |
|
| [17] |
|
| [18] |
|
| [19] |
|
| [20] |
|
| [21] |
|
| [22] |
|
| [23] |
|
| [24] |
|
Higher Education Press and Springer-Verlag Berlin Heidelberg
/
| 〈 |
|
〉 |