PDF] YAWN: A Semantically Annotated Wikipedia XML Corpus
Por um escritor misterioso
Descrição
YAWN, a system to convert the well-known and widely used Wikipedia collection into an XML corpus with semantically rich, self-explaining tags, is presented. The paper presents YAWN, a system to convert the well-known and widely used Wikipedia collection into an XML corpus with semantically rich, self-explaining tags. We introduce algorithms to annotate pages and links with concepts from the WordNet thesaurus. This annotation process exploits categorical information in Wikipedia, which is a high-quality, manually assigned source of information, extracts additional information from lists, and utilizes the invocations of templates with named parameters. We give examples how such annotations can be exploited for high-precision queries.
Debabala Swain Machine Learning and Information 2020, PDF, Artificial Intelligence
PDF) Clitic climbing, finiteness and the raising control distinction. A corpus-based study.
Concept Extraction Using Pointer–Generator Networks and Distant Supervision for Data Augmentation
Characterizing the hypergraph-of-entity and the structural impact of its extensions, Applied Network Science
Army ANT: A Workbench for Innovation in Entity-Oriented Search
Characterizing the hypergraph-of-entity and the structural impact of its extensions, Applied Network Science
PDF) Automatic Annotation of Similes in Literary Texts
PDF) Mining the History Sections of Wikipedia Articles on Science and Technology
The language network reliably 'tracks' naturalistic meaningful non-verbal stimuli
Characterizing the hypergraph-of-entity and the structural impact of its extensions, Applied Network Science
Army ANT: A Workbench for Innovation in Entity-Oriented Search
Information_retrieval_and_extraction_IIIT
de
por adulto (o preço varia de acordo com o tamanho do grupo)