for Journals by Title or ISSN
for Articles by Keywords
help
  Subjects -> HUMANITIES (Total: 872 journals)
    - ASIAN STUDIES (157 journals)
    - CLASSICAL STUDIES (110 journals)
    - DEMOGRAPHY AND POPULATION STUDIES (142 journals)
    - ETHNIC INTERESTS (155 journals)
    - GENEALOGY AND HERALDRY (7 journals)
    - HUMANITIES (273 journals)
    - NATIVE AMERICAN STUDIES (28 journals)

HUMANITIES (273 journals)                  1 2     

Showing 1 - 71 of 71 Journals sorted alphabetically
Aboriginal and Islander Health Worker Journal     Full-text available via subscription   (Followers: 10)
Aboriginal Child at School     Full-text available via subscription   (Followers: 4)
About Performance     Full-text available via subscription   (Followers: 9)
Access     Full-text available via subscription   (Followers: 22)
ACCESS: Critical Perspectives on Communication, Cultural & Policy Studies     Full-text available via subscription   (Followers: 8)
Acta Academica     Full-text available via subscription   (Followers: 5)
Acta Universitaria     Open Access   (Followers: 4)
Adeptus     Open Access   (Followers: 1)
Advocate: Newsletter of the National Tertiary Education Union     Full-text available via subscription   (Followers: 2)
African and Black Diaspora: An International Journal     Hybrid Journal   (Followers: 11)
African Historical Review     Hybrid Journal   (Followers: 15)
AFRREV IJAH : An International Journal of Arts and Humanities     Open Access   (Followers: 2)
Agriculture and Human Values     Hybrid Journal   (Followers: 12)
Akademika : Journal of Southeast Asia Social Sciences and Humanities     Open Access   (Followers: 4)
Aldébaran     Open Access   (Followers: 2)
Altre Modernità     Open Access   (Followers: 3)
Amaltea. Revista de mitocrítica     Open Access   (Followers: 1)
American Imago     Full-text available via subscription   (Followers: 3)
American Journal of Humanities and Social Sciences     Open Access   (Followers: 8)
American Review of Canadian Studies     Hybrid Journal   (Followers: 6)
Anabases     Open Access  
Analyse & Kritik. Zeitschrift f     Full-text available via subscription  
Angelaki: Journal of Theoretical Humanities     Hybrid Journal   (Followers: 15)
Antik Tanulmányok     Full-text available via subscription  
Antipode     Hybrid Journal   (Followers: 45)
Anuario Americanista Europeo     Open Access  
Arbutus Review     Open Access  
Argumentation et analyse du discours     Open Access   (Followers: 6)
Ars & Humanitas     Open Access   (Followers: 4)
Arts and Humanities in Higher Education     Hybrid Journal   (Followers: 29)
Asia Europe Journal     Hybrid Journal   (Followers: 4)
Australasian Journal of Popular Culture, The     Hybrid Journal   (Followers: 2)
Behaviour & Information Technology     Hybrid Journal   (Followers: 50)
Behemoth     Open Access   (Followers: 3)
Bereavement Care     Hybrid Journal   (Followers: 8)
Cahiers de praxématique     Open Access   (Followers: 1)
Carl Beck Papers in Russian and East European Studies     Full-text available via subscription   (Followers: 5)
Child Care     Full-text available via subscription   (Followers: 6)
Choreographic Practices     Hybrid Journal   (Followers: 1)
Chronicle of Philanthropy     Full-text available via subscription   (Followers: 2)
Ciencias Sociales y Humanidades     Open Access   (Followers: 1)
Claroscuro     Open Access   (Followers: 1)
Co-herencia     Open Access  
Coaching: An International Journal of Theory, Research and Practice     Hybrid Journal   (Followers: 9)
Cogent Arts & Humanities     Open Access   (Followers: 3)
Colloquia Humanistica     Open Access  
Communication and Critical/Cultural Studies     Hybrid Journal   (Followers: 21)
Comprehensive Therapy     Hybrid Journal   (Followers: 3)
Congenital Anomalies     Hybrid Journal   (Followers: 1)
Conjunctions. Transdisciplinary Journal of Cultural Participation     Open Access   (Followers: 2)
Conservation Science in Cultural Heritage     Open Access   (Followers: 10)
Cornish Studies     Hybrid Journal   (Followers: 2)
Creative Industries Journal     Hybrid Journal   (Followers: 8)
Critical Arts : South-North Cultural and Media Studies     Hybrid Journal   (Followers: 11)
Crossing the Border : International Journal of Interdisciplinary Studies     Open Access   (Followers: 4)
Cuadernos de historia de España     Open Access   (Followers: 3)
Cultural History     Hybrid Journal   (Followers: 18)
Cultural Studies     Hybrid Journal   (Followers: 44)
Culturas     Open Access   (Followers: 1)
Culture, Theory and Critique     Hybrid Journal   (Followers: 22)
Daedalus     Hybrid Journal   (Followers: 14)
Dandelion : Postgraduate Arts Journal & Research Network     Open Access   (Followers: 2)
Death Studies     Hybrid Journal   (Followers: 17)
Debatte: Journal of Contemporary Central and Eastern Europe     Hybrid Journal   (Followers: 5)
Digital Humanities Quarterly     Open Access   (Followers: 54)
Diogenes     Hybrid Journal   (Followers: 7)
Doct-Us Journal     Open Access  
e-Hum : Revista das Áreas de Humanidade do Centro Universitário de Belo Horizonte     Open Access   (Followers: 1)
Early Modern Culture Online     Open Access   (Followers: 36)
Égypte - Monde arabe     Open Access   (Followers: 4)
Eighteenth-Century Fiction     Full-text available via subscription   (Followers: 18)
Éire-Ireland     Full-text available via subscription   (Followers: 8)
En-Claves del pensamiento     Open Access   (Followers: 1)
Ethiopian Journal of the Social Sciences and Humanities     Full-text available via subscription   (Followers: 6)
Études arméniennes contemporaines     Open Access   (Followers: 1)
Études canadiennes / Canadian Studies     Open Access   (Followers: 1)
Études de lettres     Open Access   (Followers: 2)
European Journal of Cultural Studies     Hybrid Journal   (Followers: 22)
European Journal of Social Theory     Hybrid Journal   (Followers: 14)
Expositions     Full-text available via subscription  
Fronteras : Revista de Ciencias Sociales y Humanidades     Open Access   (Followers: 2)
Fudan Journal of the Humanities and Social Sciences     Hybrid Journal  
GAIA - Ecological Perspectives for Science and Society     Full-text available via subscription   (Followers: 4)
German Research     Hybrid Journal   (Followers: 1)
German Studies Review     Full-text available via subscription   (Followers: 25)
Germanic Review, The     Hybrid Journal   (Followers: 5)
Globalizations     Hybrid Journal   (Followers: 8)
Gothic Studies     Full-text available via subscription   (Followers: 11)
Gruppendynamik und Organisationsberatung     Hybrid Journal   (Followers: 1)
Habitat International     Hybrid Journal   (Followers: 5)
Hacettepe Üniversitesi Edebiyat Fakültesi Dergisi     Open Access   (Followers: 1)
Harvard Journal of Asiatic Studies     Full-text available via subscription   (Followers: 11)
Heritage & Society     Hybrid Journal   (Followers: 15)
History of Humanities     Full-text available via subscription   (Followers: 3)
Hopscotch: A Cultural Review     Full-text available via subscription  
Human Affairs     Open Access   (Followers: 1)
Human and Ecological Risk Assessment: An International Journal     Hybrid Journal   (Followers: 4)
Human Nature     Hybrid Journal   (Followers: 14)
Human Performance     Hybrid Journal   (Followers: 4)
Human Remains and Violence : An Interdisciplinary Journal     Full-text available via subscription  
Human Studies     Hybrid Journal   (Followers: 11)
humanidades     Open Access  
Humanitaire     Open Access   (Followers: 1)
Humanities     Open Access   (Followers: 11)
Hungarian Cultural Studies     Open Access  
Hungarian Studies     Full-text available via subscription  
Ibadan Journal of Humanistic Studies     Full-text available via subscription  
Inkanyiso : Journal of Humanities and Social Sciences     Open Access   (Followers: 1)
Inter Faculty     Open Access  
Interim : Interdisciplinary Journal     Open Access   (Followers: 3)
International Journal for History, Culture and Modernity     Open Access   (Followers: 5)
International Journal of Arab Culture, Management and Sustainable Development     Hybrid Journal   (Followers: 7)
International Journal of Cultural Studies     Hybrid Journal   (Followers: 21)
International Journal of Heritage Studies     Hybrid Journal   (Followers: 16)
International Journal of Humanities and Arts Computing     Hybrid Journal   (Followers: 12)
International Journal of Humanities and Cultural Studies     Open Access   (Followers: 1)
International Journal of Humanities of the Islamic Republic of Iran     Open Access   (Followers: 11)
International Journal of Listening     Hybrid Journal   (Followers: 4)
International Journal of the Classical Tradition     Hybrid Journal   (Followers: 7)
Interventions : International Journal of Postcolonial Studies     Hybrid Journal   (Followers: 15)
ÍSTMICA. Revista de la Facultad de Filosofía y Letras     Open Access  
Jangwa Pana     Open Access  
Jewish Culture and History     Hybrid Journal   (Followers: 15)
Journal de la Société des Américanistes     Open Access  
Journal des africanistes     Open Access   (Followers: 1)
Journal for Cultural Research     Hybrid Journal   (Followers: 10)
Journal for General Philosophy of Science     Hybrid Journal   (Followers: 7)
Journal for Learning Through the Arts     Open Access   (Followers: 7)
Journal for New Generation Sciences     Open Access   (Followers: 2)
Journal for Research into Freemasonry and Fraternalism     Hybrid Journal  
Journal for Semitics     Full-text available via subscription   (Followers: 5)
Journal Of Advances In Humanities     Open Access   (Followers: 2)
Journal of Aesthetics & Culture     Open Access   (Followers: 17)
Journal of African American Studies     Hybrid Journal   (Followers: 8)
Journal of African Cultural Studies     Hybrid Journal   (Followers: 5)
Journal of African Elections     Full-text available via subscription  
Journal of Arts & Communities     Hybrid Journal   (Followers: 4)
Journal of Arts and Humanities     Open Access   (Followers: 16)
Journal of Bioethical Inquiry     Hybrid Journal   (Followers: 3)
Journal of Cultural Economy     Hybrid Journal   (Followers: 9)
Journal of Cultural Geography     Hybrid Journal   (Followers: 21)
Journal of Data Mining and Digital Humanities     Open Access   (Followers: 23)
Journal of Developing Societies     Hybrid Journal   (Followers: 2)
Journal of Family Theory & Review     Hybrid Journal   (Followers: 2)
Journal of Franco-Irish Studies     Open Access   (Followers: 1)
Journal of Happiness Studies     Hybrid Journal   (Followers: 23)
Journal of Interactive Humanities     Open Access   (Followers: 3)
Journal of Intercultural Communication Research     Hybrid Journal   (Followers: 14)
Journal of Intercultural Studies     Hybrid Journal   (Followers: 12)
Journal of Interdisciplinary History     Hybrid Journal   (Followers: 21)
Journal of Labor Research     Hybrid Journal   (Followers: 17)
Journal of Medical Humanities     Hybrid Journal   (Followers: 22)
Journal of Medieval and Early Modern Studies     Full-text available via subscription   (Followers: 31)
Journal of Modern Greek Studies     Full-text available via subscription   (Followers: 4)
Journal of Modern Jewish Studies     Hybrid Journal   (Followers: 11)
Journal of Open Humanities Data     Open Access  
Journal of Semantics     Hybrid Journal   (Followers: 11)
Journal of the Musical Arts in Africa     Hybrid Journal   (Followers: 1)
Journal of Visual Culture     Hybrid Journal   (Followers: 26)
Jurisprudence     Hybrid Journal   (Followers: 17)
L'Orientation scolaire et professionnelle     Open Access   (Followers: 1)
La lettre du Collège de France     Open Access  
La Revue pour l’histoire du CNRS     Open Access   (Followers: 2)
Lagos Notes and Records     Full-text available via subscription  
Language and Intercultural Communication     Hybrid Journal   (Followers: 20)
Language Resources and Evaluation     Hybrid Journal   (Followers: 7)
Law and Humanities     Hybrid Journal   (Followers: 8)
Law, Culture and the Humanities     Hybrid Journal   (Followers: 10)
Le Portique     Open Access   (Followers: 1)
Leadership     Hybrid Journal   (Followers: 29)
Legal Ethics     Hybrid Journal   (Followers: 14)
Legon Journal of the Humanities     Full-text available via subscription  
Letras : Órgano de la Facultad de Letras y Ciencias Huamans     Open Access  
Literary and Linguistic Computing     Hybrid Journal   (Followers: 5)
Litnet Akademies : 'n Joernaal vir die Geesteswetenskappe, Natuurwetenskappe, Regte en Godsdienswetenskappe     Open Access  
Lwati : A Journal of Contemporary Research     Full-text available via subscription  
Measurement     Hybrid Journal   (Followers: 3)
Medical Humanities     Full-text available via subscription   (Followers: 25)
Medieval Encounters     Hybrid Journal   (Followers: 9)
Médiévales     Open Access   (Followers: 5)
Mélanges de la Casa de Velázquez     Partially Free   (Followers: 1)
Memory Studies     Hybrid Journal   (Followers: 31)
Mens : revue d'histoire intellectuelle et culturelle     Full-text available via subscription  
Messages, Sages and Ages     Open Access  
Mind and Matter     Full-text available via subscription   (Followers: 3)
Modern Italy     Hybrid Journal   (Followers: 6)
Motivation Science     Full-text available via subscription  
Mouseion     Open Access   (Followers: 1)
Mouseion: Journal of the Classical Association of Canada     Full-text available via subscription   (Followers: 11)
Museum International Edition Francaise     Hybrid Journal   (Followers: 4)
National Academy Science Letters     Hybrid Journal   (Followers: 5)
Nationalities Papers     Hybrid Journal   (Followers: 7)
Natures Sciences Sociétés     Full-text available via subscription  
Neophilologus     Hybrid Journal   (Followers: 8)
New German Critique     Full-text available via subscription   (Followers: 11)
New West Indian Guide     Open Access  
nonsite.org     Full-text available via subscription   (Followers: 3)
Northeast African Studies     Full-text available via subscription   (Followers: 3)
OMEGA - Journal of Death and Dying     Full-text available via subscription   (Followers: 8)
Österreichische Zeitschrift für Soziologie     Hybrid Journal  

        1 2     

Journal Cover Language Resources and Evaluation
  [SJR: 0.915]   [H-I: 31]   [7 followers]  Follow
    
   Hybrid Journal Hybrid journal (It can contain Open Access articles)
   ISSN (Print) 1574-0218 - ISSN (Online) 1574-020X
   Published by Springer-Verlag Homepage  [2335 journals]
  • Erratum to: Replicability and reproducibility of research results for
           human language technology: introducing an LRE special section
    • Authors: António Branco; Kevin Bretonnel Cohen; Piek Vossen; Nancy Ide; Nicoletta Calzolari
      PubDate: 2017-03-17
      DOI: 10.1007/s10579-017-9386-7
       
  • What’s missing in geographical parsing?
    • Authors: Milan Gritta; Mohammad Taher Pilehvar; Nut Limsopatham; Nigel Collier
      Abstract: Geographical data can be obtained by converting place names from free-format text into geographical coordinates. The ability to geo-locate events in textual reports represents a valuable source of information in many real-world applications such as emergency responses, real-time social media geographical event analysis, understanding location instructions in auto-response systems and more. However, geoparsing is still widely regarded as a challenge because of domain language diversity, place name ambiguity, metonymic language and limited leveraging of context as we show in our analysis. Results to date, whilst promising, are on laboratory data and unlike in wider NLP are often not cross-compared. In this study, we evaluate and analyse the performance of a number of leading geoparsers on a number of corpora and highlight the challenges in detail. We also publish an automatically geotagged Wikipedia corpus to alleviate the dearth of (open source) corpora in this domain.
      PubDate: 2017-03-07
      DOI: 10.1007/s10579-017-9385-8
       
  • LRE journal CNL introduction
    • PubDate: 2017-03-01
       
  • A multilingual FrameNet-based grammar and lexicon for controlled natural
           language
    • Abstract: Berkeley FrameNet is a lexico-semantic resource for English based on the theory of frame semantics. It has been exploited in a range of natural language processing applications and has inspired the development of framenets for many languages. We present a methodological approach to the extraction and generation of a computational multilingual FrameNet-based grammar and lexicon. The approach leverages FrameNet-annotated corpora to automatically extract a set of cross-lingual semantico-syntactic valence patterns. Based on data from Berkeley FrameNet and Swedish FrameNet, the proposed approach has been implemented in Grammatical Framework (GF), a categorial grammar formalism specialized for multilingual grammars. The implementation of the grammar and lexicon is supported by the design of FrameNet, providing a frame semantic abstraction layer, an interlingual semantic application programming interface (API), over the interlingual syntactic API already provided by GF Resource Grammar Library. The evaluation of the acquired grammar and lexicon shows the feasibility of the approach. Additionally, we illustrate how the FrameNet-based grammar and lexicon are exploited in two distinct multilingual controlled natural language applications. The produced resources are available under an open source license.
      PubDate: 2017-03-01
       
  • A GF miniature resource grammar for Tswana: modelling the proper verb
    • Abstract: The Grammatical Framework (GF) not only offers state of the art grammar-based machine translation support between an increasing number of languages through its so-called Resource Grammar Library, but is also fast becoming a de facto framework for developing multilingual controlled natural languages (CNLs). For a natural language to share maximally in the opportunities that GF-based multilingual CNL support presents, it has to have a GF resource grammar. Tswana, an agglutinating Bantu language, spoken in Southern Africa as one of the eleven official languages of South Africa, does not yet have such a grammar. This article reports on the development of a so-called miniature resource grammar, a first step towards a full resource grammar for Tswana. The focus is on the modelling of the Tswana proper verb as it occurs in simple sentences. The (proper) verb is the morphologically most complex word category in Tswana, and therefore constitutes a notable contribution towards the development of a GF resource grammar for Tswana. The computational model is discussed in some detail, implemented and tested on a systematically constructed treebank.
      PubDate: 2017-03-01
       
  • The PENG ASP system: architecture, language and authoring tool
    • Abstract: In this article, we describe the architecture, the language and the authoring tool of the PENG \(^{ASP}\) system. This system supports the writing of non-monotonic specifications in controlled natural language with the help of a web-based predictive text editor. This predictive editor communicates asynchronously with a controlled natural language processor that translates the specification text via discourse representation structures into executable Answer Set Programs (ASP). The controlled natural language processor additionally generates lookahead categories and anaphoric expressions for the author of a specification text, and it provides a paraphrase of the specification that clarifies the interpretation of the text by the machine. The predictive editor is a central component of the PENG \(^{ASP}\) system; it guides the writing process and displays multiple sets of lookahead categories simultaneously for different possible sentence completions as well as anaphoric expressions, and supports the addition of new content words to the lexicon .
      PubDate: 2017-03-01
       
  • CNLs for the semantic web: a state of the art
    • Abstract: One of the core challenges for building the semantic web is the creation of ontologies, a process known as ontology authoring. Controlled natural languages (CNLs) propose different frameworks for interfacing and creating ontologies in semantic web systems using restricted natural language. However, in order to engage non-expert users with no background in knowledge engineering, these language interfacing must be reliable, easy to understand and accepted by users. This paper includes the state-of-the-art for CNLs in terms of ontology authoring and the semantic web. In addition, it includes a detailed analysis of user evaluations with respect to each CNL and offers analytic conclusions with respect to the field.
      PubDate: 2017-03-01
       
  • Fully automatic multi-language translation with a catalogue of phrases:
           successful employment for the Swiss avalanche bulletin
    • Abstract: The Swiss avalanche bulletin is produced twice a day in four languages. Due to the lack of time available for manual translation, a fully automated translation system is employed, based on a catalogue of predefined phrases and predetermined rules of how these phrases can be combined to produce sentences. Because this catalogue of phrases is limited to a small sublanguage, the system is able to automatically translate such sentences from German into the target languages French, Italian and English without subsequent proofreading or correction. Having been operational for two winter seasons, we assess here the quality of the produced texts based on two different surveys where participants rated texts from real avalanche bulletins from both origins, the catalogue of phrases versus manually written and translated texts. With a mean recognition rate of 55 %, users can hardly distinguish between the two types of texts, and give very similar ratings with respect to their language quality. Overall, the output from the catalogue system can be considered virtually equivalent to a text written by avalanche forecasters and then manually translated by professional translators. Furthermore, forecasters declared that all relevant situations were captured by the system with sufficient accuracy. Forecaster’s working load did not change with the introduction of the catalogue: the extra time to find matching sentences is compensated by the fact that they no longer need to double-check manually translated texts. The reduction of daily translation costs is expected to offset the initial development costs within a few years.
      PubDate: 2017-03-01
       
  • A hybrid (visual/natural) controlled language
    • Abstract: We define the notion of controlled hybrid language that allows information share and interaction between a controlled natural language (specified by a context-free grammar) and a controlled visual language (specified by a Symbol-Relation grammar). We present the controlled hybrid language INAUT, used to represent nautical charts of the French Naval and Hydrographic Service (SHOM) and their companion texts (Instructions nautiques).
      PubDate: 2017-03-01
       
  • Toward a knowledge-to-text controlled natural language of isiZulu
    • Abstract: The language isiZulu belongs to the Nguni group of languages, which also include isiXhosa, isiNdebele and siSwati. Of the four Nguni languages, isiZulu is the most dominant language in South Africa, which is spoken by 22.7 % of the country’s 51.8 million population. However, isiZulu (and even more so the other Nguni languages) still remains an under-resourced language for software applications. In this article we focus on controlled natural languages for structured knowledge-to-text viewed from a potential utility for verbalising business rules and OWL ontologies. IsiZulu grammar—and by extension, all Bantu languages—shows that a template-based approach is infeasible. This is due to, mainly, the noun class system, the agglutination and verb conjugation with concords for each noun class. We present verbalisation patterns for existential and universal quantification, taxonomic subsumption, axioms with simple properties, and basic cases of negation. Based on the preliminary user assessment of the patterns, selected ones are refined into algorithms for verbalisation to generate correct isiZulu sentences, which have been evaluated.
      PubDate: 2017-03-01
       
  • Towards the creation of a CNL adapted to requirements writing by combining
           writing recommendations and spontaneous regularities: example in a space
           project
    • Abstract: The Quality Department of the French National Space Agency (CNES, Centre National d’Études Spatiales) wishes to design a writing guide based on the real and regular writing of requirements. As a first step in this project, the present article proposes a linguistic analysis of requirements written in French by CNES engineers. One of our goals is to determine to what extent they conform to several rules laid down in two existing Controlled Natural Languages (CNLs), namely the Simplified Technical English developed by the AeroSpace and Defense Industries Association of Europe and the Guide for Writing Requirements proposed by the International Council on Systems Engineering. Indeed, although CNES engineers are not obliged to follow any controlled language in their writing of requirements, we believe that language regularities are likely to emerge from this task, mainly due to the writers’ experience. We are seeking to identify these regularities in order to use them as a basis for a new CNL for the writing of requirements. The issue is approached using natural language processing tools to identify sentences that do not comply with the rules or contain specific linguistic phenomena. We further review these sentences to understand why the recommendations cannot (or should not) always be applied when specifying large-scale projects.
      PubDate: 2017-03-01
       
  • Software requirements as an application domain for natural language
           processing
    • Authors: Themistoklis Diamantopoulos; Michael Roth; Andreas Symeonidis; Ewan Klein
      Abstract: Mapping functional requirements first to specifications and then to code is one of the most challenging tasks in software development. Since requirements are commonly written in natural language, they can be prone to ambiguity, incompleteness and inconsistency. Structured semantic representations allow requirements to be translated to formal models, which can be used to detect problems at an early stage of the development process through validation. Storing and querying such models can also facilitate software reuse. Several approaches constrain the input format of requirements to produce specifications, however they usually require considerable human effort in order to adopt domain-specific heuristics and/or controlled languages. We propose a mechanism that automates the mapping of requirements to formal representations using semantic role labeling. We describe the first publicly available dataset for this task, employ a hierarchical framework that allows requirements concepts to be annotated, and discuss how semantic role labeling can be adapted for parsing software requirements.
      PubDate: 2017-02-27
      DOI: 10.1007/s10579-017-9381-z
       
  • Replicability and reproducibility of research results for human language
           technology: introducing an LRE special section
    • Authors: António Branco; Kevin Bretonnel Cohen; Piek Vossen; Nancy Ide; Nicoletta Calzolari
      PubDate: 2017-02-16
      DOI: 10.1007/s10579-017-9380-0
       
  • A French clinical corpus with comprehensive semantic annotations:
           development of the Medical Entity and Relation LIMSI annOtated Text corpus
           (MERLOT)
    • Authors: Leonardo Campillos; Louise Deléger; Cyril Grouin; Thierry Hamon; Anne-Laure Ligozat; Aurélie Névéol
      Abstract: Quality annotated resources are essential for Natural Language Processing. The objective of this work is to present a corpus of clinical narratives in French annotated for linguistic, semantic and structural information, aimed at clinical information extraction. Six annotators contributed to the corpus annotation, using a comprehensive annotation scheme covering 21 entities, 11 attributes and 37 relations. All annotators trained on a small, common portion of the corpus before proceeding independently. An automatic tool was used to produce entity and attribute pre-annotations. About a tenth of the corpus was doubly annotated and annotation differences were resolved in consensus meetings. To ensure annotation consistency throughout the corpus, we devised harmonization tools to automatically identify annotation differences to be addressed to improve the overall corpus quality. The annotation project spanned over 24 months and resulted in a corpus comprising 500 documents (148,476 tokens) annotated with 44,740 entities and 26,478 relations. The average inter-annotator agreement is 0.793 F-measure for entities and 0.789 for relations. The performance of the pre-annotation tool for entities reached 0.814 F-measure when sufficient training data was available. The performance of our entity pre-annotation tool shows the value of the corpus to build and evaluate information extraction methods. In addition, we introduced harmonization methods that further improved the quality of annotations in the corpus.
      PubDate: 2017-02-15
      DOI: 10.1007/s10579-017-9382-y
       
  • An approach to measuring and annotating the confidence of Wiktionary
           translations
    • Authors: Antonio J. Roa-Valverde; Salvador Sanchez-Alonso; Miguel-Angel Sicilia; Dieter Fensel
      Abstract: Wiktionary is an online collaborative project based on the same principle than Wikipedia , where users can create, edit and delete entries containing lexical information. While the open nature of Wiktionary is the reason for its fast growth, it has also brought a problem: how reliable is the lexical information contained in every article? If we are planing to use Wiktionary translations as source content to accomplish a certain use case, we need to be able to answer this question and extract measures of their confidence . In this paper we present our work on assessing the quality of Wiktionary translations by introducing confidence metrics. Additionally, we describe our effort to share Wiktionary translations and the associated confidence values as linked data.
      PubDate: 2017-02-06
      DOI: 10.1007/s10579-017-9384-9
       
  • RST Signalling Corpus: a corpus of signals of coherence relations
    • Authors: Debopam Das; Maite Taboada
      Abstract: We present the RST Signalling Corpus (Das et al. in RST signalling corpus, LDC2015T10. https://catalog.ldc.upenn.edu/LDC2015T10, 2015), a corpus annotated for signals of coherence relations. The corpus is developed over the RST Discourse Treebank (Carlson et al. in RST Discourse Treebank, LDC2002T07. https://catalog.ldc.upenn.edu/LDC2002T07, 2002) which is annotated for coherence relations. In the RST Signalling Corpus, these relations are further annotated with signalling information. The corpus includes annotation not only for discourse markers which are considered to be the most typical (or sometimes the only type of) signals in discourse, but also for a wide array of other signals such as reference, lexical, semantic, syntactic, graphical and genre features as potential indicators of coherence relations. We describe the research underlying the development of the corpus and the annotation process, and provide details of the corpus. We also present the results of an inter-annotator agreement study, illustrating the validity and reproducibility of the annotation. The corpus is available through the Linguistic Data Consortium, and can be used to investigate the psycholinguistic mechanisms behind the interpretation of relations through signalling, and also to develop discourse-specific computational systems such as discourse parsing applications.
      PubDate: 2017-02-03
      DOI: 10.1007/s10579-017-9383-x
       
  • Hindi CCGbank: A CCG treebank from the Hindi dependency treebank
    • Authors: Bharat Ram Ambati; Tejaswini Deoskar; Mark Steedman
      Abstract: In this paper, we present an approach for automatically creating a combinatory categorial grammar (CCG) treebank from a dependency treebank for the subject–object–verb language Hindi. Rather than a direct conversion from dependency trees to CCG trees, we propose a two stage approach: a language independent generic algorithm first extracts a CCG lexicon from the dependency treebank. An exhaustive CCG parser then creates a treebank of CCG derivations. We also discuss special cases of this generic algorithm to handle linguistic phenomena specific to Hindi. In doing so we extract different constructions with long-range dependencies like coordinate constructions and non-projective dependencies resulting from constructions like relative clauses, noun elaboration and verbal modifiers.
      PubDate: 2017-01-25
      DOI: 10.1007/s10579-017-9379-6
       
  • Building and evaluating web corpora representing national varieties of
           English
    • Authors: Paul Cook; Laurel J. Brinton
      Abstract: Corpora are essential resources for language studies, as well as for training statistical natural language processing systems. Although very large English corpora have been built, only relatively small corpora are available for many varieties of English. National top-level domains (e.g., .au, .ca) could be exploited to automatically build web corpora, but it is unclear whether such corpora would reflect the corresponding national varieties of English; i.e., would a web corpus built from the .ca domain correspond to Canadian English? In this article we build web corpora from national top-level domains corresponding to countries in which English is widely spoken. We then carry out statistical analyses of these corpora in terms of keywords, measures of corpus comparison based on the Chi-square test and spelling variants, and the frequencies of words known to be marked in particular varieties of English. We find evidence that the web corpora indeed reflect the corresponding national varieties of English. We then demonstrate, through a case study on the analysis of Canadianisms, that these corpora could be valuable lexicographical resources.
      PubDate: 2017-01-06
      DOI: 10.1007/s10579-016-9378-z
       
  • TweetLID: a benchmark for tweet language identification
    • Authors: Arkaitz Zubiaga; Iñaki San Vicente; Pablo Gamallo; José Ramom Pichel; Iñaki Alegria; Nora Aranberri; Aitzol Ezeiza; Víctor Fresno
      Pages: 729 - 766
      Abstract: Language identification, as the task of determining the language a given text is written in, has progressed substantially in recent decades. However, three main issues remain still unresolved: (1) distinction of similar languages, (2) detection of multilingualism in a single document, and (3) identifying the language of short texts. In this paper, we describe our work on the development of a benchmark to encourage further research in these three directions, set forth an evaluation framework suitable for the task, and make a dataset of annotated tweets publicly available for research purposes. We also describe the shared task we organized to validate and assess the evaluation framework and dataset with systems submitted by seven different participants, and analyze the performance of these systems. The evaluation of the results submitted by the participants of the shared task helped us shed some light on the shortcomings of state-of-the-art language identification systems, and gives insight into the extent to which the brevity, multilingualism, and language similarity found in texts exacerbate the performance of language identifiers. Our dataset with nearly 35,000 tweets and the evaluation framework provide researchers and practitioners with suitable resources to further study the aforementioned issues on language identification within a common setting that enables to compare results with one another.
      PubDate: 2016-12-01
      DOI: 10.1007/s10579-015-9317-4
      Issue No: Vol. 50, No. 4 (2016)
       
  • Curras: an annotated corpus for the Palestinian Arabic dialect
    • Authors: Mustafa Jarrar; Nizar Habash; Faeq Alrimawi; Diyam Akra; Nasser Zalmout
      Abstract: In this article we present Curras, the first morphologically annotated corpus of the Palestinian Arabic dialect. Palestinian Arabic is one of the many primarily spoken dialects of the Arabic language. Arabic dialects are generally under-resourced compared to Modern Standard Arabic, the primarily written and official form of Arabic. We start in the article with a background description that situates Palestinian Arabic linguistically and historically and compares it to Modern Standard Arabic and Egyptian Arabic in terms of phonological, morphological, orthographic, and lexical variations. We then describe the methodology we developed to collect Palestinian Arabic text to guarantee a variety of representative domains and genres. We also discuss the annotation process we used, which extended previous efforts for annotation guideline development, and utilized existing automatic annotation solutions for Standard Arabic and Egyptian Arabic. The annotation guidelines and annotation meta-data are described in detail. The Curras Palestinian Arabic corpus consists of more than 56 K tokens, which are annotated with rich morphological and lexical features. The inter-annotator agreement results indicate a high degree of consistency.
      PubDate: 2016-12-08
      DOI: 10.1007/s10579-016-9370-7
       
 
 
JournalTOCs
School of Mathematical and Computer Sciences
Heriot-Watt University
Edinburgh, EH14 4AS, UK
Email: journaltocs@hw.ac.uk
Tel: +00 44 (0)131 4513762
Fax: +00 44 (0)131 4513327
 
Home (Search)
Subjects A-Z
Publishers A-Z
Customise
APIs
Your IP address: 54.161.161.92
 
About JournalTOCs
API
Help
News (blog, publications)
JournalTOCs on Twitter   JournalTOCs on Facebook

JournalTOCs © 2009-2016