for Journals by Title or ISSN
for Articles by Keywords
help
  Subjects -> SOCIAL SCIENCES (Total: 1568 journals)
    - BIRTH CONTROL (20 journals)
    - CHILDREN AND YOUTH (256 journals)
    - FOLKLORE (30 journals)
    - MATRIMONY (16 journals)
    - MEN'S INTERESTS (18 journals)
    - MEN'S STUDIES (91 journals)
    - SEXUALITY (52 journals)
    - SOCIAL SCIENCES (871 journals)
    - WOMEN'S INTERESTS (44 journals)
    - WOMEN'S STUDIES (170 journals)

SOCIAL SCIENCES (871 journals)            First | 1 2 3 4 5     

Showing 401 - 136 of 136 Journals sorted alphabetically
Investigación y Desarrollo     Open Access   (Followers: 1)
Investigaciones Geográficas (Esp)     Open Access  
Irish Journal of Applied Social Studies     Open Access   (Followers: 5)
Issues in Social Science     Open Access   (Followers: 5)
Ithaca : Viaggio nella Scienza     Open Access  
IULC Working Papers     Open Access  
Ius et Praxis     Open Access  
JICSA : Journal of Islamic Civilization in Southeast Asia     Open Access  
Journal for New Generation Sciences     Open Access   (Followers: 4)
Journal for Semitics     Full-text available via subscription   (Followers: 8)
Journal of Addiction & Prevention     Open Access   (Followers: 1)
Journal of Advanced Academic Research     Open Access   (Followers: 2)
Journal of Agriculture and Social Research (JASR)     Open Access   (Followers: 5)
Journal of Agriculture, Forestry and the Social Sciences     Full-text available via subscription   (Followers: 5)
Journal of Applied Social Psychology     Hybrid Journal   (Followers: 52)
Journal of Applied Social Science     Hybrid Journal   (Followers: 17)
Journal of Arts and Social Sciences     Open Access  
Journal of ASIAN Behavioural Studies     Open Access   (Followers: 4)
Journal of Cognition and Culture     Hybrid Journal   (Followers: 16)
Journal of Comparative Social Welfare     Hybrid Journal   (Followers: 18)
Journal of Contemporary African Studies     Hybrid Journal   (Followers: 4)
Journal of Critical Race inquiry     Open Access   (Followers: 10)
Journal of Cultural Economy     Hybrid Journal   (Followers: 9)
Journal of Cultural Heritage     Full-text available via subscription   (Followers: 17)
Journal of Development Effectiveness     Hybrid Journal   (Followers: 5)
Journal of Educational Social Studies     Open Access   (Followers: 9)
Journal of Family & Consumer Sciences     Full-text available via subscription   (Followers: 1)
Journal of Family Studies     Hybrid Journal   (Followers: 10)
Journal of Globalization and Development     Hybrid Journal   (Followers: 12)
Journal of Human Security     Open Access   (Followers: 10)
Journal of Humanity     Open Access   (Followers: 5)
Journal of Ilahiyat Researches     Open Access  
Journal of Interdisciplinary Gender Studies: JIGS     Full-text available via subscription   (Followers: 17)
Journal of Iran Cultural Research     Open Access   (Followers: 1)
Journal of Korean Studies     Full-text available via subscription   (Followers: 12)
Journal of Language and Social Psychology     Hybrid Journal   (Followers: 13)
Journal of Markets & Morality     Partially Free  
Journal of Mediterranean Knowledge     Open Access   (Followers: 6)
Journal of Men, Masculinities and Spirituality     Full-text available via subscription   (Followers: 13)
Journal of Methods and Measurement in the Social Sciences     Open Access   (Followers: 4)
Journal of Migration and Refugee Issues, The     Full-text available via subscription   (Followers: 34)
Journal of Organisational Transformation & Social Change     Hybrid Journal   (Followers: 7)
Journal of Pan African Studies     Open Access   (Followers: 2)
Journal of Personality and Social Psychology     Full-text available via subscription   (Followers: 292, SJR: 4.302, CiteScore: 6)
Journal of Policy Practice     Hybrid Journal   (Followers: 5)
Journal of Poverty and Social Justice     Full-text available via subscription   (Followers: 28)
Journal of Progressive Research in Social Sciences     Open Access   (Followers: 4)
Journal of Purdue Undergraduate Research     Open Access   (Followers: 1)
Journal of Relationships Research     Hybrid Journal   (Followers: 5)
Journal of Religion & Spirituality in Social Work: Social Thought     Hybrid Journal   (Followers: 10)
Journal of Research in National Development     Full-text available via subscription  
Journal of Responsible Innovation     Hybrid Journal   (Followers: 6)
Journal of Social Change     Open Access   (Followers: 7)
Journal of Social Development in Africa     Full-text available via subscription   (Followers: 5)
Journal of Social Distress and the Homeless     Hybrid Journal   (Followers: 6)
Journal of Social Intervention: Theory and Practice     Open Access   (Followers: 1)
Journal of Social Issues     Hybrid Journal   (Followers: 18)
Journal of Social Philosophy     Hybrid Journal   (Followers: 22)
Journal of Social Science Education : JSSE     Open Access  
Journal of Social Science Studies     Open Access   (Followers: 12)
Journal of Social Sciences     Open Access   (Followers: 16)
Journal of Social Structure     Open Access  
Journal of Social Studies Research     Full-text available via subscription   (Followers: 14)
Journal of Studies in Social Sciences     Open Access   (Followers: 6)
Journal of Technology in Human Services     Hybrid Journal   (Followers: 4)
Journal of the Bangladesh Association of Young Researchers     Open Access   (Followers: 1)
Journal of the Polynesian Society     Full-text available via subscription   (Followers: 7)
Journal of the Society for Social Work and Research     Full-text available via subscription   (Followers: 11)
Journal of the University of Ruhuna     Open Access   (Followers: 1)
Journal of Transnational American Studies     Open Access   (Followers: 3)
Journal of Trust Management     Open Access   (Followers: 4)
Journal Sampurasun : Interdisciplinary Studies for Cultural Heritage     Open Access   (Followers: 1)
Jurnal Abdimas     Open Access  
Jurnal Ilmiah Ilmu Sosial     Open Access   (Followers: 1)
Jurnal Ilmu Sosial dan Humaniora     Open Access  
Jurnal Kawistara     Open Access  
Jurnal Masyarakat dan Budaya     Open Access  
Jurnal Pendidikan Ilmu Sosial     Open Access   (Followers: 1)
Jurnal Sosial Humaniora     Open Access   (Followers: 2)
Jurnal Teori dan Praksis Pembelajaran IPS     Open Access  
Jurnal Terapan Abdimas     Open Access  
Just Policy: A Journal of Australian Social Policy     Full-text available via subscription   (Followers: 14)
Kaleidoscope     Open Access  
Kasetsart Journal of Social Sciences     Open Access   (Followers: 2)
Kervan. International Journal of Afro-Asiatic Studies     Open Access  
Kimün. Revista Interdisciplinaria de Formación Docente     Open Access  
Kırklareli Üniversitesi Sosyal Bilimler Dergisi     Open Access  
Knowledge Management for Development Journal     Hybrid Journal   (Followers: 10)
Kontext : Zeitschrift für Systemische Therapie und Familientherapie     Hybrid Journal  
Korean Social Science Journal     Hybrid Journal   (Followers: 2)
Kotuitui : New Zealand Journal of Social Sciences Online     Open Access   (Followers: 16)
Kulturwissenschaftliche Zeitschrift     Open Access   (Followers: 2)
KZfSS Kölner Zeitschrift für Soziologie und Sozialpsychologie     Hybrid Journal   (Followers: 17)
L'Homme. Europäische Zeitschrift für Feministische Geschichtswissenschaft     Hybrid Journal   (Followers: 4)
L'Ordinaire des Amériques     Open Access  
La Tercera Orilla     Open Access  
Labyrinthe     Open Access  
Language and Intercultural Communication     Hybrid Journal   (Followers: 23)
Language Resources and Evaluation     Hybrid Journal   (Followers: 5)
Lavboratorio : Revista de Estudios sobre Cambio Estructural y Desigualdad Social.     Open Access  
Lectio Socialis     Open Access  
Les Cahiers des dix     Full-text available via subscription   (Followers: 1)
Les Cahiers d’EMAM     Open Access   (Followers: 1)
Letras Verdes. Revista Latinoamericana de Estudios Socioambientales     Open Access   (Followers: 2)
Letters on Evolutionary Behavioral Science     Open Access  
Lex Social : Revista de Derechos Sociales     Open Access   (Followers: 1)
Lilith: A Feminist History Journal     Full-text available via subscription   (Followers: 4)
Liminar. Estudios Sociales y Humanisticos     Open Access   (Followers: 3)
Literacy Learning: The Middle Years     Full-text available via subscription   (Followers: 6)
Local-Global: Identity, Security, Community     Full-text available via subscription   (Followers: 4)
Loisir et Société / Society and Leisure     Hybrid Journal   (Followers: 1)
Lucero     Open Access  
Lúdicamente     Open Access   (Followers: 1)
Lutas Sociais     Open Access   (Followers: 1)
Lwati : A Journal of Contemporary Research     Full-text available via subscription   (Followers: 2)
Macedon Digest, The     Full-text available via subscription   (Followers: 1)
Maine Policy Review     Open Access   (Followers: 2)
Maskana     Open Access  
Mathématiques et sciences humaines     Open Access   (Followers: 7)
Mayéutica Revista Científica de Humanidades y Artes     Open Access  
McNair Scholars Research Journal     Open Access  
McNair Scholars Research Journal     Open Access  
Meanjin     Full-text available via subscription   (Followers: 1)
Meanjin     Full-text available via subscription   (Followers: 3)
Meanjin Papers     Full-text available via subscription   (Followers: 2)
Media Information Australia     Full-text available via subscription  
Media International Australia     Hybrid Journal   (Followers: 2)
Media International Australia     Full-text available via subscription   (Followers: 1)
Melbourne Journal of Politics     Full-text available via subscription   (Followers: 1)
Mémoire(s), identité(s), marginalité(s) dans le monde occidental contemporain     Open Access   (Followers: 1)
Memorias     Open Access   (Followers: 1)
Meridional : Revista Chilena de Estudios Latinoamericanos     Open Access  
methaodos.revista de ciencias sociales     Open Access   (Followers: 3)
Methodological Innovations     Open Access  
Methods, Data, Analyses     Open Access  
México y la Cuenca del Pacífico     Open Access  
Middle East Journal of Culture and Communication     Hybrid Journal   (Followers: 6)
Migration Action     Full-text available via subscription   (Followers: 5)
Mikarimin. Revista Científica Multidisciplinaria     Open Access  
Miscelánea Comillas. Revista de Ciencias Humanas y Sociales     Open Access  
Misión Jurídica     Open Access  
Mitologicas     Open Access  
Módulo Arquitectura - CUC     Open Access  
Monthly, The     Full-text available via subscription   (Followers: 1)
Moving the Social : Journal of Social History and the History of Social Movements     Full-text available via subscription   (Followers: 2)
Mukaddime     Open Access  
Mütefekkir     Open Access  
Müvészettörténeti Értesitö     Full-text available via subscription   (Followers: 1)
National Academy Science Letters     Hybrid Journal   (Followers: 5)
National Emergency Response     Full-text available via subscription   (Followers: 3)
National Observer     Full-text available via subscription   (Followers: 1)
Navigations : A First-Year College Composite     Open Access  
New Left Review     Full-text available via subscription   (Followers: 24)
New Perspectives on Turkey     Full-text available via subscription   (Followers: 3)
New Zealand International Review     Full-text available via subscription   (Followers: 18)
Newsletter of the Gypsy Lore Society     Hybrid Journal   (Followers: 1)
Nineteenth-Century Contexts: An Interdisciplinary Journal     Hybrid Journal   (Followers: 17)
Noesis. Revista de Ciencias Sociales y Humanidades     Open Access   (Followers: 2)
Nómadas     Open Access   (Followers: 1)
Nómadas. Revista Crítica de Ciencias Sociales y Jurídicas     Open Access   (Followers: 2)
Nordic Journal of Social Research     Open Access  
Northeast African Studies     Full-text available via subscription   (Followers: 2)
Nouvelles perspectives en sciences sociales : revue internationale de systémique complexe et d'études relationnelles     Full-text available via subscription   (Followers: 1)
Novos Cadernos NAEA     Open Access   (Followers: 1)
Novos Estudos - CEBRAP     Open Access   (Followers: 1)
Observatorio Latinoamericano y Caribeño     Open Access  
Occasional Series in Criminal Justice and International Studies     Full-text available via subscription   (Followers: 4)
Oceania     Hybrid Journal   (Followers: 6)
OGIRISI : a New Journal of African Studies     Open Access   (Followers: 1)
Öneri Dergisi     Open Access  
Opcion     Open Access  
Open Cultural Studies     Open Access  
Open Journal of Social Sciences     Open Access   (Followers: 6)
Opticon1826     Open Access   (Followers: 2)
Orbis. Revista Cientifica Ciencias Humanas     Open Access   (Followers: 2)
Orbith : Majalah Ilmiah Pengembangan Rekayasa dan Sosial     Open Access   (Followers: 1)
Oregon Undergraduate Research Journal     Open Access   (Followers: 1)
Österreichische Zeitschrift für Soziologie     Hybrid Journal   (Followers: 2)
Outlines. Critical Practice Studies     Open Access   (Followers: 1)
Pacific Northwest Journal of Undergraduate Research and Creative Activities     Open Access  
Pacific Science Review B: Humanities and Social Sciences     Open Access  
Palgrave Communications     Open Access   (Followers: 4)
Palimpsesto : Revista Científica de Estudios Sociales Iberoamericanos     Open Access  
Pandora's Box     Full-text available via subscription  
Panggung     Open Access  
Panorama     Open Access  
Papeles de Europa     Open Access  
Papeles de Trabajo     Open Access  
Parity     Full-text available via subscription   (Followers: 4)
Pecvnia : Revista de la Facultad de Ciencias Económicas y Empresariales, Universidad de León     Open Access  
Península     Open Access  
Pensamento & Realidade. Revista do Programa de Estudos Pós-Graduados em Administração     Open Access  
People and Place     Full-text available via subscription   (Followers: 3)
People and Society (Mens & Maatschappij)     Full-text available via subscription   (Followers: 3)
Percurso Acadêmico     Open Access  
Perfiles Latinoamericanos     Open Access   (Followers: 1)
Periférica. Revista para el análisis de la cultura y el territorio     Open Access  
Periodica Polytechnica Social and Management Sciences     Open Access  
Persona y Bioetica     Open Access  
Personality and Social Psychology Bulletin     Hybrid Journal   (Followers: 177)

  First | 1 2 3 4 5     

Journal Cover
Language Resources and Evaluation
Journal Prestige (SJR): 0.193
Citation Impact (citeScore): 1
Number of Followers: 5  
 
  Hybrid Journal Hybrid journal (It can contain Open Access articles)
ISSN (Print) 1574-0218 - ISSN (Online) 1574-020X
Published by Springer-Verlag Homepage  [2352 journals]
  • A longitudinal database of Irish political speech with annotations of
           speaker ability
    • Authors: Ailbhe Cullen; Naomi Harte
      Pages: 401 - 432
      Abstract: This paper presents the Irish Political Speech Database, an English-language database collected from Irish political recordings. The database is collected with automated indexing and content retrieval in mind, and thus is gathered from real-world recordings (such as television interviews and election rallies) which represent the nature and quality of recordings which will be encountered in practical applications. The database is labelled for six speaker attributes: boring; charismatic; enthusiastic; inspiring; likeable; and persuasive. Each of these traits is linked to the perceived ability or appeal of the speaker, and as such are relevant to a range of content retrieval and speech analysis tasks. The six base attributes are combined to form a metric of Overall Speaker Appeal. A set of baseline experiments is presented, which demonstrate the potential of this database for affective computing studies. Classification accuracies of up to 76% are achieved, with little feature or system optimisation.
      PubDate: 2018-06-01
      DOI: 10.1007/s10579-017-9401-z
      Issue No: Vol. 52, No. 2 (2018)
       
  • A semi-automatic annotation tool for unobtrusive gesture analysis
    • Authors: Stijn De Beugher; Geert Brône; Toon Goedemé
      Pages: 433 - 460
      Abstract: In a variety of research fields, including linguistics, human–computer interaction research, psychology, sociology and behavioral studies, there is a growing interest in the role of gestural behavior related to speech and other modalities. The analysis of multimodal communication requires high-quality video data and detailed annotation of the different semiotic resources under scrutiny. In the majority of cases, the annotation of hand position, hand motion, gesture type, etc. is done manually, which is a time-consuming enterprise requiring multiple annotators and substantial resources. In this paper we present a semi-automatic alternative, in which the focus lies on minimizing the manual workload while guaranteeing highly accurate annotations. First, we discuss our approach, which consists of several processing steps such as identifying the hands in images, calculating motion of the hands, segmenting the recording in gesture and non-gesture events, etc. Second, we validate our approach against existing corpora in terms of accuracy and usefulness. The proposed approach is designed to provide annotations according to the McNeill (Hand and mind: what gestures reveal about thought, University of Chicago Press, Chicago, 1992) gesture space and the output is compatible with annotation tools such as ELAN or ANVIL.
      PubDate: 2018-06-01
      DOI: 10.1007/s10579-017-9404-9
      Issue No: Vol. 52, No. 2 (2018)
       
  • Creating a reference data set for the summarization of discussion forum
           threads
    • Authors: Suzan Verberne; Emiel Krahmer; Iris Hendrickx; Sander Wubben; Antal van den Bosch
      Pages: 461 - 483
      Abstract: In this paper we address extractive summarization of long threads in online discussion fora. We present an elaborate user evaluation study to determine human preferences in forum summarization and to create a reference data set. We showed long threads to ten different raters and asked them to create a summary by selecting the posts that they considered to be the most important for the thread. We study the agreement between human raters on the summarization task, and we show how multiple reference summaries can be combined to develop a successful model for automatic summarization. We found that although the inter-rater agreement for the summarization task was slight to fair, the automatic summarizer obtained reasonable results in terms of precision, recall, and ROUGE. Moreover, when human raters were asked to choose between the summary created by another human and the summary created by our model in a blind side-by-side comparison, they judged the model’s summary equal to or better than the human summary in over half of the cases. This shows that even for a summarization task with low inter-rater agreement, a model can be trained that generates sensible summaries. In addition, we investigated the potential for personalized summarization. However, the results for the three raters involved in this experiment were inconclusive. We release the reference summaries as a publicly available dataset.
      PubDate: 2018-06-01
      DOI: 10.1007/s10579-017-9389-4
      Issue No: Vol. 52, No. 2 (2018)
       
  • Real-word error correction with trigrams: correcting multiple errors in a
           sentence
    • Authors: Seyed MohammadSadegh Dashti
      Pages: 485 - 502
      Abstract: Spelling correction is a fundamental task in text mining. In this study, we assess the real-word error correction model proposed by Mays, Damerau and Mercer and describe several drawbacks of the model. We propose a new variation which focuses on detecting and correcting multiple real-word errors in a sentence, by manipulating a probabilistic context-free grammar to discriminate between items in the search space. We test our approach on the Wall Street Journal corpus and show that it outperforms Hirst and Budanitsky’s WordNet-based method and Wilcox-O’Hearn, Hirst, and Budanitsky’s fixed windows size method.
      PubDate: 2018-06-01
      DOI: 10.1007/s10579-017-9397-4
      Issue No: Vol. 52, No. 2 (2018)
       
  • Creation and evaluation of large keyphrase extraction collections with
           multiple opinions
    • Authors: Lucas Sterckx; Thomas Demeester; Johannes Deleu; Chris Develder
      Pages: 503 - 532
      Abstract: While several automatic keyphrase extraction (AKE) techniques have been developed and analyzed, there is little consensus on the definition of the task and a lack of overview of the effectiveness of different techniques. Proper evaluation of keyphrase extraction requires large test collections with multiple opinions, currently not available for research. In this paper, we (i) present a set of test collections derived from various sources with multiple annotations (which we also refer to as opinions in the remained of the paper) for each document, (ii) systematically evaluate keyphrase extraction using several supervised and unsupervised AKE techniques, (iii) and experimentally analyze the effects of disagreement on AKE evaluation. Our newly created set of test collections spans different types of topical content from general news and magazines, and is annotated with multiple annotations per article by a large annotator panel. Our annotator study shows that for a given document there seems to be a large disagreement on the preferred keyphrases, suggesting the need for multiple opinions per document. A first systematic evaluation of ranking and classification of keyphrases using both unsupervised and supervised AKE techniques on the test collections shows a superior effectiveness of supervised models, even for a low annotation effort and with basic positional and frequency features, and highlights the importance of a suitable keyphrase candidate generation approach. We also study the influence of multiple opinions, training data and document length on evaluation of keyphrase extraction. Our new test collection for keyphrase extraction is one of the largest of its kind and will be made available to stimulate future work to improve reliable evaluation of new keyphrase extractors.
      PubDate: 2018-06-01
      DOI: 10.1007/s10579-017-9395-6
      Issue No: Vol. 52, No. 2 (2018)
       
  • SFU Review SP -NEG: a Spanish corpus annotated with negation for sentiment
           analysis. A typology of negation patterns
    • Authors: Salud María Jiménez-Zafra; Mariona Taulé; M. Teresa Martín-Valdivia; L. Alfonso Ureña-López; M. Antónia Martí
      Pages: 533 - 569
      Abstract: In this paper, we present SFU ReviewSP-NEG, the first Spanish corpus annotated with negation with a wide coverage freely available. We describe the methodology applied in the annotation of the corpus including the tagset, the linguistic criteria and the inter-annotator agreement tests. We also include a complete typology of negation patterns in Spanish. This typology has the advantage that it is easy to express in terms of a tagset for corpus annotation: the types are clearly defined, which avoids ambiguity in the annotation process, and they provide wide coverage (i.e. they resolved all the cases occurring in the corpus). We use the SFU ReviewSP as a base in order to make the annotations. The corpus consists of 400 reviews, 221,866 words and 9455 sentences, out of which 3022 sentences contain at least one negation structure.
      PubDate: 2018-06-01
      DOI: 10.1007/s10579-017-9391-x
      Issue No: Vol. 52, No. 2 (2018)
       
  • A French clinical corpus with comprehensive semantic annotations:
           development of the Medical Entity and Relation LIMSI annOtated Text corpus
           (MERLOT)
    • Authors: Leonardo Campillos; Louise Deléger; Cyril Grouin; Thierry Hamon; Anne-Laure Ligozat; Aurélie Névéol
      Pages: 571 - 601
      Abstract: Quality annotated resources are essential for Natural Language Processing. The objective of this work is to present a corpus of clinical narratives in French annotated for linguistic, semantic and structural information, aimed at clinical information extraction. Six annotators contributed to the corpus annotation, using a comprehensive annotation scheme covering 21 entities, 11 attributes and 37 relations. All annotators trained on a small, common portion of the corpus before proceeding independently. An automatic tool was used to produce entity and attribute pre-annotations. About a tenth of the corpus was doubly annotated and annotation differences were resolved in consensus meetings. To ensure annotation consistency throughout the corpus, we devised harmonization tools to automatically identify annotation differences to be addressed to improve the overall corpus quality. The annotation project spanned over 24 months and resulted in a corpus comprising 500 documents (148,476 tokens) annotated with 44,740 entities and 26,478 relations. The average inter-annotator agreement is 0.793 F-measure for entities and 0.789 for relations. The performance of the pre-annotation tool for entities reached 0.814 F-measure when sufficient training data was available. The performance of our entity pre-annotation tool shows the value of the corpus to build and evaluate information extraction methods. In addition, we introduced harmonization methods that further improved the quality of annotations in the corpus.
      PubDate: 2018-06-01
      DOI: 10.1007/s10579-017-9382-y
      Issue No: Vol. 52, No. 2 (2018)
       
  • What’s missing in geographical parsing'
    • Authors: Milan Gritta; Mohammad Taher Pilehvar; Nut Limsopatham; Nigel Collier
      Pages: 603 - 623
      Abstract: Geographical data can be obtained by converting place names from free-format text into geographical coordinates. The ability to geo-locate events in textual reports represents a valuable source of information in many real-world applications such as emergency responses, real-time social media geographical event analysis, understanding location instructions in auto-response systems and more. However, geoparsing is still widely regarded as a challenge because of domain language diversity, place name ambiguity, metonymic language and limited leveraging of context as we show in our analysis. Results to date, whilst promising, are on laboratory data and unlike in wider NLP are often not cross-compared. In this study, we evaluate and analyse the performance of a number of leading geoparsers on a number of corpora and highlight the challenges in detail. We also publish an automatically geotagged Wikipedia corpus to alleviate the dearth of (open source) corpora in this domain.
      PubDate: 2018-06-01
      DOI: 10.1007/s10579-017-9385-8
      Issue No: Vol. 52, No. 2 (2018)
       
  • BLARK for multi-dialect languages: towards the Kurdish BLARK
    • Authors: Hossein Hassani
      Pages: 625 - 644
      Abstract: In this paper we introduce the Kurdish BLARK (Basic Language Resource Kit). The original BLARK has not considered multi-dialect characteristics and generally has targeted reasonably well-resourced languages. To consider these two features, we extended BLARK and applied the proposed extension to Kurdish. Kurdish language not only faces a paucity in resources, but also embraces several dialects within a complex linguistic context. This paper presents the Kurdish BLARK and shows that from Natural language processing and computational linguistics perspectives the revised BLARK provides a more applicable view of languages with similar characteristics to Kurdish.
      PubDate: 2018-06-01
      DOI: 10.1007/s10579-017-9400-0
      Issue No: Vol. 52, No. 2 (2018)
       
  • Spanish sentiment analysis in Twitter at the TASS workshop
    • Authors: Ferran Pla; Lluís-F. Hurtado
      Pages: 645 - 672
      Abstract: This paper describes a support vector machine-based approach to different tasks related to sentiment analysis in Twitter for Spanish. We focus on parameter optimization of the models and the combination of several models by means of voting techniques. We evaluate the proposed approach in all the tasks that were defined in the five editions of the TASS workshop, between 2012 and 2016. TASS has become a framework for sentiment analysis tasks that are focused on the Spanish language. We describe our participation in this competition and the results achieved, and then we provide an analysis of and comparison with the best approaches of the teams who participated in all the tasks defined in the TASS workshops. To our knowledge, our results exceed those published to date in the sentiment analysis tasks of the TASS workshops.
      PubDate: 2018-06-01
      DOI: 10.1007/s10579-017-9394-7
      Issue No: Vol. 52, No. 2 (2018)
       
  • Creation of an annotated corpus of Old and Middle Hungarian court records
           and private correspondence
    • Authors: Attila Novák; Katalin Gugán; Mónika Varga; Adrienne Dömötör
      Pages: 1 - 28
      Abstract: The paper introduces a novel annotated corpus of Old and Middle Hungarian (16–18 century), the texts of which were selected in order to approximate the vernacular of the given historical periods as closely as possible. The corpus consists of testimonies of witnesses in trials and samples of private correspondence. The texts are not only analyzed morphologically, but each file contains metadata that would also facilitate sociolinguistic research. The texts were segmented into clauses, manually normalized and morphosyntactically annotated using an annotation system consisting of the PurePos PoS tagger and the Hungarian morphological analyzer HuMor originally developed for Modern Hungarian but adapted to analyze Old and Middle Hungarian morphological constructions. The automatically disambiguated morphological annotation was manually checked and corrected using an easy-to-use web-based manual disambiguation interface. The normalization process and the manual validation of the annotation required extensive teamwork and provided continuous feedback for the refinement of the computational morphology and iterative retraining of the statistical models of the tagger. The paper discusses some of the typical problems that occurred during the normalization procedure and their tentative solutions. Besides, we also describe the automatic annotation tools, the process of semi-automatic disambiguation, and the query interface, a special function of which also makes correction of the annotation possible. Displaying the original, the normalized and the parsed versions of the selected texts, the beta version of the first fully normalized and annotated historical corpus of Hungarian is freely accessible at the address http://tmk.nytud.hu/.
      PubDate: 2018-03-01
      DOI: 10.1007/s10579-017-9393-8
      Issue No: Vol. 52, No. 1 (2018)
       
  • The PROIEL treebank family: a standard for early attestations of
           Indo-European languages
    • Authors: Hanne Eckhoff; Kristin Bech; Gerlof Bouma; Kristine Eide; Dag Haug; Odd Einar Haugen; Marius Jøhndal
      Pages: 29 - 65
      Abstract: This article describes a family of dependency treebanks of early attestations of Indo-European languages originating in the parallel treebank built by the members of the project pragmatic resources in old Indo-European languages. The treebanks all share a set of open-source software tools, including a web annotation interface, and a set of annotation schemes and guidelines developed especially for the project languages. The treebanks use an enriched dependency grammar scheme complemented by detailed morphological tags, which have proved sufficient to give detailed descriptions of these richly inflected languages, and which have been easy to adapt to new languages. We describe the tools and annotation schemes and discuss some challenges posed by the various languages that have been annotated. We also discuss problems with tokenisation, sentence division and lemmatisation, commonly encountered in ancient and mediaeval texts, and challenges associated with low levels of standardisation and ongoing morphological and syntactic change.
      PubDate: 2018-03-01
      DOI: 10.1007/s10579-017-9388-5
      Issue No: Vol. 52, No. 1 (2018)
       
  • RST Signalling Corpus: a corpus of signals of coherence relations
    • Authors: Debopam Das; Maite Taboada
      Pages: 149 - 184
      Abstract: We present the RST Signalling Corpus (Das et al. in RST signalling corpus, LDC2015T10. https://catalog.ldc.upenn.edu/LDC2015T10, 2015), a corpus annotated for signals of coherence relations. The corpus is developed over the RST Discourse Treebank (Carlson et al. in RST Discourse Treebank, LDC2002T07. https://catalog.ldc.upenn.edu/LDC2002T07, 2002) which is annotated for coherence relations. In the RST Signalling Corpus, these relations are further annotated with signalling information. The corpus includes annotation not only for discourse markers which are considered to be the most typical (or sometimes the only type of) signals in discourse, but also for a wide array of other signals such as reference, lexical, semantic, syntactic, graphical and genre features as potential indicators of coherence relations. We describe the research underlying the development of the corpus and the annotation process, and provide details of the corpus. We also present the results of an inter-annotator agreement study, illustrating the validity and reproducibility of the annotation. The corpus is available through the Linguistic Data Consortium, and can be used to investigate the psycholinguistic mechanisms behind the interpretation of relations through signalling, and also to develop discourse-specific computational systems such as discourse parsing applications.
      PubDate: 2018-03-01
      DOI: 10.1007/s10579-017-9383-x
      Issue No: Vol. 52, No. 1 (2018)
       
  • A flexible text analyzer based on ontologies: an application for detecting
           discriminatory language
    • Authors: Alberto Salguero; Macarena Espinilla
      Pages: 185 - 215
      Abstract: Language can be a tool to marginalize certain groups due to the fact that it may reflect a negative mentality caused by mental barriers or historical delays. In order to prevent misuse of language, several agents have carried out campaigns against discriminatory language, criticizing the use of some terms and phrases. However, there is an important gap in detecting discriminatory text in documents because language is very flexible and, usually, contains hidden features or relations. Furthermore, the adaptation of approaches and methodologies proposed in the literature for text analysis is complex due to the fact that these proposals are too rigid to be adapted to different purposes for which they were intended. The main novelty of the methodology is the use of ontologies to implement the rules that are used by the developed text analyzer, providing a great flexibility for the development of text analyzers and exploiting the ability to infer knowledge of the ontologies. A set of rules for detecting discriminatory language relevant to gender and people with disabilities is also presented in order to show how to extend the functionality of the text analyzer to different discriminatory text areas.
      PubDate: 2018-03-01
      DOI: 10.1007/s10579-017-9387-6
      Issue No: Vol. 52, No. 1 (2018)
       
  • Cross-language transfer of semantic annotation via targeted crowdsourcing:
           task design and evaluation
    • Authors: Evgeny A. Stepanov; Shammur Absar Chowdhury; Ali Orkan Bayer; Arindam Ghosh; Ioannis Klasinas; Marcos Calvo; Emilio Sanchis; Giuseppe Riccardi
      Pages: 341 - 364
      Abstract: Modern data-driven spoken language systems (SLS) require manual semantic annotation for training spoken language understanding parsers. Multilingual porting of SLS demands significant manual effort and language resources, as this manual annotation has to be replicated. Crowdsourcing is an accessible and cost-effective alternative to traditional methods of collecting and annotating data. The application of crowdsourcing to simple tasks has been well investigated. However, complex tasks, like cross-language semantic annotation transfer, may generate low judgment agreement and/or poor performance. The most serious issue in cross-language porting is the absence of reference annotations in the target language; thus, crowd quality control and the evaluation of the collected annotations is difficult. In this paper we investigate targeted crowdsourcing for semantic annotation transfer that delegates to crowds a complex task such as segmenting and labeling of concepts taken from a domain ontology; and evaluation using source language annotation. To test the applicability and effectiveness of the crowdsourced annotation transfer we have considered the case of close and distant language pairs: Italian–Spanish and Italian–Greek. The corpora annotated via crowdsourcing are evaluated against source and target language expert annotations. We demonstrate that the two evaluation references (source and target) highly correlate with each other; thus, drastically reduce the need for the target language reference annotations.
      PubDate: 2018-03-01
      DOI: 10.1007/s10579-017-9396-5
      Issue No: Vol. 52, No. 1 (2018)
       
  • SlangSD: building, expanding and using a sentiment dictionary of slang
           words for short-text sentiment classification
    • Authors: Liang Wu; Fred Morstatter; Huan Liu
      Abstract: Sentiment information about social media posts is increasingly considered an important resource for customer segmentation, market understanding, and tackling other socio-economic issues. However, sentiment in social media is difficult to measure since user-generated content is usually short and informal. Although many traditional sentiment analysis methods have been proposed, identifying slang sentiment words remains a challenging task for practitioners. Though some slang words are available in existing sentiment lexicons, with new slang being generated with emerging memes, a dedicated lexicon will be useful for researchers and practitioners. To this end, we propose to build a slang sentiment dictionary to aid sentiment analysis. It is laborious and time-consuming to collect a comprehensive list of slang words and label the sentiment polarity. We present an approach to leverage web resources to construct a Slang Sentiment Dictionary (SlangSD) that is easy to expand. SlangSD is publicly available for research purposes. We empirically show the advantages of using SlangSD, the newly-built slang sentiment word dictionary for sentiment classification, and provide examples demonstrating its ease of use with a sentiment analysis system.
      PubDate: 2018-05-18
      DOI: 10.1007/s10579-018-9416-0
       
  • A comparison of graph-based word sense induction clustering algorithms in
           a pseudoword evaluation framework
    • Authors: Flavio Massimiliano Cecchini; Martin Riedl; Elisabetta Fersini; Chris Biemann
      Abstract: This article presents a comparison of different Word Sense Induction (wsi) clustering algorithms on two novel pseudoword data sets of semantic-similarity and co-occurrence-based word graphs, with a special focus on the detection of homonymic polysemy. We follow the original definition of a pseudoword as the combination of two monosemous terms and their contexts to simulate a polysemous word. The evaluation is performed comparing the algorithm’s output on a pseudoword’s ego word graph (i.e., a graph that represents the pseudoword’s context in the corpus) with the known subdivision given by the components corresponding to the monosemous source words forming the pseudoword. The main contribution of this article is to present a self-sufficient pseudoword-based evaluation framework for wsi graph-based clustering algorithms, thereby defining a new evaluation measure (top2) and a secondary clustering process (hyperclustering). To our knowledge, we are the first to conduct and discuss a large-scale systematic pseudoword evaluation targeting the induction of coarse-grained homonymous word senses across a large number of graph clustering algorithms.
      PubDate: 2018-03-24
      DOI: 10.1007/s10579-018-9415-1
       
  • TermFinder: log-likelihood comparison and phrase-based statistical machine
           translation models for bilingual terminology extraction
    • Authors: Rejwanul Haque; Sergio Penkale; Andy Way
      Abstract: Bilingual termbanks are important for many natural language processing applications, especially in translation workflows in industrial settings. In this paper, we apply a log-likelihood comparison method to extract monolingual terminology from the source and target sides of a parallel corpus. The initial candidate terminology list is prepared by taking all arbitrary n-gram word sequences from the corpus. Then, a well-known statistical measure (the Dice coefficient) is employed in order to remove any multi-word terms with weak associations from the candidate term list. Thereafter, the log-likelihood comparison method is applied to rank the phrasal candidate term list. Then, using a phrase-based statistical machine translation model, we create a bilingual terminology with the extracted monolingual term lists. We integrate an external knowledge source—the Wikipedia cross-language link databases—into the terminology extraction (TE) model to assist two processes: (a) the ranking of the extracted terminology list, and (b) the selection of appropriate target terms for a source term. First, we report the performance of our monolingual TE model compared to a number of the state-of-the-art TE models on English-to-Turkish and English-to-Hindi data sets. Then, we evaluate our novel bilingual TE model on an English-to-Turkish data set, and report the automatic evaluation results. We also manually evaluate our novel TE model on English-to-Spanish and English-to-Hindi data sets, and observe excellent performance for all domains.
      PubDate: 2018-02-03
      DOI: 10.1007/s10579-018-9412-4
       
  • The corpus of Basque simplified texts (CBST)
    • Authors: Itziar Gonzalez-Dios; María Jesús Aranzabe; Arantza Díaz de Ilarraza
      Abstract: In this paper we present the corpus of Basque simplified texts. This corpus compiles 227 original sentences of science popularisation domain and two simplified versions of each sentence. The simplified versions have been created following different approaches: the structural, by a court translator who considers easy-to-read guidelines and the intuitive, by a teacher based on her experience. The aim of this corpus is to make a comparative analysis of simplified text. To that end, we also present the annotation scheme we have created to annotate the corpus. The annotation scheme is divided into eight macro-operations: delete, merge, split, transformation, insert, reordering, no operation and other. These macro-operations can be classified into different operations. We also relate our work and results to other languages. This corpus will be used to corroborate the decisions taken and to improve the design of the automatic text simplification system for Basque.
      PubDate: 2017-11-18
      DOI: 10.1007/s10579-017-9407-6
       
  • The challenging task of summary evaluation: an overview
    • Authors: Elena Lloret; Laura Plaza; Ahmet Aker
      Abstract: Evaluation is crucial in the research and development of automatic summarization applications, in order to determine the appropriateness of a summary based on different criteria, such as the content it contains, and the way it is presented. To perform an adequate evaluation is of great relevance to ensure that automatic summaries can be useful for the context and/or application they are generated for. To this end, researchers must be aware of the evaluation metrics, approaches, and datasets that are available, in order to decide which of them would be the most suitable to use, or to be able to propose new ones, overcoming the possible limitations that existing methods may present. In this article, a critical and historical analysis of evaluation metrics, methods, and datasets for automatic summarization systems is presented, where the strengths and weaknesses of evaluation efforts are discussed and the major challenges to solve are identified. Therefore, a clear up-to-date overview of the evolution and progress of summarization evaluation is provided, giving the reader useful insights into the past, present and latest trends in the automatic evaluation of summaries.
      PubDate: 2017-09-02
      DOI: 10.1007/s10579-017-9399-2
       
 
 
JournalTOCs
School of Mathematical and Computer Sciences
Heriot-Watt University
Edinburgh, EH14 4AS, UK
Email: journaltocs@hw.ac.uk
Tel: +00 44 (0)131 4513762
Fax: +00 44 (0)131 4513327
 
Home (Search)
Subjects A-Z
Publishers A-Z
Customise
APIs
Your IP address: 54.82.99.169
 
About JournalTOCs
API
Help
News (blog, publications)
JournalTOCs on Twitter   JournalTOCs on Facebook

JournalTOCs © 2009-