Search results for: out-of-corpora

Out of Corpora

Author : Hilde Hasselgård
File Size : 46.3 MB
Format : PDF, Mobi
Download : 316
Read : 773
Main headings: Introduction. - I. Representing language use. - II. Grammar and lexis in English corpora. - III. Contrastive and translation studies. - IV. English abroad. - List of Stig Johansson's publications (selection).

Building and Using Comparable Corpora

Author : Serge Sharoff
File Size : 54.90 MB
Format : PDF, ePub, Mobi
Download : 283
Read : 235
The 1990s saw a paradigm change in the use of corpus-driven methods in NLP. In the field of multilingual NLP (such as machine translation and terminology mining) this implied the use of parallel corpora. However, parallel resources are relatively scarce: many more texts are produced daily by native speakers of any given language than translated. This situation resulted in a natural drive towards the use of comparable corpora, i.e. non-parallel texts in the same domain or genre. Nevertheless, this research direction has not produced a single authoritative source suitable for researchers and students coming to the field. The proposed volume provides a reference source, identifying the state of the art in the field as well as future trends. The book is intended for specialists and students in natural language processing, machine translation and computer-assisted translation.

Corpora in Applied Linguistics

Author : Susan Hunston
File Size : 30.59 MB
Format : PDF, ePub, Mobi
Download : 819
Read : 322

Incorporating Corpora

Author : Gunilla M. Anderman
File Size : 60.3 MB
Format : PDF, Mobi
Download : 424
Read : 806
Covering a number of European languages from Portuguese to Hungarian, this volume includes many new studies of translation patterns using parallel corpora focusing on particular linguistic features, as well as broader-ranging contributions on translation 'universals'.

Corpora for University Language Teachers

Author : Carol Taylor Torsello
File Size : 31.6 MB
Format : PDF, ePub, Docs
Download : 107
Read : 1117
This volume is made up of 17 chapters which have developed out of papers and workshop sessions presented at the event entitled «Corpora: Seminar and Workshops», held at the University of Padua, March 29-31, 2007. It maintains the straightforward, practical approach which characterized that event, meant as an introduction to the use of corpora even for novices. At the same time it goes into a wide range of different applications for corpora in language teaching and language research in higher education. One of these involves the creation and use of learner corpora. Another application involves corpus-assisted research into political discourse in the media. Language for special purposes is also focussed on as a research topic, an academic discipline, and language to be translated. Multimodal corpora are also considered. Proposals are made for corpus-based research into the language of films, and into translation (and mediation) universals. A corpus-based study of text complexity in reading tests is also presented. Large-scale corpora commercially available are also discussed. An online module for translator training is presented, as is an Internet-accessible corpus of Old English poetry.

Doing Corpus Linguistics

Author : William Crawford
File Size : 38.91 MB
Format : PDF, ePub, Docs
Download : 971
Read : 556
Doing Corpus Linguistics offers a practical step-by-step introduction to corpus linguistics, making use of widely available corpora and of a register analysis-based theoretical framework to provide students in Applied Linguistics and TESOL with the understanding and skills necessary to meaningfully analyze corpora and carry out successful corpus-based research. Divided into three parts (Introduction to Doing Corpus Linguistics and Register Analysis; Searches in Available Corpora; and Building Your Own Corpus, Analyzing Your Quantitative Results, and Making Sense of Data), the book emphasizes hands-on experience with performing language analysis research and interpreting findings in a meaningful and engaging way. Readers are given multiple opportunities to analyze and apply language data by completing smaller tasks and corpus projects using publicly available corpora. The book also takes readers through the process of building a specialized corpus designed to answer a specific research question and provides detailed information on completing a final research project that includes both a written paper and an oral presentation. Doing Corpus Linguistics gives students in applied linguistics and TESOL the opportunity to gain proficiency in the technical and interpretive aspects of corpus research and encourages them to participate in the growing field of corpus linguistics.

Multilingual Corpora and Multilingual Corpus Analysis

Author : Thomas Schmidt
File Size : 51.70 MB
Format : PDF, ePub
Download : 266
Read : 1238
This volume deals with different aspects of the creation and use of multilingual corpora. The term 'multilingual corpus' is understood in a comprehensive sense, meaning any systematic collection of empirical language data enabling linguists to carry out analyses of multilingual individuals, multilingual societies or multilingual communication. The individual contributions are thus concerned with a variety of spoken and written corpora ranging from learner and attrition corpora, language contact corpora and interpreting corpora to comparable and parallel corpora. The overarching aim of the volume is first to take stock of the variety of existing multilingual corpora, documenting possible corpus designs and uses, second to discuss methodological and technological challenges in the creation and analysis of multilingual corpora, and third to provide examples of linguistic analyses that were carried out on the basis of multilingual corpora.

Journals of the House of Lords

Author : Great Britain. Parliament. House of Lords
File Size : 33.11 MB
Format : PDF, Kindle
Download : 240
Read : 989
Appendices accompany vols. 64, 67-71.

Corpora in Translation

Author : Tengku Sepora Tengku Mahadi
File Size : 24.12 MB
Format : PDF, Docs
Download : 560
Read : 381
Corpora are among the hottest issues in translation studies affecting both pure and applied realms of the discipline. As for pure translation studies, corpora have done their part through contributions to the studies on translational language and translation universals. Yet, their recent contribution is within the borders of applied translation studies, i.e. translator training and translation aids. The former is the major focus of the present book. The present book in fact aims at providing readers with comprehensive information about corpora in translation studies in general, and corpora in translator education in particular. It further offers researchers and practitioners a comprehensive and up-to-date survey of studies done on corpora in translator education and provides a rich source of information on pros and cons of using different types of corpora as translation aids in the context of translation classrooms.

Translation-Driven Corpora

Author : Federico Zanettin
File Size : 90.99 MB
Format : PDF, Docs
Download : 902
Read : 738
Electronic texts and text analysis tools have opened up a wealth of opportunities to higher education and language service providers, but learning to use these resources continues to pose challenges to scholars and professionals alike. Translation-Driven Corpora aims to introduce readers to corpus tools and methods which may be used in translation research and practice. Each chapter focuses on specific aspects of corpus creation and use. An introduction to corpora and overview of applications of corpus linguistics methodologies to translation studies is followed by a discussion of corpus design and acquisition. Different stages and tools involved in corpus compilation and use are outlined, from corpus encoding and annotation to indexing and data retrieval, and the various methods and techniques that allow end users to make sense of corpus data are described. The volume also offers detailed guidelines for the construction and analysis of multilingual corpora. Corpus creation and use are illustrated through practical examples and case studies, with each chapter outlining a set of tasks aimed at guiding researchers, students and translators to practice some of the methods and use some of the resources discussed. These tasks are meant as hands-on activities to be carried out using the materials and links available in an accompanying DVD. Suggested further readings at the end of each chapter are complemented by an extensive bibliography at the end of the volume. Translation-Driven Corpora is designed for use by teachers and students in the classroom or by researchers and professionals for self-learning. It is an invaluable resource for anyone interested in this fast growing area of scholarly and professional activity.

Corpora in the Foreign Language Classroom

Author :
File Size : 22.24 MB
Format : PDF, ePub
Download : 972
Read : 654
The papers published in this volume were originally presented at the Sixth International Conference on Teaching and Language Corpora (4-7 July 2004, Granada, Spain) and reflect the latest developments in the teaching applications of text corpora, with a special emphasis on their use in the foreign language classroom. The book is divided into three main sections. The first section sets the scene for what this collection of essays aims to be. It deals with the issue of what corpus linguistics can do not only for the understanding of the nature of language itself but also for so fundamental and miraculous a matter as language learning and language acquisition. The second section tackles the issues of corpus design and corpus exploitation and provides the reader with a great variety of evidence in favour of exploiting corpora to build a successful teaching environment. The final section deals with practical applications of corpora in the foreign language classroom. Although each of the papers here reports particular experiences in very different teaching and learning contexts, as a whole they show that corpora can be used on the spot in a language teaching context by teachers and learners without extensive training in computational tools, and that studies of linguistic features can be tailored to specific pedagogic contexts and learning requirements. The book represents a solid contribution to linguistic studies and language teaching, and it is a good example of the diversity of the research lines in which corpus linguistics is currently involved.

Using Corpora to Learn about Language and Discourse

Author : Linda Lombardo
File Size : 54.47 MB
Format : PDF
Download : 125
Read : 796
Considerable progress has been made in the use of corpora for research purposes to describe language in use, and more recently, through a CADS (corpus assisted discourse studies) approach, to identify the discourse features of specific text genres. While the potential benefits of working with corpora in the classroom have been recognised, there has been a lag in the promulgation of guidelines for carrying out meaningful corpus work with language learners and teachers in mind. The papers in this volume aim to make a contribution toward filling that gap by providing an in-depth account of innovative corpus work, most of which has actually been carried out with real learners in the classroom. Authors provide valuable insights into ways of structuring corpus work for specific target learners, as well as suggestions for resolving problematic issues that have arisen and avoiding errors that have been made with learners and in their own research and experimentation. The transparency and honesty with which they present their methodology and results, along with the successful techniques they have developed, constitute a step forward in defining good (and bad) practice in the use of corpora in learning.

Machine Learning in Translation Corpora Processing

Author : Krzysztof Wolk
File Size : 61.17 MB
Format : PDF, Docs
Download : 777
Read : 1309
This book reviews ways to improve statistical machine speech translation between Polish and English. Research has been conducted mostly on dictionary-based, rule-based, and syntax-based machine translation techniques. Most popular methodologies and tools are not well-suited for the Polish language and therefore require adaptation, and language resources are lacking in parallel and monolingual data. The main objective of this volume is to develop an automatic and robust Polish-to-English translation system to meet specific translation requirements and to develop bilingual textual resources by mining comparable corpora.

How to Use Corpora in Language Teaching

Author : John McH. Sinclair
File Size : 77.93 MB
Format : PDF, Mobi
Download : 932
Read : 940
After decades of being overlooked, corpus evidence is becoming an important component of the teaching and learning of languages. Above all, the profession needs guidance in the practicalities of using corpora, interpreting the results and applying them to the problems and opportunities of the classroom. This book is intensely practical, written mainly by a new generation of language teachers who are acknowledged experts in central aspects of the discipline. It offers advice on what to do in the classroom, how to cope with teachers' queries about language, what corpora to use (including learner corpora and spoken corpora), and how to handle the variability of language; it reports on some current research and explains how the access software is constructed, including an opportunity for the practitioner to write small but useful programs; and it takes a look into the future of corpora in language teaching.

Creation and Use of Historical English Corpora in Spain

Author : Nila Vázquez
File Size : 44.37 MB
Format : PDF
Download : 571
Read : 1204
Even before the Helsinki Corpus was published, Spain had a good number of Historical English researchers, such as the group directed by Teresa Fanego in Santiago de Compostela. In the last couple of decades, the number of scholars working in the field of Historical Corpus Linguistics has increased, and there are now some interesting projects in Spain that will result in the publication of valuable material for scholars throughout the world. The aim of this volume is twofold. On the on...

Corpora Galore

Author : John M. Kirk
File Size : 55.22 MB
Format : PDF
Download : 195
Read : 710

Corpora and Discourse Studies

Author : Anthony McEnery
File Size : 84.61 MB
Format : PDF, ePub, Docs
Download : 272
Read : 986
This edited collection brings together contemporary research that uses corpus linguistics to carry out discourse analysis. The book takes an inclusive view of the meaning of discourse, covering different text-types or modes of language, including discourse as both social practice and as ideology or representation.

Corpora in Cognitive Linguistics

Author : Stefan Th. Gries
File Size : 50.42 MB
Format : PDF, ePub, Mobi
Download : 770
Read : 244
Cognitive Linguistics, the branch of linguistics that tries to "make one's account of human language accord with what is generally known about the mind and the brain," has become one of the most flourishing fields of contemporary linguistics. The chapters address many classic topics of Cognitive Linguistics. These topics include studies on the semantics of specific words (including polysemy and synonymy) as well as semantic characteristics of particular syntactic patterns / constructions (including constructional synonymy and the schematicity of constructions), the analysis of causatives, transitivity, and image-schematic aspects of posture verbs. The key characteristic of this volume is that all papers adopt the methodological perspective of Corpus Linguistics, the rapidly evolving branch of linguistics based on the computerized analysis of language used in authentic settings. Thus, the contributions not only provide new insights in their respective fields but also introduce new data as well as new corpus-based and quantitative methods of analysis. On the basis of their findings, the authors discuss theoretical implications going well beyond the individual topics of the studies and show how the discipline of Cognitive Linguistics can benefit from the rigorous analysis of naturally-occurring language. The languages investigated are English, German, Dutch, and Russian, and the data come from a variety of corpora. As such, the present volume will be of interest to a wide range of scholars with many different foci and interests and should pave the way for further integration of usage-based techniques of analysis within this exciting paradigm.

Best Practices for Spoken Corpora in Linguistic Research

Author : Şükriye Ruhi
File Size : 68.5 MB
Format : PDF, Mobi
Download : 378
Read : 833
A key concern of researchers involved in the creation and sharing of language resources is to attain maximum usability, reliability and longevity of these resources for present and future researchers in the language sciences. The view developed in this volume is that spoken corpora construction and sharing are major research endeavours that should also be laid open to academic debate in a manner that is more visible than is currently the case in corpus linguistics. The present volume brings multiple research perspectives to bear on the question of what constitutes best practices for the construction of spoken corpora. The book brings into closer contact scholars whose specializations have often remained in relatively different streams of scientific investigation; that is, scholars whose work falls primarily in conversation analysis, pragmatics and discourse analysis, but who are involved in spoken corpus compilation, on the one hand, and scholars who also specialize in linguistics but who have been intensively involved in developing various infrastructures for spoken corpora, on the other hand. This combination of scholars brings into better relief the concerns of data providers, data curators and data users in linguistic research. This book is thus unique in that it highlights best practices both from the perspective of assembling, annotating and linguistically analysing spoken corpora and from the perspective of processing, archiving and disseminating spoken language. In doing so, the contributions emphasise not only the considerable promise offered by the rapid technological changes that society continues to experience in this area, but also possible dangers for the unwary.