Skip to main content

Sale until 1 Feb: Up to 30% off selected books.

Walter de Gruyter GmbH

Linguistic Corpora and Big Data in Spanish and Portuguese

No reviews yet
Product Code: 9783110781458
ISBN13: 9783110781458
Condition: New
$132.99
In recent decades, corpus linguistics has experienced tremendous development in the Hispanic world, along two opposite but complementary approaches: increase in corpus size (corpus linguistics as Big Data) and improvement in document selection and data annotation (corpus linguistics as High Quality Data). The first approach has led to the creation of massive corpora such as EsTenTen; at the same time, it has promoted the use of the web and social networks as corpora. The second perspective gives rise to specialized corpora such as Post Scriptum or Oralia Diacr?ica del espa?l (ODE). The contributions gathered in this volume combine both methods in order to exploit their advantages and to overcome their possible limitations. On the one hand, it addresses the creation and design of small corpora focused on data quality; on the other hand, it offers case studies that make use of both specialized corpora and massive data extracted from the web. Highlighting the complementary nature of both methods is the main idea of this book.


Author: Miguel Calder? Campos
Publisher: Walter de Gruyter GmbH
Publication Date: Oct 21, 2024
Number of Pages: NA pages
Language: English
Binding: Hardcover
ISBN-10: 311078145X
ISBN-13: 9783110781458

Linguistic Corpora and Big Data in Spanish and Portuguese

$132.99
 
In recent decades, corpus linguistics has experienced tremendous development in the Hispanic world, along two opposite but complementary approaches: increase in corpus size (corpus linguistics as Big Data) and improvement in document selection and data annotation (corpus linguistics as High Quality Data). The first approach has led to the creation of massive corpora such as EsTenTen; at the same time, it has promoted the use of the web and social networks as corpora. The second perspective gives rise to specialized corpora such as Post Scriptum or Oralia Diacr?ica del espa?l (ODE). The contributions gathered in this volume combine both methods in order to exploit their advantages and to overcome their possible limitations. On the one hand, it addresses the creation and design of small corpora focused on data quality; on the other hand, it offers case studies that make use of both specialized corpora and massive data extracted from the web. Highlighting the complementary nature of both methods is the main idea of this book.


Author: Miguel Calder? Campos
Publisher: Walter de Gruyter GmbH
Publication Date: Oct 21, 2024
Number of Pages: NA pages
Language: English
Binding: Hardcover
ISBN-10: 311078145X
ISBN-13: 9783110781458
 

Customer Reviews

This product hasn't received any reviews yet. Be the first to review this product!

Faster Shipping

Delivery in 3-8 days

Easy Returns

14 days returns

Discount upto 30%

Monthly discount on books

Outstanding Customer Service

Support 24 hours a day