Computational linguistics
Part of a series on |
Linguistics |
---|
Portal |
Computational linguistics is an
Origins
The field overlapped with artificial intelligence since the efforts in the United States in the 1950s to use computers to automatically translate texts from foreign languages, particularly Russian scientific journals, into English.[1] Since rule-based approaches were able to make arithmetic (systematic) calculations much faster and more accurately than humans, it was expected that lexicon, morphology, syntax and semantics can be learned using explicit rules, as well. After the failure of rule-based approaches, David Hays[2] coined the term in order to distinguish the field from AI and co-founded both the Association for Computational Linguistics (ACL) and the International Committee on Computational Linguistics (ICCL) in the 1970s and 1980s. What started as an effort to translate between languages evolved into a much wider field of natural language processing.[3][4]
Annotated corpora
In order to be able to meticulously study the
Japanese sentence corpora were analyzed and a pattern of
Modeling language acquisition
The fact that during language acquisition, children are largely only exposed to positive evidence,[8] meaning that the only evidence for what is a correct form is provided, and no evidence for what is not correct,[9] was a limitation for the models at the time because the now available deep learning models were not available in late 1980s.[10]
It has been shown that languages can be learned with a combination of simple input presented incrementally as the child develops better memory and longer attention span,[11] which explained the long period of language acquisition in human infants and children.[11]
Robots have been used to test linguistic theories.[12] Enabled to learn as children might, models were created based on an affordance model in which mappings between actions, perceptions, and effects were created and linked to spoken words. Crucially, these robots were able to acquire functioning word-to-meaning mappings without needing grammatical structure.
Using the
Chomsky's theories
Chomsky's theories have influenced computational linguistics, particularly in understanding how infants learn complex grammatical structures, such as those described in Chomsky normal form.[14] Attempts have been made to determine how an infant learns a "non-normal grammar" as theorized by Chomsky normal form.[9] Research in this area combines structural approaches with computational models to analyze large linguistic corpora like the Penn Treebank, helping to uncover patterns in language acquisition.[15]
See also
- Artificial intelligence in fiction
- Collostructional analysis
- Computational lexicology
- Computational Linguistics (journal)
- Computational models of language acquisition
- Computational semantics
- Computational semiotics
- Computer-assisted reviewing
- Dialog systems
- Glottochronology
- Grammar induction
- Human speechome project
- Internet linguistics
- Lexicostatistics
- Natural language processing
- Natural language user interface
- Quantitative linguistics
- Semantic relatedness
- Semantometrics
- Systemic functional linguistics
- Translation memory
- Universal Networking Language
References
- ^ John Hutchins: Retrospect and prospect in computer-based translation. Archived 2008-04-14 at the Wayback Machine Proceedings of MT Summit VII, 1999, pp. 30–44.
- ^ "Deceased members". ICCL members. Archived from the original on 17 May 2017. Retrieved 15 November 2017.
- ^ Natural Language Processing by Liz Liddy, Eduard Hovy, Jimmy Lin, John Prager, Dragomir Radev, Lucy Vanderwende, Ralph Weischedel
- ^ Arnold B. Barach: Translating Machine 1975: And the Changes To Come.
- ^ Marcus, M. & Marcinkiewicz, M. (1993). "Building a large annotated corpus of English: The Penn Treebank" (PDF). Computational Linguistics. 19 (2): 313–330. Archived (PDF) from the original on 2022-10-09.
- ^ Taylor, Ann (2003). "1". Treebanks. Spring Netherlands. pp. 5–22.
- .
- ^ Bowerman, M. (1988). The "no negative evidence" problem: How do children avoid constructing an overly general grammar. Explaining language universals.
- ^ a b Braine, M.D.S. (1971). On two types of models of the internalization of grammars. In D.I. Slobin (Ed.), The ontogenesis of grammar: A theoretical perspective. New York: Academic Press.
- ISBN 978-0-387-19557-5.
- ^ S2CID 2105042.
- S2CID 977486.
- PMID 22427981.
- ^ Yogita, Bansal (2016). "Insight to Computational Linguistics" (PDF). International Journal 4.10. p. 94. Retrieved September 22, 2024.
- ^ Yogita, Bansal (2016). "Insight to Computational Linguistics" (PDF). International Journal 4.10. p. 94. Retrieved September 22, 2024.
Further reading
- Bates, M (1995). "Models of natural language understanding". Proceedings of the National Academy of Sciences of the United States of America. 92 (22): 9977–9982. PMID 7479812.
- Steven Bird, Ewan Klein, and Edward Loper (2009). Natural Language Processing with Python. O'Reilly Media. ISBN 978-0-596-51649-9.
- Daniel Jurafsky and James H. Martin (2008). Speech and Language Processing, 2nd edition. Pearson Prentice Hall. ISBN 978-0-13-187321-6.
- Mohamed Zakaria KURDI (2016). Natural Language Processing and Computational Linguistics: speech, morphology, and syntax, Volume 1. ISTE-Wiley. ISBN 978-1848218482.
- Mohamed Zakaria KURDI (2017). Natural Language Processing and Computational Linguistics: semantics, discourse, and applications, Volume 2. ISTE-Wiley. ISBN 978-1848219212.
External links
- Association for Computational Linguistics (ACL)
- CICLing annual conferences on Computational Linguistics Archived 2019-02-06 at the Wayback Machine
- Computational Linguistics – Applications workshop
- Free online introductory book on Computational Linguistics at the Wayback Machine (archived January 25, 2008)
- Language Technology World
- Resources for Text, Speech and Language Processing
- The Research Group in Computational Linguistics Archived 2013-08-01 at the Wayback Machine