Natural-language understanding
Natural-language understanding (NLU) or natural-language interpretation (NLI)[1] is a subset of natural-language processing in artificial intelligence that deals with machine reading comprehension. Natural-language understanding is considered an AI-hard problem.[2]
There is considerable commercial interest in the field because of its application to automated reasoning,[3] machine translation,[4] question answering,[5] news-gathering, text categorization, voice activation, archiving, and large-scale content analysis.
History
The program STUDENT, written in 1964 by Daniel Bobrow for his PhD dissertation at MIT, is one of the earliest known attempts at natural-language understanding by a computer.[6][7][8][9][10] Eight years after John McCarthy coined the term artificial intelligence, Bobrow's dissertation (titled Natural Language Input for a Computer Problem Solving System) showed how a computer could understand simple natural-language input to solve algebra word problems.
A year later, in 1965, Joseph Weizenbaum at MIT wrote ELIZA, an interactive program that carried on a dialogue in English on any topic, the most popular being psychotherapy. ELIZA worked by simple parsing and substitution of key words into canned phrases, and Weizenbaum sidestepped the problem of giving the program a database of real-world knowledge or a rich lexicon. Yet ELIZA gained surprising popularity as a toy project and can be seen as a very early precursor to current commercial systems such as those used by Ask.com.[11]
In 1969, Roger Schank at Stanford University introduced the conceptual dependency theory for natural-language understanding.[12] This model, partially influenced by the work of Sydney Lamb, was extensively used by Schank's students at Yale University, such as Robert Wilensky, Wendy Lehnert, and Janet Kolodner.
In 1970, William A. Woods introduced the augmented transition network (ATN) to represent natural-language input.[13] Instead of phrase-structure rules, ATNs used an equivalent set of finite-state automata that were called recursively. ATNs and their more general format, called "generalized ATNs", continued to be used for a number of years.
In 1971, Terry Winograd finished writing SHRDLU for his PhD thesis at MIT. SHRDLU could understand simple English sentences in a restricted world of children's blocks to direct a robotic arm to move items. The successful demonstration of SHRDLU provided significant momentum for continued research in the field.[14][15] Winograd continued to be a major influence in the field with the publication of his book Language as a Cognitive Process.[16] At Stanford, Winograd would later advise Larry Page, who co-founded Google.
In the 1970s and 1980s, the natural language processing group at SRI International continued research and development in the field. A number of commercial efforts based on the research were undertaken; for example, in 1982 Gary Hendrix formed Symantec Corporation, originally as a company for developing a natural-language interface for database queries on personal computers, although with the advent of mouse-driven graphical user interfaces Symantec changed direction. A number of other commercial efforts were started around the same time, e.g., by Larry R. Harris at the Artificial Intelligence Corporation and by Roger Schank and his students at Cognitive Systems Corp.[17][18] In 1983, Michael Dyer developed the BORIS system at Yale, which bore similarities to the work of Roger Schank and W. G. Lehnert.[19]
The third millennium saw the introduction of systems using machine learning for text classification, such as the IBM Watson. However, experts debate how much "understanding" such systems demonstrate: according to John Searle, for instance, Watson did not even understand the questions.[20] Natural language processing has made inroads into applications that support human productivity in service and e-commerce, but this has largely been made possible by narrowing the scope of the application: there remain thousands of ways to request something in a human language that still defy conventional natural language processing.[21]
Scope and context
The umbrella term "natural-language understanding" can be applied to a diverse set of computer applications, ranging from small, relatively simple tasks such as short commands issued to robots, to highly complex endeavors such as the full comprehension of newspaper articles or poetry passages. Many real-world applications fall between the two extremes; for instance, text classification for the automatic analysis of emails and their routing to a suitable department in a corporation does not require an in-depth understanding of the text,[22] but it needs to deal with a much larger vocabulary and more diverse syntax than the management of simple queries to database tables with fixed schemata.
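To illustrate the shallow end of this spectrum, the following sketch routes emails by keyword overlap alone. The department names and keyword lists are illustrative assumptions, not taken from any deployed system.

```python
import re

# A minimal sketch of shallow text classification for email routing.
# Department names and keyword lists are illustrative assumptions.
ROUTING_KEYWORDS = {
    "billing": {"invoice", "payment", "refund", "charge"},
    "support": {"error", "crash", "broken", "password"},
    "sales": {"pricing", "quote", "demo", "purchase"},
}

def route_email(text: str) -> str:
    """Route to the department whose keywords overlap most with the text."""
    words = set(re.findall(r"[a-z]+", text.lower()))
    scores = {dept: len(words & kws) for dept, kws in ROUTING_KEYWORDS.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else "general"

print(route_email("The invoice shows a duplicate charge, please refund it."))
# -> 'billing' (three keyword hits; no deeper understanding of the text required)
```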
Throughout the years various attempts at processing natural language or English-like sentences presented to computers have taken place at varying degrees of complexity. Some attempts have not resulted in systems with deep understanding, but have helped overall system usability. For example, Wayne Ratliff originally developed the Vulcan program with an English-like syntax to mimic the English-speaking computer in Star Trek. Vulcan later became the dBase system, whose easy-to-use syntax effectively launched the personal-computer database industry.[23][24] Systems with an easy-to-use or English-like syntax are, however, quite distinct from systems that use a rich lexicon and include an internal representation (often as first-order logic) of the semantics of natural-language sentences.
Hence the breadth and depth of "understanding" aimed at by a system determine both the complexity of the system (and the implied challenges) and the types of applications it can deal with. The "breadth" of a system is measured by the sizes of its vocabulary and grammar. The "depth" is measured by the degree to which its understanding approximates that of a fluent native speaker. At the narrowest and shallowest, English-like command interpreters require minimal complexity, but have a small range of applications. Narrow but deep systems explore and model mechanisms of understanding,[25] but they still have limited application. Systems that attempt to understand the contents of a document such as a news release beyond simple keyword matching and to judge its suitability for a user are broader and require significant complexity,[26] but they are still somewhat shallow. Systems that are both very broad and very deep are beyond the current state of the art.
Components and architecture
Regardless of the approach used, most natural-language-understanding systems share some common components. The system needs a lexicon of the language, along with a parser and grammar rules to break sentences into an internal representation. The construction of a rich lexicon with a suitable ontology requires significant effort; the WordNet lexicon, for example, required many person-years of effort.[27]
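As a rough illustration of how these components fit together, the sketch below pairs a toy lexicon with a single grammar rule to turn one sentence pattern into a predicate-style internal representation. The lexicon entries, the rule, and the representation format are all illustrative assumptions, not the design of any cited system.

```python
# Toy illustration of the common NLU components: a lexicon mapping words to
# categories, and a parser applying a grammar rule to build an internal
# representation. All entries are illustrative.
LEXICON = {
    "the": "DET", "a": "DET",
    "robot": "NOUN", "block": "NOUN",
    "moves": "VERB", "lifts": "VERB",
}

def tag(sentence: str) -> list[tuple[str, str]]:
    """Look each word up in the lexicon (the lexical-analysis step)."""
    return [(w, LEXICON.get(w, "UNK")) for w in sentence.lower().split()]

def parse(tags):
    """Apply one grammar rule, S -> DET NOUN VERB DET NOUN, and emit a
    simple predicate-style internal representation."""
    cats = [c for _, c in tags]
    if cats == ["DET", "NOUN", "VERB", "DET", "NOUN"]:
        subj, verb, obj = tags[1][0], tags[2][0], tags[4][0]
        return {"predicate": verb, "agent": subj, "patient": obj}
    return None  # sentence falls outside the toy grammar

print(parse(tag("the robot lifts a block")))
# {'predicate': 'lifts', 'agent': 'robot', 'patient': 'block'}
```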
The system also needs theory from semantics to guide the comprehension. The interpretation capabilities of a language-understanding system depend on the semantic theory it uses. Competing semantic theories of language involve specific trade-offs in their suitability as the basis of computer-automated semantic interpretation.[28] These range from naive semantics or stochastic semantic analysis to the use of pragmatics to derive meaning from context.[29][30][31] Semantic parsers convert natural-language texts into formal meaning representations.[32]
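The following is a minimal sketch of semantic parsing in this sense: it maps one restricted utterance pattern onto a first-order-logic-style meaning representation. The pattern, predicate names, and output notation are illustrative assumptions, not the method of any cited system.

```python
import re

# Minimal rule-based semantic parser: one utterance pattern is converted
# into a first-order-logic-style meaning representation. The pattern and
# predicate names are illustrative assumptions.
PATTERN = re.compile(r"(?:show me |list )?flights from (\w+) to (\w+)", re.I)

def semantic_parse(utterance: str) -> str | None:
    """Map an utterance onto a formal meaning representation, or None."""
    m = PATTERN.search(utterance)
    if m:
        origin, dest = m.group(1).capitalize(), m.group(2).capitalize()
        return f"lambda x. flight(x) AND from(x, {origin}) AND to(x, {dest})"
    return None  # utterance falls outside the covered fragment

print(semantic_parse("Show me flights from Boston to Denver"))
# lambda x. flight(x) AND from(x, Boston) AND to(x, Denver)
```

A grammar-driven system would build such representations compositionally rather than by pattern matching, but the input/output contract of the semantic parser is the same.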
Advanced applications of natural-language understanding also attempt to incorporate logical inference within their framework. This is generally achieved by mapping the derived meaning into a set of assertions in predicate logic, then using logical deduction to arrive at conclusions. Systems based on functional languages such as Lisp therefore need to include a subsystem to represent logical assertions, while logic-oriented systems such as those using the language Prolog generally rely on an extension of the built-in logical representation framework.[33][34]
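Below is a minimal sketch of this inference step: predicate-logic assertions derived from language are extended by a tiny forward-chaining loop. The facts, rule, and predicate names are illustrative assumptions; real systems use full unification and richer logics.

```python
# Hedged sketch of logical deduction over predicate-logic assertions.
# Facts and rules are tuples; "?x" is a variable over entity constants.
facts = {("flight", "ba123"), ("departs_from", "ba123", "boston")}

# Each rule: if all premises hold under a binding, add the conclusion.
rules = [
    ([("flight", "?x"), ("departs_from", "?x", "boston")],
     ("leaves_boston", "?x")),
]

def entities(facts):
    """Collect every entity constant appearing as an argument of some fact."""
    return {arg for f in facts for arg in f[1:]}

def substitute(template, binding):
    """Replace variables in a fact template with bound constants."""
    return tuple(binding.get(t, t) for t in template)

def forward_chain(facts, rules):
    """Apply rules until fixpoint: assert a conclusion whenever all premises hold."""
    facts = set(facts)
    changed = True
    while changed:
        changed = False
        for premises, conclusion in rules:
            for e in entities(facts):
                binding = {"?x": e}
                if all(substitute(p, binding) in facts for p in premises):
                    new_fact = substitute(conclusion, binding)
                    if new_fact not in facts:
                        facts.add(new_fact)
                        changed = True
    return facts

print(forward_chain(facts, rules))
# derived facts include ('leaves_boston', 'ba123')
```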
The management of context in natural-language understanding can present special challenges. A large variety of examples and counterexamples have resulted in multiple approaches to the formal modeling of context, each with specific strengths and weaknesses.[35][36]
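One classic context problem is pronoun resolution. The sketch below applies a naive recency heuristic over an illustrative noun list; as the output shows, the heuristic picks the wrong antecedent here, which is exactly the kind of counterexample that motivates more sophisticated formal models of context.

```python
# Naive pronoun resolution by recency: replace each pronoun with the most
# recently mentioned noun. The pronoun and noun sets are illustrative.
PRONOUNS = {"it", "he", "she", "they"}
NOUNS = {"robot", "block", "table"}  # toy list of candidate referents

def resolve_pronouns(sentences: list[str]) -> list[str]:
    """Replace each pronoun with the most recently mentioned noun."""
    salience: list[str] = []  # most recent referent last
    resolved = []
    for sentence in sentences:
        out = []
        for word in sentence.lower().split():
            if word in PRONOUNS and salience:
                word = salience[-1]  # recency heuristic
            if word in NOUNS:
                salience.append(word)
            out.append(word)
        resolved.append(" ".join(out))
    return resolved

print(resolve_pronouns(["the robot lifts a block", "then it drops the block"]))
# ['the robot lifts a block', 'then block drops the block']
# The heuristic resolves "it" to "block" rather than the intended "robot".
```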
See also
- Computational semantics
- Computational linguistics
- Discourse representation theory
- Deep linguistic processing
- History of natural language processing
- Information extraction
- Natural-language processing
- Natural-language programming
- Natural-language user interface
- Siri (software)
- Wolfram Alpha
- Open information extraction
- Part-of-speech tagging
- Speech recognition
Notes
- ^ Semaan, P. (2012). Natural Language Generation: An Overview. Journal of Computer Science & Research (JCSCR), pp. 50-57.
- ^ Roman V. Yampolskiy. Turing Test as a Defining Feature of AI-Completeness. In Artificial Intelligence, Evolutionary Computation and Metaheuristics (AIECM) -- In the Footsteps of Alan Turing. Xin-She Yang (Ed.). pp. 3-17 (Chapter 1). Springer, London, 2013. http://cecs.louisville.edu/ry/TuringTestasaDefiningFeature04270003.pdf
- ^ Van Harmelen, Frank, Vladimir Lifschitz, and Bruce Porter, eds. Handbook of knowledge representation. Vol. 1. Elsevier, 2008.
- ^ Macherey, Klaus, Franz Josef Och, and Hermann Ney. "Natural language understanding using statistical machine translation." Seventh European Conference on Speech Communication and Technology. 2001.
- ^ Hirschman, Lynette, and Robert Gaizauskas. "Natural language question answering: the view from here." Natural Language Engineering 7.4 (2001): 275-300.
- ^ American Association for Artificial Intelligence, Brief History of AI [1]
- ^ Daniel Bobrow's PhD thesis, Natural Language Input for a Computer Problem Solving System.
- ^ ISBN 1-56881-205-1, page 286
- ^ p. 19
- ^ ISBN 0-262-58150-7, page 278
- ^ ISBN 0-7167-0463-3, pages 188-189
- ^ Roger Schank, 1969, A conceptual dependency parser for natural language Proceedings of the 1969 conference on Computational linguistics, Sång-Säby, Sweden, pages 1-3
- ^ Woods, William A (1970). "Transition Network Grammars for Natural Language Analysis". Communications of the ACM 13 (10): 591–606 [2]
- ^ ISBN 0-415-19332-X, page 89
- ^ Terry Winograd's SHRDLU page at Stanford SHRDLU
- ^ Winograd, Terry (1983), Language as a Cognitive Process, Addison–Wesley, Reading, MA.
- ^ Larry R. Harris, Research at the Artificial Intelligence corp. ACM SIGART Bulletin, issue 79, January 1982 [3]
- ^ ISBN 0-89859-767-6, page xiii
- ^ ISBN 0-262-04073-5
- ^ Searle, John (23 February 2011). "Watson Doesn't Know It Won on 'Jeopardy!'". Wall Street Journal.
- ^ Brandon, John (2016-07-12). "What Natural Language Understanding tech means for chatbots". VentureBeat. Retrieved 2024-02-29.
- ^ ISBN 3-540-73350-7
- ^ InfoWorld, Nov 13, 1989, page 144
- ^ InfoWorld, April 19, 1984, page 71
- ^ Building Working Models of Full Natural-Language Understanding in Limited Pragmatic Domains by James Mason 2010 [4]
- ^ ISBN 1-55860-754-4, page 289
- ^ G. A. Miller, R. Beckwith, C. D. Fellbaum, D. Gross, K. Miller. 1990. WordNet: An online lexical database. Int. J. Lexicograph. 3, 4, pp. 235-244.
- ^ ISBN 0-415-16792-2, page 209
- ^ ISBN 0-89838-287-4
- ^ ISBN 0-7923-8571-3
- ^ ISBN 0-8058-2166-X
- ^ Wong, Yuk Wah, and Raymond J. Mooney. "Learning for semantic parsing with statistical machine translation." Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics. Association for Computational Linguistics, 2006.
- ^ ISBN 0-13-629478-2
- ^ ISBN 0-201-18053-7
- ^ ISBN 0-262-18192-4, page 111
- ^ ISBN 0-7923-6350-7
- ^ Programming with Natural Language Is Actually Going to Work—Wolfram Blog
- ^ Van Valin, Jr, Robert D. "From NLP to NLU" (PDF).
- ^ Ball, John. "multi-lingual NLU by Pat Inc". Pat.ai.