What Is Natural Language Processing (NLP)?

Natural language processing (NLP) is a fast-growing type of artificial intelligence (AI). You've probably benefitted from NLP if you've asked your smartphone for directions to a restaurant or run a spell-check on a word processing program. Read on to learn more about NLP, its history and development, and how it's being used.

Artificial intelligence robots working in an office. — Image source: Getty Images.

What is natural language processing?

NLP is an increasingly common branch of AI, found in everything from smartphones to home kitchens, and involves the ability of computers to understand spoken language and text. Although most uses are fairly pedestrian -- Alexa, what is AI? -- NLP is becoming more sophisticated.

For example, tools like Generative Pre-trained Transformer 3 (GPT-3), developed by OpenAI, use a neural network machine learning model that can not only code but also write articles and answer questions, frequently in a manner virtually indistinguishable from a human response.

Machine Learning

Machine learning is a technique by which artificial intelligence programs are trained.

Development of natural language processing

We generally think of AI and its branches, like NLP, as relatively recent phenomena. But it actually dates back to 1950, when famed mathematician Alan Turing developed criteria for determining whether a computer was capable of AI. One criterion for the test involved deciding whether the computer could interpret and generate natural language.

For the next 50 years, linguists developed NLP using painstaking trial-and-error rules. In the 1990s, however, computers became much faster and more capable of doing calculations in seconds, even those that previously took hours or days.

The result was a shift to statistical methods to develop NLP capabilities. The current approach to NLP uses both linguistic and statistical methods to interpret and respond to instructions.

NLP involves two basic steps: data preprocessing and algorithm development. Data preprocessing generally uses at least one of four steps to allow the machine to work with text data:

Tokenization: The text is broken down into digestible parts.
Stop word removal: Common words are removed from the text, providing more information from unique words.
Lemmatization: Words are reduced to their root forms.
Part-of-speech tagging: Words are broken down into parts of speech (i.e., nouns, verbs, and so forth).

Once data preprocessing has been completed, an algorithm takes over. Two main types are used in NLP:

Rules-based: This approach uses linguistic techniques that allow the computer to process the information. It's similar to the approach used in the very early days of AI.
Machine learning-based: Statistical systems are used to teach the computer to process the information, often using a combination of machine learning and neural networks.

Semi-Supervised Machine Learning

A hybrid approach that takes advantage of both supervised and unsupervised learning as a single, automated analysis.

Natural language processing uses

Tools like Apple's (AAPL -0.34%) virtual assistant, Siri, and Amazon's (AMZN +0.11%) cloud-based voice service, Alexa, have made NLP commonplace. But a number of fears specific to NLP worry many experts.

Part of that can be attributed to the vast number of tasks humans currently do that computers could do in the coming years. For example, tech giant IBM (IBM +0.51%) has cited a handful of potential uses of NLP for real-world applications that may have escaped many people's attention:

Computer chips inside an outline of a human body. — Image source: Getty Images.

Spam detection: NLP can use a number of rules to clear out emails and texts that (hopefully) are spam, weeding out messages that include poor spelling, bad grammar, inappropriate urgency, and other common features of nuisance messages.

Machine translation: Translating words from one language into another isn't always a straightforward proposition. IBM notes, for example, that the English phrase, "The spirit is willing, but the flesh is weak," was originally translated into Russian as "The vodka is good, but the meat is rotten." The Google (GOOG -3.14%) (GOOGL -3.18%) Translate service remains the standard.

Virtual agents and chatbots: As better algorithms are developed, chatbots are becoming slightly less aggravating than automated telephone customer service responses. Likewise, virtual agents like Siri and Alexa continue to improve, recognizing context and providing better answers.

Social media sentiment: NLP is not only becoming a defense against disinformation and hate speech on social media but can also tell companies how their customers truly feel about their products, promotions, and events. Sentiment analysis can help companies design better products and advertising.

Text summarization: This uses NLP to synthesize and summarize large volumes of text. Some applications can even use rudimentary reasoning to generate context to the summaries, as well as conclusions.

Natural language processing innovation

Already, innovators are making progress on NLP-based tools that could eventually take the place of people. Consider the underappreciated chief of staff or the administrative assistant who schedules and takes notes during meetings and generally keeps the organizational trains running on time.

The start-up Xembly is using an automated, NLP-powered platform to handle many office jobs that often get lost in the shuffle. Its conversational AI agent, Xena, can listen to meetings, take notes, schedule meetings through Slack or email, remind people of action items, and even understand who is talking to whom when there is more than one speaker.

Although Xena may never be able to clear out the refrigerator in your office building or ensure everyone actually signs a birthday card, the agent is likely a harbinger of bigger things to come in the NLP world. Smart companies are already considering how to utilize NLP and other AI tools to make their workplaces more efficient and profitable. And smart investors will pay attention to these tools and how they're used as they continue to develop.

The Motley Fool has positions in and recommends Alphabet, Amazon, Apple, and International Business Machines. The Motley Fool has a disclosure policy.