Artificial intelligence

From Wikipedia, the free encyclopedia

Artificial intelligence (AI) is intelligence—perceiving, synthesizing, and inferring information—demonstrated by machines, as opposed to intelligence displayed by humans or by other animals. Example tasks in which this is done include speech recognition, computer vision, translation between (natural) languages, as well as other mappings of inputs.[1]

AI applications include advanced web search engines (e.g., Google Search), recommendation systems (used by YouTube, Amazon, and Netflix), understanding human speech (such as Siri and Alexa), self-driving cars (e.g., Waymo), generative or creative tools (ChatGPT and AI art), automated decision-making, and competing at the highest level in strategic game systems (such as chess and Go).[2]

As machines become increasingly capable, tasks considered to require "intelligence" are often removed from the definition of AI, a phenomenon known as the AI effect.[3] For instance, optical character recognition is frequently excluded from things considered to be AI, having become a routine technology.[4]

Artificial intelligence was founded as an academic discipline in 1956, and in the years since it has experienced several waves of optimism,[5][6] followed by disappointment and the loss of funding (known as an "AI winter"),[7][8] followed by new approaches, success, and renewed funding.[6][9] AI research has tried and discarded many different approaches, including simulating the brain, modeling human problem solving, formal logic, large databases of knowledge, and imitating animal behavior. In the first decades of the 21st century, highly mathematical and statistical machine learning has dominated the field, and this technique has proved highly successful, helping to solve many challenging problems throughout industry and academia.[9][10]

The various sub-fields of AI research are centered around particular goals and the use of particular tools. The traditional goals of AI research include reasoning, knowledge representation, planning, learning, natural language processing, perception, and the ability to move and manipulate objects.[a] General intelligence (the ability to solve an arbitrary problem) is among the field's long-term goals.[11] To solve these problems, AI researchers have adapted and integrated a wide range of problem-solving techniques, including search and mathematical optimization, formal logic, artificial neural networks, and methods based on statistics, probability, and economics. AI also draws upon computer science, psychology, linguistics, philosophy, and many other fields.

The field was founded on the assumption that human intelligence "can be so precisely described that a machine can be made to simulate it".[b] This raised philosophical arguments about the mind and the ethical consequences of creating artificial beings endowed with human-like intelligence; these issues have previously been explored by myth, fiction (science fiction), and philosophy since antiquity.[13] Computer scientists and philosophers have since suggested that AI may become an existential risk to humanity if its rational capacities are not steered towards goals beneficial to humankind.[c] Economists have frequently highlighted the risks of redundancies from AI, and speculated about unemployment if there is no adequate social policy for full employment.[14] The term artificial intelligence has also been criticized for overhyping AI's true technological capabilities.[15][16][17]

History

Silver didrachma from Crete depicting Talos, an ancient mythical automaton with artificial intelligence (c. 300 BC)

Artificial beings with intelligence appeared as storytelling devices in antiquity,[18] and have been common in fiction, as in Mary Shelley's Frankenstein or Karel Čapek's R.U.R.[19] These characters and their fates raised many of the same issues now discussed in the ethics of artificial intelligence.[20]

The study of mechanical or "formal" reasoning began with philosophers and mathematicians in antiquity. The study of mathematical logic led directly to Alan Turing's theory of computation, which suggested that a machine, by shuffling symbols as simple as "0" and "1", could simulate any conceivable act of mathematical deduction. This insight that digital computers can simulate any process of formal reasoning is known as the Church–Turing thesis.[21] This, along with concurrent discoveries in neurobiology, information theory and cybernetics, led researchers to consider the possibility of building an electronic brain.[22] The first work that is now generally recognized as AI was McCulloch and Pitts' 1943 formal design for Turing-complete "artificial neurons".[23]

By the 1950s, two visions for how to achieve machine intelligence emerged. One vision, known as Symbolic AI or GOFAI, was to use computers to create a symbolic representation of the world and systems that could reason about the world. Proponents included Allen Newell, Herbert A. Simon, and Marvin Minsky. Closely associated with this approach was the "heuristic search" approach, which likened intelligence to a problem of exploring a space of possibilities for answers.

The second vision, known as the connectionist approach, sought to achieve intelligence through learning. Proponents of this approach, most prominently Frank Rosenblatt, sought to connect perceptrons in ways inspired by the connections between neurons.[24] James Manyika and others have compared the two approaches to the mind (Symbolic AI) and the brain (connectionist). Manyika argues that symbolic approaches dominated the push for artificial intelligence in this period, due in part to their connection to the intellectual traditions of Descartes, Boole, Gottlob Frege, Bertrand Russell, and others. Connectionist approaches based on cybernetics or artificial neural networks were pushed to the background but have gained new prominence in recent decades.[25]

The field of AI research was born at a workshop at Dartmouth College in 1956.[d][28] The attendees became the founders and leaders of AI research.[e] They and their students produced programs that the press described as "astonishing":[f] computers were learning checkers strategies, solving word problems in algebra, proving logical theorems and speaking English.[g][30]

By the middle of the 1960s, research in the U.S. was heavily funded by the Department of Defense[31] and laboratories had been established around the world.[32]

Researchers in the 1960s and the 1970s were convinced that symbolic approaches would eventually succeed in creating a machine with artificial general intelligence and considered this the goal of their field.[33] Herbert Simon predicted, "machines will be capable, within twenty years, of doing any work a man can do".[34] Marvin Minsky agreed, writing, "within a generation ... the problem of creating 'artificial intelligence' will substantially be solved".[35]

They had failed to recognize the difficulty of some of the remaining tasks. Progress slowed and in 1974, in response to the criticism of Sir James Lighthill[36] and ongoing pressure from the U.S. Congress to fund more productive projects, both the U.S. and British governments cut off exploratory research in AI. The next few years would later be called an "AI winter", a period when obtaining funding for AI projects was difficult.[7]

In the early 1980s, AI research was revived by the commercial success of expert systems,[37] a form of AI program that simulated the knowledge and analytical skills of human experts. By 1985, the market for AI had reached over a billion dollars. At the same time, Japan's fifth generation computer project inspired the U.S. and British governments to restore funding for academic research.[6] However, beginning with the collapse of the Lisp Machine market in 1987, AI once again fell into disrepute, and a second, longer-lasting winter began.[8]

Many researchers began to doubt that the symbolic approach would be able to imitate all the processes of human cognition, especially perception, robotics, learning and pattern recognition. A number of researchers began to look into "sub-symbolic" approaches to specific AI problems.[38] Robotics researchers, such as Rodney Brooks, rejected symbolic AI and focused on the basic engineering problems that would allow robots to move, survive, and learn their environment.[h]

Interest in neural networks and "connectionism" was revived by Geoffrey Hinton, David Rumelhart and others in the middle of the 1980s.[43] Soft computing tools were developed in the 1980s, such as neural networks, fuzzy systems, Grey system theory, evolutionary computation and many tools drawn from statistics or mathematical optimization.

AI gradually restored its reputation in the late 1990s and early 21st century by finding specific solutions to specific problems. The narrow focus allowed researchers to produce verifiable results, exploit more mathematical methods, and collaborate with other fields (such as statistics, economics and mathematics).[44] By 2000, solutions developed by AI researchers were being widely used, although in the 1990s they were rarely described as "artificial intelligence".[10]

Faster computers, algorithmic improvements and access to large amounts of data enabled advances in machine learning and perception; data-hungry deep learning methods started to dominate accuracy benchmarks around 2012.[45] According to Bloomberg's Jack Clark, 2015 was a landmark year for artificial intelligence, with the number of software projects that use AI within Google increasing from "sporadic usage" in 2012 to more than 2,700 projects.[i] He attributed this to an increase in affordable neural networks, due to a rise in cloud computing infrastructure and to an increase in research tools and datasets.[9]

In a 2017 survey, one in five companies reported they had "incorporated AI in some offerings or processes".[46] The amount of research into AI (measured by total publications) increased by 50% in the years 2015–2019.[47]

Numerous academic researchers became concerned that AI was no longer pursuing the original goal of creating versatile, fully intelligent machines. Much of current research involves statistical AI, which is overwhelmingly used to solve specific problems; this is true even of highly successful techniques such as deep learning. This concern has led to the subfield of artificial general intelligence (or "AGI"), which had several well-funded institutions by the 2010s.[11]

In April 2023, computer scientist Jaron Lanier published an alternative view of AI in The New Yorker as less intelligent than the name, and popular culture, may suggest. Lanier concludes his essay as follows: "Think of people. People are the answer to the problems of bits."[48][49]

Goals

The general problem of simulating (or creating) intelligence has been broken down into sub-problems. These consist of particular traits or capabilities that researchers expect an intelligent system to display. The traits described below have received the most attention.[a]

Reasoning, problem-solving

Early researchers developed algorithms that imitated step-by-step reasoning that humans use when they solve puzzles or make logical deductions.[50] By the late 1980s and 1990s, AI research had developed methods for dealing with uncertain or incomplete information, employing concepts from probability and economics.[51]

Many of these algorithms proved to be insufficient for solving large reasoning problems because they experienced a "combinatorial explosion": they became exponentially slower as the problems grew larger.[52] Even humans rarely use the step-by-step deduction that early AI research could model. They solve most of their problems using fast, intuitive judgments.[53]
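The growth behind a "combinatorial explosion" can be made concrete with a small sketch. The function name count_assignments is hypothetical and chosen only for illustration; it counts every true/false assignment that a naive step-by-step reasoner would have to examine for a given number of Boolean variables.

```python
from itertools import product

def count_assignments(n_vars):
    """Count every True/False assignment over n_vars Boolean variables."""
    # A brute-force reasoner would have to visit each of these in turn.
    return sum(1 for _ in product([False, True], repeat=n_vars))

# Each added variable doubles the number of cases to examine.
for n in (2, 4, 8, 16):
    print(n, count_assignments(n))
```

The count is 2 raised to the number of variables, so even modestly sized problems quickly exceed what exhaustive enumeration can cover.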

Knowledge representation

An ontology represents knowledge as a set of concepts within a domain and the relationships between those concepts.

Knowledge representation and knowledge engineering[54] allow AI programs to answer questions intelligently and make deductions about real-world facts.

A representation of "what exists" is an ontology: the set of objects, relations, concepts, and properties formally described so that software agents can interpret them.[55] The most general ontologies are called upper ontologies, which attempt to provide a foundation for all other knowledge and act as mediators between domain ontologies that cover specific knowledge about a particular knowledge domain (field of interest or area of concern). A truly intelligent program would also need access to commonsense knowledge: the set of facts that an average person knows. The semantics of an ontology is typically represented in description logic, such as the Web Ontology Language.[56]

AI research has developed tools to represent specific domains, such as objects, properties, categories and relations between objects;[56] situations, events, states and time;[57] causes and effects;[58] knowledge about knowledge (what we know about what other people know);[59] default reasoning (things that humans assume are true until they are told differently and will remain true even when other facts are changing);[60] as well as other domains. Among the most difficult problems in AI are: the breadth of commonsense knowledge (the number of atomic facts that the average person knows is enormous);[61] and the sub-symbolic form of most commonsense knowledge (much of what people know is not represented as "facts" or "statements" that they could express verbally).[53]

Formal knowledge representations are used in content-based indexing and retrieval,[62] scene interpretation,[63] clinical decision support,[64] knowledge discovery (mining "interesting" and actionable inferences from large databases),[65] and other areas.[66]

Learning

Machine learning (ML), a fundamental concept of AI research since the field's inception,[j] is the study of computer algorithms that improve automatically through experience.[k]

Unsupervised learning finds patterns in a stream of input.

Supervised learning requires a human to label the input data first, and comes in two main varieties: classification and numerical regression. Classification is used to determine what category something belongs in – the program sees a number of examples of things from several categories and will learn to classify new inputs. Regression is the attempt to produce a function that describes the relationship between inputs and outputs and predicts how the outputs should change as the inputs change. Both classifiers and regression learners can be viewed as "function approximators" trying to learn an unknown (possibly implicit) function; for example, a spam classifier can be viewed as learning a function that maps from the text of an email to one of two categories, "spam" or "not spam".[70]
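As a minimal sketch of regression viewed as function approximation, the following fits a straight line to labeled examples by ordinary least squares. The function name fit_line and the sample data are assumptions made for illustration; real systems typically use richer models and noisy data.

```python
def fit_line(xs, ys):
    """Least-squares fit of y = slope * x + intercept to labeled examples."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance of x and y, and variance of x (both left unnormalized).
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical training data generated by the "unknown" function y = 2x + 1.
slope, intercept = fit_line([0, 1, 2, 3], [1, 3, 5, 7])
print(slope, intercept)
```

The learner never sees the rule that generated the data; it recovers an approximation of it from the input and output pairs alone.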

In reinforcement learning the agent is rewarded for good responses and punished for bad ones. The agent classifies its responses to form a strategy for operating in its problem space.[71]
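One common way to realize this reward-and-punishment loop is tabular Q-learning, sketched below on a toy corridor. The environment, the reward values, and all names (step, train, N_STATES) are assumptions for illustration, not part of any particular library.

```python
import random

N_STATES = 5          # corridor cells 0..4; cell 4 is the goal
ACTIONS = (-1, +1)    # step left or step right

def step(state, action):
    """Move within the corridor; reward the goal, mildly punish other moves."""
    nxt = min(max(state + action, 0), N_STATES - 1)
    done = nxt == N_STATES - 1
    reward = 1.0 if done else -0.01
    return nxt, reward, done

def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = {(s, a): 0.0 for s in range(N_STATES) for a in ACTIONS}
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)          # explore
            else:
                action = max(ACTIONS, key=lambda a: q[(state, a)])  # exploit
            nxt, reward, done = step(state, action)
            best_next = 0.0 if done else max(q[(nxt, a)] for a in ACTIONS)
            # Nudge the estimate toward reward plus discounted future value.
            q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])
            state = nxt
    return q

q_table = train()
# Greedy action in each non-goal cell after training.
policy = [max(ACTIONS, key=lambda a: q_table[(s, a)]) for s in range(N_STATES - 1)]
print(policy)
```

The learned table of action values plays the role of the strategy: in each cell the agent picks the action with the highest estimated long-run reward.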

Transfer learning is when the knowledge gained from one problem is applied to a new problem.[72]

Computational learning theory can assess learners by computational complexity, by sample complexity (how much data is required), or by other notions of optimization.[73]

Natural language processing

A parse tree represents the syntactic structure of a sentence according to some formal grammar.

Natural language processing (NLP)[74] allows machines to read and understand human language. A sufficiently powerful natural language processing system would enable natural-language user interfaces and the acquisition of knowledge directly from human-written sources, such as newswire texts. Some straightforward applications of NLP include information retrieval, question answering and machine translation.[75]

Symbolic AI used formal syntax to translate the deep structure of sentences into logic. This failed to produce useful applications, due to the intractability of logic[52] and the breadth of commonsense knowledge.[61] Modern statistical techniques include co-occurrence frequencies (how often one word appears near another), keyword spotting (searching for a particular word to retrieve information), transformer-based deep learning (which finds patterns in text), and others.[76] They have achieved acceptable accuracy at the page or paragraph level, and, by 2019, could generate coherent text.[77]
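Co-occurrence frequencies can be sketched in a few lines. The function name cooccurrence, the window size, and the whitespace tokenization are assumptions for illustration; production systems use far larger corpora and more careful tokenization.

```python
from collections import Counter

def cooccurrence(tokens, window=2):
    """Count unordered word pairs that appear within `window` positions."""
    counts = Counter()
    for i, word in enumerate(tokens):
        # Look only ahead so that each pair of positions is counted once.
        for other in tokens[i + 1 : i + 1 + window]:
            counts[tuple(sorted((word, other)))] += 1
    return counts

tokens = "the cat sat on the mat".split()
counts = cooccurrence(tokens)
print(counts.most_common(3))
```

Pairs that co-occur more often than chance would predict are treated as statistically related, without any grammar or logic being supplied.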

Perception

Feature detection (pictured: edge detection) helps AI compose informative abstract structures out of raw data.

Machine perception[78] is the ability to use input from sensors (such as cameras, microphones, wireless signals, and active lidar, sonar, radar, and tactile sensors) to deduce aspects of the world. Applications include speech recognition,[79] facial recognition, and object recognition.[80] Computer vision is the ability to analyze visual input.[81]

Social intelligence

Kismet, a robot with rudimentary social skills[82]

Affective computing is an interdisciplinary umbrella that comprises systems that recognize, interpret, process or simulate human feeling, emotion and mood.[83] For example, some virtual assistants are programmed to speak conversationally or even to banter humorously; this makes them appear more sensitive to the emotional dynamics of human interaction, or otherwise facilitates human–computer interaction. However, this tends to give naïve users an unrealistic conception of how intelligent existing computer agents actually are.[84] Moderate successes related to affective computing include textual sentiment analysis and, more recently, multimodal sentiment analysis, wherein AI classifies the affects displayed by a videotaped subject.[85]

General intelligence

A machine with general intelligence can solve a wide variety of problems with breadth and versatility similar to human intelligence. There are several competing ideas about how to develop artificial general intelligence. Hans Moravec and Marvin Minsky argue that work in different individual domains can be incorporated into an advanced multi-agent system or cognitive architecture with general intelligence.[86] Pedro Domingos hopes that there is a conceptually straightforward, but mathematically difficult, "master algorithm" that could lead to AGI.[87] Others believe that anthropomorphic features like an artificial brain[88] or simulated child development[l] will someday reach a critical point where general intelligence emerges.

Tools

Search and optimization

AI can solve many problems by intelligently searching through many possible solutions.[89] Reasoning can be reduced to performing a search. For example, logical proof can be viewed as searching for a path that leads from premises to conclusions, where each step is the application of an inference rule.[90] Planning algorithms search through trees of goals and subgoals, attempting to find a path to a target goal, a process called means-ends analysis.[91] Robotics algorithms for moving limbs and grasping objects use local searches in configuration space.[92]

Simple exhaustive searches[93] are rarely sufficient for most real-world problems: the search space (the number of places to search) quickly grows to astronomical numbers. The result is a search that is too slow or never completes. The solution, for many problems, is to use "heuristics" or "rules of thumb" that prioritize choices in favor of those more likely to reach a goal and to do so in a shorter number of steps. In some search methodologies, heuristics can also serve to eliminate some choices unlikely to lead to a goal (called "pruning the search tree"). Heuristics supply the program with a "best guess" for the path on which the solution lies.[94] Heuristics limit the search for solutions into a smaller sample size.[95]
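A well-known example of heuristic search is A*, sketched here on a small grid. The grid encoding (0 for free, 1 for blocked), the Manhattan-distance heuristic, and the function name a_star are assumptions chosen for illustration.

```python
import heapq

def a_star(grid, start, goal):
    """Return the length of a shortest 4-connected path, or None if blocked."""
    def h(cell):
        # Manhattan distance: a "best guess" that never overestimates here.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    frontier = [(h(start), 0, start)]   # entries are (cost + heuristic, cost, cell)
    best = {start: 0}
    while frontier:
        _, cost, cell = heapq.heappop(frontier)
        if cell == goal:
            return cost
        if cost > best.get(cell, float("inf")):
            continue                     # stale queue entry, already improved
        r, c = cell
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if 0 <= nr < len(grid) and 0 <= nc < len(grid[0]) and grid[nr][nc] == 0:
                new_cost = cost + 1
                if new_cost < best.get((nr, nc), float("inf")):
                    best[(nr, nc)] = new_cost
                    heapq.heappush(frontier, (new_cost + h((nr, nc)), new_cost, (nr, nc)))
    return None

GRID = [
    [0, 0, 0, 0],
    [1, 1, 0, 1],
    [0, 0, 0, 0],
]
print(a_star(GRID, (0, 0), (2, 0)))
```

The priority queue always expands the cell whose cost so far plus estimated remaining distance is smallest, so cells that lead away from the goal are explored late or not at all.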

A very different kind of search came to prominence in the 1990s, based on the mathematical theory of optimization. For many problems, it is possible to begin the search with some form of a guess and then refine the guess incrementally until no more refinements can be made. These algorithms can be visualized as blind hill climbing: we begin the search at a random point on the landscape, and then, by jumps or steps, we keep moving our guess uphill, until we reach the top. Other related optimization algorithms include random optimization, beam search and metaheuristics like simulated annealing.[96] Evolutionary computation uses a form of optimization search. For example, evolutionary algorithms may begin with a population of organisms (the guesses) and then allow them to mutate and recombine, selecting only the fittest to survive each generation (refining the guesses). Classic evolutionary algorithms include genetic algorithms, gene expression programming, and genetic programming.[97] Alternatively, distributed search processes can coordinate via swarm intelligence algorithms. Two popular swarm algorithms used in search are particle swarm optimization (inspired by bird flocking) and ant colony optimization (inspired by ant trails).[98]
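The hill-climbing idea can be sketched generically. The function name hill_climb, the integer search space, and the toy scoring function are assumptions for illustration; for simplicity the sketch starts from a fixed point rather than a random one.

```python
def hill_climb(score, start, neighbors):
    """Repeatedly move to the best neighbor until none improves the score."""
    current = start
    while True:
        best = max(neighbors(current), key=score)
        if score(best) <= score(current):
            return current            # no uphill step left: a (local) peak
        current = best

# Hypothetical landscape with a single peak at x = 7.
peak = hill_climb(
    score=lambda x: -(x - 7) ** 2,
    start=0,
    neighbors=lambda x: [x - 1, x + 1],
)
print(peak)
```

On a landscape with several peaks this procedure stops at whichever peak is nearest to the starting guess, which is one motivation for the randomized and population-based variants described above.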

Logic

Logic[99] is used for knowledge representation and problem-solving, but it can be applied to other problems as well. For example, the satplan algorithm uses logic for planning[100] and inductive logic programming is a method for learning.[101]

Several different forms of logic are used in AI research. Propositional logic[102] involves truth functions such as "or" and "not". First-order logic[103] adds quantifiers and predicates and can express facts about objects, their properties, and their relations with each other. Fuzzy logic assigns a "degree of truth" (between 0 and 1) to vague statements such as "Alice is old" (or rich, or tall, or hungry), that are too linguistically imprecise to be completely true or false.[104] Default logics, non-monotonic logics and circumscription are forms of logic designed to help with default reasoning and the qualification problem.[60] Several extensions of logic have been designed to handle specific domains of knowledge, such as description logics;[56] situation calculus, event calculus and fluent calculus (for representing events and time);[57] causal calculus;[58] belief calculus (belief revision); and modal logics.[59] Logics to model contradictory or inconsistent statements arising in multi-agent systems have also been designed, such as paraconsistent logics.[105]
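For propositional logic, one simple (though exponential) way to check whether a conclusion follows from premises is to enumerate every truth assignment. The sketch below assumes formulas are encoded as Python functions over a dictionary of symbol values; the name entails is hypothetical.

```python
from itertools import product

def entails(premises, conclusion, symbols):
    """True if the conclusion holds in every model that satisfies all premises."""
    for values in product([False, True], repeat=len(symbols)):
        model = dict(zip(symbols, values))
        if all(p(model) for p in premises) and not conclusion(model):
            return False              # found a counterexample model
    return True

# Premises: P, and "P implies Q" written as (not P) or Q.
premises = [
    lambda m: m["P"],
    lambda m: (not m["P"]) or m["Q"],
]
print(entails(premises, lambda m: m["Q"], ["P", "Q"]))
```

This is modus ponens verified by truth table; the same check reports that Q alone does not entail P.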

Probabilistic methods for uncertain reasoning

Expectation-maximization clustering of Old Faithful eruption data starts from a random guess but then successfully converges on an accurate clustering of the two physically distinct modes of eruption.

Many problems in AI (including in reasoning, planning, learning, perception, and robotics) require the agent to operate with incomplete or uncertain information. AI researchers have devised a number of tools to solve these problems using methods from probability theory and economics.[106] Bayesian networks[107] are a very general tool that can be used for various problems, including reasoning (using the Bayesian inference algorithm),[m][109] learning (using the expectation-maximization algorithm),[n][111] planning (using decision networks)[112] and perception (using dynamic Bayesian networks).[113] Probabilistic algorithms can also be used for filtering, prediction, smoothing and finding explanations for streams of data, helping perception systems to analyze processes that occur over time (e.g., hidden Markov models or Kalman filters).[113]
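The core step of Bayesian inference is Bayes' rule, sketched here for a single yes/no hypothesis. The function name posterior and the numbers in the example are assumptions for illustration; a full Bayesian network chains many such updates together.

```python
def posterior(prior, likelihood, false_positive_rate):
    """P(hypothesis | evidence) by Bayes' rule for a binary hypothesis."""
    # Total probability of seeing the evidence under either hypothesis.
    evidence = prior * likelihood + (1.0 - prior) * false_positive_rate
    return prior * likelihood / evidence

# Hypothetical sensor: fires 90% of the time when the condition holds,
# 5% of the time when it does not; the condition holds 1% of the time.
p = posterior(prior=0.01, likelihood=0.9, false_positive_rate=0.05)
print(round(p, 4))
```

Even with a fairly reliable sensor, the posterior stays well below one half here because the condition is rare, which is the kind of reasoning under uncertainty these tools are designed to capture.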

A key concept from the science of economics is "utility", a measure of how valuable something is to an intelligent agent. Precise mathematical tools have been developed that analyze how an agent can make choices and plan, using decision theory, decision analysis,[114] and information value theory.[115] These tools include models such as Markov decision processes,[116] dynamic decision networks,[113] game theory and mechanism design.[117]
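A minimal sketch of choosing by expected utility follows. The actions, probabilities, and utility values are hypothetical, and the function names expected_utility and best_action are chosen only for illustration.

```python
def expected_utility(outcomes):
    """Sum of probability times utility over an action's possible outcomes."""
    return sum(p * u for p, u in outcomes)

def best_action(actions):
    """Pick the action whose expected utility is highest."""
    return max(actions, key=lambda name: expected_utility(actions[name]))

# Each action maps to (probability, utility) pairs; rain has probability 0.4.
ACTIONS = {
    "take umbrella": [(0.4, 8), (0.6, 6)],
    "leave umbrella": [(0.4, 0), (0.6, 10)],
}
print(best_action(ACTIONS))
```

Decision-theoretic models such as Markov decision processes extend this single choice to sequences of choices whose outcomes unfold over time.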

Classifiers and statistical learning methods

The simplest AI applications can be divided into two types: classifiers ("if shiny then diamond") and controllers ("if diamond then pick up"). Controllers do, however, also classify conditions before inferring actions, and therefore classification forms a central part of many AI systems. Classifiers are functions that use pattern matching to determine the closest match. They can be tuned according to examples, making them very attractive for use in AI. These examples are known as observations or patterns. In supervised learning, each pattern belongs to a certain predefined class. A class is a decision that has to be made. All the observations combined with their class labels are known as a data set. When a new observation is received, that observation is classified based on previous experience.[118]

A classifier can be trained in various ways; there are many statistical and machine learning approaches. The decision tree is the simplest and most widely used symbolic machine learning algorithm.[119] K-nearest neighbor algorithm was the most widely used analogical AI until the mid-1990s.[120] Kernel methods such as the support vector machine (SVM) displaced k-nearest neighbor in the 1990s.[121] The naive Bayes classifier is reportedly the "most widely used learner"[122] at Google, due in part to its scalability.[123] Neural networks are also used for classification.[124]
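As one concrete instance, a k-nearest neighbor classifier can be sketched in a few lines. The two-feature data set (shininess, hardness), the labels, and the function name knn_classify are assumptions for illustration.

```python
import math
from collections import Counter

def knn_classify(train, query, k=3):
    """Label a query point by majority vote among its k nearest examples."""
    nearest = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]

# Hypothetical observations: (shininess, hardness) with a class label.
TRAIN = [
    ((0.90, 0.95), "diamond"),
    ((0.85, 1.00), "diamond"),
    ((0.95, 0.90), "diamond"),
    ((0.20, 0.30), "pebble"),
    ((0.10, 0.25), "pebble"),
    ((0.15, 0.35), "pebble"),
]
print(knn_classify(TRAIN, (0.88, 0.92)))
```

There is no separate training phase: the data set itself is the model, and a new observation is classified purely by its similarity to previous experience.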

Classifier performance depends greatly on the characteristics of the data to be classified, such as the dataset size, distribution of samples across classes, dimensionality, and the level of noise. Model-based classifiers perform well if the assumed model is an extremely good fit for the actual data. Otherwise, if no matching model is available, and if accuracy (rather than speed or scalability) is the sole concern, conventional wisdom is that discriminative classifiers (especially SVM) tend to be more accurate than model-based classifiers such as "naive Bayes" on most practical data sets.[125]

Artificial neural networks

A neural network is an interconnected group of nodes, akin to the vast network of neurons in the human brain.

Neural networks[124] were inspired by the architecture of neurons in the human brain. A simple "neuron" N accepts input from other neurons, each of which, when activated (or "fired"), casts a weighted "vote" for or against whether neuron N should itself activate. Learning requires an algorithm to adjust these weights based on the training data; one simple algorithm (dubbed "fire together, wire together") is to increase the weight between two connected neurons when the activation of one triggers the successful activation of another. Neurons have a continuous spectrum of activation; in addition, neurons can process inputs in a nonlinear way rather than weighing straightforward votes.
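The weighted "vote" and the "fire together, wire together" rule can be sketched as follows. The threshold activation, the learning rate, and the function names are simplifying assumptions for illustration; as noted above, practical neurons use continuous, nonlinear activations.

```python
def neuron_fires(inputs, weights, threshold):
    """Fire when the weighted sum of incoming votes reaches the threshold."""
    total = sum(x * w for x, w in zip(inputs, weights))
    return total >= threshold

def hebbian_update(weights, inputs, fired, rate=0.1):
    """Strengthen each connection whose input was active when the neuron fired."""
    return [w + rate * x * fired for w, x in zip(weights, inputs)]

inputs = [1, 0, 1]                    # which upstream neurons are active
weights = [0.6, -0.4, 0.5]            # positive votes for, negative votes against
fired = neuron_fires(inputs, weights, threshold=1.0)
print(fired, hebbian_update(weights, inputs, fired))
```

Only the connections from active inputs change, and only when the receiving neuron actually fired.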

Modern neural networks model complex relationships between inputs and outputs and find patterns in data. They can learn continuous functions and even digital logical operations. Neural networks can be viewed as a type of mathematical optimization – they perform gradient descent on a multi-dimensional topology that was created by training the network. The most common training technique is the backpropagation algorithm.[126] Other learning techniques for neural networks are Hebbian learning ("fire together, wire together"), GMDH or competitive learning.[127]
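Gradient descent can be illustrated on the smallest possible network: a single sigmoid neuron learning the logical OR operation. This is a sketch under simplifying assumptions (one neuron, cross-entropy loss, full-batch updates, hypothetical names); with only one layer, backpropagation reduces to the single gradient step shown.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def train_neuron(samples, epochs=2000, rate=0.5):
    """Fit weights w1, w2 and bias b by full-batch gradient descent."""
    w1 = w2 = b = 0.0
    for _ in range(epochs):
        g1 = g2 = gb = 0.0
        for (x1, x2), target in samples:
            # For cross-entropy loss, the gradient w.r.t. the pre-activation
            # is simply (prediction - target).
            error = sigmoid(w1 * x1 + w2 * x2 + b) - target
            g1 += error * x1
            g2 += error * x2
            gb += error
        w1 -= rate * g1
        w2 -= rate * g2
        b -= rate * gb
    return w1, w2, b

OR_SAMPLES = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 1)]
w1, w2, b = train_neuron(OR_SAMPLES)
predictions = [round(sigmoid(w1 * x1 + w2 * x2 + b)) for (x1, x2), _ in OR_SAMPLES]
print(predictions)
```

Each pass moves the weights a small step in the direction that reduces the error on the training examples; deeper networks apply the same idea layer by layer.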

The main categories of networks are acyclic or feedforward neural networks (where the signal passes in only one direction) and recurrent neural networks (which allow feedback and short-term memories of previous input events). Among the most popular feedforward networks are perceptrons, multi-layer perceptrons and radial basis networks.[128]

Deep learning

Representing images on multiple layers of abstraction in deep learning[129]

Deep learning[130] uses several layers of neurons between the network's inputs and outputs. The multiple layers can progressively extract higher-level features from the raw input. For example, in image processing, lower layers may identify edges, while higher layers may identify the concepts relevant to a human such as digits or letters or faces.[131] Deep learning has drastically improved the performance of programs in many important subfields of artificial intelligence, including computer vision, speech recognition, image classification[132] and others.

Deep learning often uses convolutional neural networks for many or all of its layers. In a convolutional layer, each neuron receives input from only a restricted area of the previous layer called the neuron's receptive field. This can substantially reduce the number of weighted connections between neurons,[133] and creates a hierarchy similar to the organization of the animal visual cortex.[134]
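The receptive-field idea can be sketched with a plain two-dimensional convolution (strictly, cross-correlation) without padding. The tiny image, the edge-detecting kernel, and the function name conv2d_valid are assumptions for illustration; sharing one kernel across all positions is the usual convention in convolutional layers.

```python
def conv2d_valid(image, kernel):
    """Slide the kernel over the image; each output sees only a small patch."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(image[r + i][c + j] * kernel[i][j]
                for i in range(kh) for j in range(kw))
            for c in range(out_w)
        ]
        for r in range(out_h)
    ]

# Dark region on the left, bright region on the right.
IMAGE = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
]
# Kernel that responds where brightness increases from left to right.
KERNEL = [
    [-1, 1],
    [-1, 1],
]
print(conv2d_valid(IMAGE, KERNEL))
```

Each output value depends on a 2 by 2 patch rather than on all twelve pixels, and the same four weights are reused at every position, which is where the reduction in weighted connections comes from.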

In a recurrent neural network (RNN) the signal will propagate through a layer more than once;[135] thus, an RNN is an example of deep learning.[136] RNNs can be trained by gradient descent;[137] however, long-term gradients that are back-propagated can "vanish" (that is, tend to zero) or "explode" (that is, tend to infinity), a difficulty known as the vanishing gradient problem.[138] The long short-term memory (LSTM) technique can prevent this in most cases.[139]
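Why long-term gradients vanish or explode can be seen in a deliberately simplified setting: a linear recurrent unit with a single scalar weight, where back-propagating through each time step multiplies the gradient by that weight. The function name and the scalar model are assumptions for illustration; real RNNs involve matrices and nonlinearities.

```python
def backpropagated_factor(recurrent_weight, steps):
    """Factor by which a gradient is scaled after passing back through `steps`."""
    factor = 1.0
    for _ in range(steps):
        factor *= recurrent_weight    # one multiplication per time step
    return factor

for weight in (0.5, 1.0, 1.5):
    print(weight, backpropagated_factor(weight, 50))
```

A weight below one in magnitude shrinks the signal geometrically toward zero, while a weight above one blows it up, so only a narrow range of values lets information about distant inputs survive training.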

Specialized languages and hardware

Specialized languages and frameworks for artificial intelligence have been developed, such as Lisp, Prolog, TensorFlow and many others. Hardware developed for AI includes AI accelerators and neuromorphic computing.

Applications

For this project of the artist Joseph Ayerle the AI had to learn the typical patterns in the colors and brushstrokes of Renaissance painter Raphael. The portrait shows the face of the actress Ornella Muti, "painted" by AI in the style of Raphael.


AI is relevant to any intellectual task.[140] Modern artificial intelligence techniques are pervasive and are too numerous to list here.[141] Frequently, when a technique reaches mainstream use, it is no longer considered artificial intelligence; this phenomenon is described as the AI effect.[142]

In the 2010s, AI applications were at the heart of the most commercially successful areas of computing, and have become a ubiquitous feature of daily life. AI is used in search engines (such as Google Search), targeting online advertisements,[143] recommendation systems (offered by Netflix, YouTube or Amazon), driving internet traffic,[144][145] targeted advertising (AdSense, Facebook), virtual assistants (such as Siri or Alexa),[146] autonomous vehicles (including drones, ADAS and self-driving cars), automatic language translation (Microsoft Translator, Google Translate), facial recognition (Apple's Face ID or Facebook's DeepFace), image labeling (used by Facebook, Apple's iPhoto and TikTok), spam filtering and chatbots (such as ChatGPT).

There are also thousands of successful AI applications used to solve problems for specific industries or institutions. A few examples are energy storage,[147] deepfakes,[148] medical diagnosis, military logistics, foreign policy,[149] or supply chain management.

Game playing has been a test of AI's strength since the 1950s. Deep Blue became the first computer chess-playing system to beat a reigning world chess champion, Garry Kasparov, on 11 May 1997.[150] In 2011, in a Jeopardy! quiz show exhibition match, IBM's question answering system, Watson, defeated the two greatest Jeopardy! champions, Brad Rutter and Ken Jennings, by a significant margin.[151] In March 2016, AlphaGo won 4 out of 5 games of Go in a match with Go champion Lee Sedol, becoming the first computer Go-playing system to beat a professional Go player without handicaps.[152] Other programs handle imperfect-information games, such as poker, at a superhuman level; examples include Pluribus[o] and Cepheus.[154] DeepMind in the 2010s developed a "generalized artificial intelligence" that could learn many diverse Atari games on its own.[155]

By 2020, natural language processing systems such as the enormous GPT-3 (then by far the largest artificial neural network) were matching human performance on pre-existing benchmarks, albeit without the system attaining a commonsense understanding of the contents of the benchmarks.[156] DeepMind's AlphaFold 2 (2020) demonstrated the ability to approximate, in hours rather than months, the 3D structure of a protein.[157] Other applications predict the result of judicial decisions,[158] create art (such as poetry or painting) and prove mathematical theorems.

In 2023, a new generation of AI-based text-to-image generators, such as Midjourney, DALL-E, or Stable Diffusion,[159][160] reached such a high level of realism that it led to a significant wave of viral AI-generated photos. Widespread attention was gained by a fake photo of Pope Francis wearing a white puffer coat,[161] the fictional arrest of Donald Trump,[162] and a hoax of an attack on the Pentagon,[163] as well as by the use of these tools in the professional creative arts.[164]


Smart traffic lights

Artificially intelligent traffic lights use cameras with radar, ultrasonic acoustic location sensors, and predictive algorithms to improve traffic flow

Smart traffic lights have been developed at Carnegie Mellon since 2009. Professor Stephen Smith has since started a company, Surtrac, that has installed smart traffic control systems in 22 cities. Installation costs about $20,000 per intersection. Drive time has been reduced by 25% and traffic jam waiting time by 40% at the intersections where the system has been installed.[165]

Intellectual property

AI patent families for functional application categories and sub categories. Computer vision represents 49 percent of patent families related to a functional application in 2016.

In 2019, WIPO reported that AI was the most prolific emerging technology in terms of the number of patent applications and granted patents, while the Internet of things was estimated to be the largest in terms of market size. It was followed, again in market size, by big data technologies, robotics, AI, 3D printing and the fifth generation of mobile services (5G).[166] Since AI emerged in the 1950s, 340,000 AI-related patent applications have been filed by innovators and 1.6 million scientific papers have been published by researchers, with the majority of all AI-related patent filings published since 2013. Companies represent 26 out of the top 30 AI patent applicants, with universities or public research organizations accounting for the remaining four.[167] The ratio of scientific papers to inventions decreased significantly from 8:1 in 2010 to 3:1 in 2016, which is taken to indicate a shift from theoretical research to the use of AI technologies in commercial products and services. Machine learning is the dominant AI technique disclosed in patents and is included in more than one-third of all identified inventions (134,777 machine learning patents filed for a total of 167,038 AI patents filed in 2016), with computer vision being the most popular functional application. AI-related patents not only disclose AI techniques and applications but often also refer to an application field or industry. Twenty application fields were identified in 2016 and included, in order of magnitude: telecommunications (15 percent), transportation (15 percent), life and medical sciences (12 percent), and personal devices, computing and human–computer interaction (11 percent). Other sectors included banking, entertainment, security, industry and manufacturing, agriculture, and networks (including social networks, smart cities and the Internet of things). IBM has the largest portfolio of AI patents with 8,290 patent applications, followed by Microsoft with 5,930 patent applications.[167]

Philosophy

Defining artificial intelligence

Alan Turing wrote in 1950, "I propose to consider the question 'can machines think'?"[168] He advised changing the question from whether a machine "thinks" to "whether or not it is possible for machinery to show intelligent behaviour".[168] He devised the Turing test, which measures the ability of a machine to simulate human conversation.[169] Since we can only observe the behavior of the machine, it does not matter if it is "actually" thinking or literally has a "mind". Turing notes that we cannot determine these things about other people[p] but "it is usual to have a polite convention that everyone thinks".[170]

Russell and Norvig agree with Turing that AI must be defined in terms of "acting" and not "thinking".[171] However, they are critical that the test compares machines to people. "Aeronautical engineering texts," they wrote, "do not define the goal of their field as making 'machines that fly so exactly like pigeons that they can fool other pigeons.'"[172] AI founder John McCarthy agreed, writing that "Artificial intelligence is not, by definition, simulation of human intelligence".[173]

McCarthy defines intelligence as "the computational part of the ability to achieve goals in the world."[174] Another AI founder, Marvin Minsky similarly defines it as "the ability to solve hard problems".[175] These definitions view intelligence in terms of well-defined problems with well-defined solutions, where both the difficulty of the problem and the performance of the program are direct measures of the "intelligence" of the machine—and no other philosophical discussion is required, or may not even be possible.

Another definition has been adopted by Google,[176] a major practitioner in the field of AI. This definition stipulates the ability of systems to synthesize information as the manifestation of intelligence, similar to the way it is defined in biological intelligence.

Evaluating approaches to AI

No established unifying theory or paradigm has guided AI research for most of its history.[q] The unprecedented success of statistical machine learning in the 2010s eclipsed all other approaches (so much so that some sources, especially in the business world, use the term "artificial intelligence" to mean "machine learning with neural networks"). This approach is mostly sub-symbolic, neat, soft and narrow (see below). Critics argue that these questions (symbolic versus sub-symbolic, neat versus scruffy, soft versus hard, narrow versus general) may have to be revisited by future generations of AI researchers.

Symbolic AI and its limits

Symbolic AI (or "GOFAI")[178] simulated the high-level conscious reasoning that people use when they solve puzzles, express legal reasoning and do mathematics. Symbolic programs were highly successful at "intelligent" tasks such as algebra or IQ tests. In the 1960s, Newell and Simon proposed the physical symbol systems hypothesis: "A physical symbol system has the necessary and sufficient means of general intelligent action."[179]

However, the symbolic approach failed on many tasks that humans solve easily, such as learning, recognizing an object or commonsense reasoning. Moravec's paradox is the discovery that high-level "intelligent" tasks were easy for AI, but low-level "instinctive" tasks were extremely difficult.[180] Philosopher Hubert Dreyfus had argued since the 1960s that human expertise depends on unconscious instinct rather than conscious symbol manipulation, and on having a "feel" for the situation rather than explicit symbolic knowledge.[181] Although his arguments were ridiculed and ignored when they were first presented, AI research eventually came to agree.[r][53]

The issue is not resolved: sub-symbolic reasoning can make many of the same inscrutable mistakes that human intuition does, such as algorithmic bias. Critics such as Noam Chomsky argue continuing research into symbolic AI will still be necessary to attain general intelligence,[183][184] in part because sub-symbolic AI is a move away from explainable AI: it can be difficult or impossible to understand why a modern statistical AI program made a particular decision. The emerging field of neuro-symbolic artificial intelligence attempts to bridge the two approaches.

Neat vs. scruffy

"Neats" hope that intelligent behavior can be described using simple, elegant principles (such as logic, optimization, or neural networks). "Scruffies" expect that it necessarily requires solving a large number of unrelated problems (especially in areas like common sense reasoning). This issue was actively discussed in the 1970s and 1980s,[185] but in the 1990s mathematical methods and solid scientific standards became the norm, a transition that Russell and Norvig termed "the victory of the neats".[186]

Soft vs. hard computing

Finding a provably correct or optimal solution is intractable for many important problems.[52] Soft computing is a set of techniques, including genetic algorithms, fuzzy logic and neural networks, that are tolerant of imprecision, uncertainty, partial truth and approximation. Soft computing was introduced in the late 1980s, and most successful AI programs in the 21st century are examples of soft computing with neural networks.
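As a rough illustration of what "partial truth" means in one soft computing technique, the following Python sketch implements a tiny piece of fuzzy logic. The triangular membership function and the min/max/complement operators are one common textbook choice, not the only one. The set names `warm` and `hot` and their temperature boundaries are hypothetical values chosen only for this example.

```python
def triangular(x, left, peak, right):
    """Degree (0.0 to 1.0) to which x belongs to a triangular fuzzy set."""
    if x <= left or x >= right:
        return 0.0
    if x <= peak:
        return (x - left) / (peak - left)
    return (right - x) / (right - peak)


def fuzzy_and(a, b):
    """Conjunction of two truth degrees (minimum operator)."""
    return min(a, b)


def fuzzy_or(a, b):
    """Disjunction of two truth degrees (maximum operator)."""
    return max(a, b)


def fuzzy_not(a):
    """Complement of a truth degree."""
    return 1.0 - a


# Hypothetical fuzzy sets over temperature in degrees Celsius.
def warm(t):
    return triangular(t, 15.0, 25.0, 35.0)


def hot(t):
    return triangular(t, 25.0, 40.0, 55.0)


if __name__ == "__main__":
    t = 28.0
    # A single reading is partly "warm" and partly "hot" at the same time.
    print("warm:", warm(t))
    print("hot:", hot(t))
    print("warm AND hot:", fuzzy_and(warm(t), hot(t)))
    print("warm OR hot:", fuzzy_or(warm(t), hot(t)))
```

Unlike classical logic, where a statement such as "28 degrees is hot" must be either true or false, each statement here receives a degree of truth between 0 and 1, which is the tolerance for imprecision that the paragraph above describes.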

Narrow vs. general AI

AI researchers are divided as to whether to pursue the goals of artificial general intelligence and superintelligence (general AI) directly or to solve as many specific problems as possible (narrow AI) in hopes these solutions will lead indirectly to the field's long-term goals.[187][188] General intelligence is difficult to define and difficult to measure, and modern AI has had more verifiable successes by focusing on specific problems with specific solutions. The experimental sub-field of artificial general intelligence studies this area exclusively.

Machine consciousness, sentience and mind

The philosophy of mind does not know whether a machine can have a mind, consciousness and mental states, in the same sense that human beings do. This issue considers the internal experiences of the machine, rather than its external behavior. Mainstream AI research considers this issue irrelevant because it does not affect the goals of the field. Stuart Russell and Peter Norvig observe that most AI researchers "don't care about the [philosophy of AI] – as long as the program works, they don't care whether you call it a simulation of intelligence or real intelligence."[189] However, the question has become central to the philosophy of mind. It is also typically the central question at issue in artificial intelligence in fiction.

Consciousness

David Chalmers identified two problems in understanding the mind, which he named the "hard" and "easy" problems of consciousness.[190] The easy problem is understanding how the brain processes signals, makes plans and controls behavior. The hard problem is explaining how this feels or why it should feel like anything at all, assuming we are right in thinking that it truly does feel like something (Dennett's consciousness illusionism says this is an illusion). While human information processing is easy to explain, human subjective experience is difficult to explain. For example, it is easy to imagine a color-blind person who has learned to identify which objects in their field of view are red, but it is not clear what would be required for the person to know what red looks like.[191]

Computationalism and functionalism

Computationalism is the position in the philosophy of mind that the human mind is an information processing system and that thinking is a form of computing. Computationalism argues that the relationship between mind and body is similar or identical to the relationship between software and hardware and thus may be a solution to the mind-body problem. This philosophical position was inspired by the work of AI researchers and cognitive scientists in the 1960s and was originally proposed by philosophers Jerry Fodor and Hilary Putnam.[192]

Philosopher John Searle characterized this position as "strong AI": "The appropriately programmed computer with the right inputs and outputs would thereby have a mind in exactly the same sense human beings have minds."[s] Searle counters this assertion with his Chinese room argument, which attempts to show that, even if a machine perfectly simulates human behavior, there is still no reason to suppose it also has a mind.[195]

Robot rights

If a machine has a mind and subjective experience, then it may also have sentience (the ability to feel), and if so, then it could also suffer, and thus it would be entitled to certain rights.[196] Any hypothetical robot rights would lie on a spectrum with animal rights and human rights.[197] This issue has been considered in fiction for centuries,[198] and is now being considered by, for example, California's Institute for the Future; however, critics argue that the discussion is premature.[199]

Future

Superintelligence

A superintelligence, hyperintelligence, or superhuman intelligence, is a hypothetical agent that would possess intelligence far surpassing that of the brightest and most gifted human mind. Superintelligence may also refer to the form or degree of intelligence possessed by such an agent.[188]

If research into artificial general intelligence produced sufficiently intelligent software, it might be able to reprogram and improve itself. The improved software would be even better at improving itself, leading to recursive self-improvement.[200] Its intelligence would increase exponentially in an intelligence explosion and could dramatically surpass humans. Science fiction writer Vernor Vinge named this scenario the "singularity".[201] Because it is difficult or impossible to know the limits of intelligence or the capabilities of superintelligent machines, the technological singularity is an occurrence beyond which events are unpredictable or even unfathomable.[202]

Robot designer Hans Moravec, cyberneticist Kevin Warwick, and inventor Ray Kurzweil have predicted that humans and machines will merge in the future into cyborgs that are more capable and powerful than either. This idea, called transhumanism, has roots in the writings of Aldous Huxley and Robert Ettinger.[203]

Edward Fredkin argues that "artificial intelligence is the next stage in evolution", an idea first proposed by Samuel Butler's "Darwin among the Machines" as far back as 1863, and expanded upon by George Dyson in his book of the same name in 1998.[204]

Risks

Technological unemployment

In the past, technology has tended to increase rather than reduce total employment, but economists acknowledge that "we're in uncharted territory" with AI.[205] A survey of economists showed disagreement about whether the increasing use of robots and AI will cause a substantial increase in long-term unemployment, but they generally agree that it could be a net benefit if productivity gains are redistributed.[206] Subjective estimates of the risk vary widely; for example, Michael Osborne and Carl Benedikt Frey estimate 47% of U.S. jobs are at "high risk" of potential automation, while an OECD report classifies only 9% of U.S. jobs as "high risk".[t][208] The methodology of speculating about future employment levels has been criticised as lacking evidential foundation, and for implying that technology (rather than social policy) creates unemployment (as opposed to redundancies).[209]

Unlike previous waves of automation, many middle-class jobs may be eliminated by artificial intelligence; The Economist states that "the worry that AI could do to white-collar jobs what steam power did to blue-collar ones during the Industrial Revolution" is "worth taking seriously".[210] Jobs at extreme risk range from paralegals to fast food cooks, while job demand is likely to increase for care-related professions ranging from personal healthcare to the clergy.[211]

Bad actors and weaponized AI

AI provides a number of tools that are particularly useful for authoritarian governments: smart spyware, face recognition and voice recognition allow widespread surveillance; such surveillance allows machine learning to classify potential enemies of the state and can prevent them from hiding; recommendation systems can precisely target propaganda and misinformation for maximum effect; deepfakes aid in producing misinformation; advanced AI can make centralized decision making more competitive with liberal and decentralized systems such as markets.[212]

Terrorists, criminals and rogue states may use other forms of weaponized AI such as advanced digital warfare and lethal autonomous weapons. By 2015, over fifty countries were reported to be researching battlefield robots.[213]

Machine-learning AI is also able to design tens of thousands of toxic molecules in a matter of hours.[214]

Algorithmic bias

AI programs can become biased after learning from real-world data. Bias is not typically introduced by the system designers but is learned by the program, and thus the programmers are often unaware that it exists.[215] Bias can be inadvertently introduced by the way training data is selected.[216] It can also emerge from correlations: AI is used to classify individuals into groups and then make predictions assuming that the individual will resemble other members of the group. In some cases, this assumption may be unfair.[217] An example of this is COMPAS, a commercial program widely used by U.S. courts to assess the likelihood of a defendant becoming a recidivist. ProPublica claims that the COMPAS-assigned recidivism risk level of black defendants is far more likely to be overestimated than that of white defendants, despite the fact that the program was not told the races of the defendants.[218]
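One way to make a claim of "overestimated risk" measurable is to compare false positive rates between groups, that is, the share of people who did not reoffend but were nevertheless labelled high risk. The Python sketch below shows that calculation. It is only an illustration under stated assumptions: the function name `false_positive_rates`, the record layout and the toy records are all hypothetical, the numbers are invented for the example and are not COMPAS data, and the sketch is not presented as the method used in the analysis cited above.

```python
from collections import defaultdict


def false_positive_rates(records):
    """Return {group: false positive rate}.

    records: iterable of (group, predicted_high_risk, reoffended) tuples.
    The false positive rate is the fraction of people who did NOT
    reoffend but were still predicted to be high risk.
    """
    flagged = defaultdict(int)    # non-reoffenders labelled high risk
    negatives = defaultdict(int)  # all non-reoffenders
    for group, predicted_high_risk, reoffended in records:
        if not reoffended:
            negatives[group] += 1
            if predicted_high_risk:
                flagged[group] += 1
    return {g: flagged[g] / n for g, n in negatives.items() if n > 0}


# Invented toy records for illustration only; NOT real COMPAS data.
toy_records = [
    ("A", True, False), ("A", True, False), ("A", False, False),
    ("A", False, False), ("A", True, True),
    ("B", True, False), ("B", False, False), ("B", False, False),
    ("B", False, False), ("B", True, True),
]

if __name__ == "__main__":
    for group, rate in sorted(false_positive_rates(toy_records).items()):
        print(f"group {group}: false positive rate = {rate:.2f}")
```

Note that the group label is used here only to audit the predictions after the fact. A disparity of this kind can appear even when the label was never an input to the model, because other input features may be correlated with it.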

Health equity issues may also be exacerbated when many-to-many mappings are done without taking steps to ensure equity for populations at risk for bias. At this time, equity-focused tools and regulations are not in place to ensure equitable application, representation and usage.[219] Other examples where algorithmic bias can lead to unfair outcomes are when AI is used for credit rating or hiring.

At its 2022 Conference on Fairness, Accountability, and Transparency (ACM FAccT 2022) in Seoul, South Korea, the Association for Computing Machinery presented and published findings recommending that, until AI and robotics systems are demonstrated to be free of bias mistakes, they should be regarded as unsafe and the use of self-learning neural networks trained on vast, unregulated sources of flawed internet data should be curtailed.[220]

Existential risk

Superintelligent AI may be able to improve itself to the point that humans could not control it. This could, as physicist Stephen Hawking puts it, "spell the end of the human race".[221] Philosopher Nick Bostrom argues that sufficiently intelligent AI, if it chooses actions based on achieving some goal, will exhibit convergent behavior such as acquiring resources or protecting itself from being shut down. If this AI's goals do not fully reflect humanity's, it might need to harm humanity to acquire more resources or prevent itself from being shut down, ultimately to better achieve its goal. He concludes that AI poses a risk to mankind, however humble or "friendly" its stated goals might be.[222] Political scientist Charles T. Rubin argues that "any sufficiently advanced benevolence may be indistinguishable from malevolence." Humans should not assume machines or robots would treat us favorably because there is no a priori reason to believe that they would share our system of morality.[223]

The opinion of experts and industry insiders is mixed, with sizable fractions both concerned and unconcerned by risk from eventual superhumanly-capable AI.[224] Stephen Hawking, Microsoft founder Bill Gates, history professor Yuval Noah Harari, and SpaceX founder Elon Musk have all expressed serious misgivings about the future of AI.[225] Prominent tech titans including Peter Thiel, Amazon Web Services and Musk have committed more than $1 billion to nonprofit companies that champion responsible AI development, such as OpenAI and the Future of Life Institute.[226] Mark Zuckerberg (CEO, Facebook) has said that artificial intelligence is helpful in its current form and will continue to assist humans.[227] Other experts argue that the risks are far enough in the future to not be worth researching, or that humans will be valuable from the perspective of a superintelligent machine.[228] Rodney Brooks, in particular, has said that "malevolent" AI is still centuries away.[u]

Copyright

AI's decision-making abilities raise questions of legal responsibility and of the copyright status of created works. These issues are being refined in various jurisdictions.[230] However, criticism has been raised about whether and to what extent works created with the assistance of AI are protected by copyright law.[231]

Ethical machines

Friendly AI are machines that have been designed from the beginning to minimize risks and to make choices that benefit humans. Eliezer Yudkowsky, who coined the term, argues that developing friendly AI should be a higher research priority: it may require a large investment and it must be completed before AI becomes an existential risk.[232]

Machines with intelligence have the potential to use their intelligence to make ethical decisions. The field of machine ethics provides machines with ethical principles and procedures for resolving ethical dilemmas.[233] Machine ethics is also called machine morality, computational ethics or computational morality,[233] and was founded at an AAAI symposium in 2005.[234]

Other approaches include Wendell Wallach's "artificial moral agents"[235] and Stuart J. Russell's three principles for developing provably beneficial machines.[236]

Regulation

The regulation of artificial intelligence is the development of public sector policies and laws for promoting and regulating artificial intelligence (AI); it is therefore related to the broader regulation of algorithms.[237] The regulatory and policy landscape for AI is an emerging issue in jurisdictions globally.[238] Between 2016 and 2020, more than 30 countries adopted dedicated strategies for AI.[47] Most EU member states had released national AI strategies, as had Canada, China, India, Japan, Mauritius, the Russian Federation, Saudi Arabia, United Arab Emirates, US and Vietnam. Others were in the process of elaborating their own AI strategy, including Bangladesh, Malaysia and Tunisia.[47] The Global Partnership on Artificial Intelligence was launched in June 2020, stating a need for AI to be developed in accordance with human rights and democratic values, to ensure public confidence and trust in the technology.[47] Henry Kissinger, Eric Schmidt, and Daniel Huttenlocher published a joint statement in November 2021 calling for a government commission to regulate AI.[239] In 2023, OpenAI leaders published recommendations for the governance of superintelligence, which they believe may happen in less than 10 years.[240]

In fiction

The word "robot" itself was coined by Karel Čapek in his 1921 play R.U.R., the title standing for "Rossum's Universal Robots".

Thought-capable artificial beings have appeared as storytelling devices since antiquity,[18] and have been a persistent theme in science fiction.[20]

A common trope in these works began with Mary Shelley's Frankenstein, where a human creation becomes a threat to its masters. This includes such works as Arthur C. Clarke's and Stanley Kubrick's 2001: A Space Odyssey (both 1968), with HAL 9000, the murderous computer in charge of the Discovery One spaceship, as well as The Terminator (1984) and The Matrix (1999). In contrast, the rare loyal robots such as Gort from The Day the Earth Stood Still (1951) and Bishop from Aliens (1986) are less prominent in popular culture.[241]

Isaac Asimov introduced the Three Laws of Robotics in many books and stories, most notably the "Multivac" series about a super-intelligent computer of the same name. Asimov's laws are often brought up during lay discussions of machine ethics;[242] while almost all artificial intelligence researchers are familiar with Asimov's laws through popular culture, they generally consider the laws useless for many reasons, one of which is their ambiguity.[243]

Transhumanism (the merging of humans and machines) is explored in the manga Ghost in the Shell and the science-fiction series Dune.

Several works use AI to force us to confront the fundamental question of what makes us human, showing us artificial beings that have the ability to feel, and thus to suffer. This appears in Karel Čapek's R.U.R., the films A.I. Artificial Intelligence and Ex Machina, as well as the novel Do Androids Dream of Electric Sheep?, by Philip K. Dick. Dick considers the idea that our understanding of human subjectivity is altered by technology created with artificial intelligence.[244]

Explanatory notes

  1. ^ a b This list of intelligent traits is based on the topics covered by the major AI textbooks, including: Russell & Norvig (2003), Luger & Stubblefield (2004), Poole, Mackworth & Goebel (1998) and Nilsson (1998)
  2. ^ This statement comes from the proposal for the Dartmouth workshop of 1956, which reads: "Every aspect of learning or any other feature of intelligence can be so precisely described that a machine can be made to simulate it."[12]
  3. ^ Russell and Norvig note in the textbook Artificial Intelligence: A Modern Approach (4th ed.), section 1.5: "In the longer term, we face the difficult problem of controlling superintelligent AI systems that may evolve in unpredictable ways." while referring to computer scientists, philosophers, and technologists.
  4. ^ Daniel Crevier wrote, "the conference is generally recognized as the official birthdate of the new science."[26] Russell and Norvig call the conference "the birth of artificial intelligence."[27]
  5. ^ Russell and Norvig wrote "for the next 20 years the field would be dominated by these people and their students."[27]
  6. ^ Russell and Norvig wrote "it was astonishing whenever a computer did anything kind of smartish".[29]
  7. ^ The programs described are Arthur Samuel's checkers program for the IBM 701, Daniel Bobrow's STUDENT, Newell and Simon's Logic Theorist and Terry Winograd's SHRDLU.
  8. ^ Embodied approaches to AI[39] were championed by Hans Moravec[40] and Rodney Brooks[41] and went by many names: Nouvelle AI,[41] Developmental robotics,[42] situated AI, behavior-based AI as well as others. A similar movement in cognitive science was the embodied mind thesis.
  9. ^ Clark wrote: "After a half-decade of quiet breakthroughs in artificial intelligence, 2015 has been a landmark year. Computers are smarter and learning faster than ever."[9]
  10. ^ Alan Turing discussed the centrality of learning as early as 1950, in his classic paper "Computing Machinery and Intelligence".[67] In 1956, at the original Dartmouth AI summer conference, Ray Solomonoff wrote a report on unsupervised probabilistic machine learning: "An Inductive Inference Machine".[68]
  11. ^ This is a form of Tom Mitchell's widely quoted definition of machine learning: "A computer program is set to learn from an experience E with respect to some task T and some performance measure P if its performance on T as measured by P improves with experience E."[69]
  12. ^ Alan Turing suggested in "Computing Machinery and Intelligence" that a "thinking machine" would need to be educated like a child.[67] Developmental robotics is a modern version of the idea.[42]
  13. ^ Compared with symbolic logic, formal Bayesian inference is computationally expensive. For inference to be tractable, most observations must be conditionally independent of one another. AdSense uses a Bayesian network with over 300 million edges to learn which ads to serve.[108]
  14. ^ Expectation-maximization, one of the most popular algorithms in machine learning, allows clustering in the presence of unknown latent variables.[110]
  15. ^ The Smithsonian reports: "Pluribus has bested poker pros in a series of six-player no-limit Texas Hold'em games, reaching a milestone in artificial intelligence research. It is the first bot to beat humans in a complex multiplayer competition."[153]
  16. ^ See Problem of other minds
  17. ^ Nils Nilsson wrote in 1983: "Simply put, there is wide disagreement in the field about what AI is all about."[177]
  18. ^ Daniel Crevier wrote that "time has proven the accuracy and perceptiveness of some of Dreyfus's comments. Had he formulated them less aggressively, constructive actions they suggested might have been taken much earlier."[182]
  19. ^ Searle presented this definition of "Strong AI" in 1999.[193] Searle's original formulation was "The appropriately programmed computer really is a mind, in the sense that computers given the right programs can be literally said to understand and have other cognitive states."[194] Strong AI is defined similarly by Russell and Norvig: "The assertion that machines could possibly act intelligently (or, perhaps better, act as if they were intelligent) is called the 'weak AI' hypothesis by philosophers, and the assertion that machines that do so are actually thinking (as opposed to simulating thinking) is called the 'strong AI' hypothesis."[189]
  20. ^ See table 4; 9% is both the OECD average and the US average.[207]
  21. ^ Rodney Brooks writes, "I think it is a mistake to be worrying about us developing malevolent AI anytime in the next few hundred years. I think the worry stems from a fundamental error in not distinguishing the difference between the very real recent advances in a particular aspect of AI and the enormity and complexity of building sentient volitional intelligence."[229]

References

  1. ^ Winston, P H (1984). Artificial intelligence. Second edition. United States.
  2. ^ Google (2016).
  3. ^ McCorduck (2004), p. 204.
  4. ^ Schank (1991), p. 38.
  5. ^ Crevier (1993), p. 109.
  6. ^ a b c Funding initiatives in the early 80s: Fifth Generation Project (Japan), Alvey (UK), Microelectronics and Computer Technology Corporation (US), Strategic Computing Initiative (US):
  7. ^ a b First AI Winter, Lighthill report, Mansfield Amendment
  8. ^ a b Second AI Winter:
  9. ^ a b c d Clark (2015b).
  10. ^ a b AI widely used in the late 1990s:
  11. ^ a b Pennachin & Goertzel (2007); Roberts (2016)
  12. ^ McCarthy et al. (1955).
  13. ^ Newquist (1994), pp. 45–53.
  14. ^ E McGaughey, 'Will Robots Automate Your Job Away? Full Employment, Basic Income, and Economic Democracy' (2022) 51(3) Industrial Law Journal 511–559
  15. ^ "Is AI Overhyped in 2022? Getting the Truth About the True Power". Analytics Insight. 21 March 2022. Archived from the original on 10 March 2023. Retrieved 11 March 2023.
  16. ^ Giles, Martin (13 September 2018). "Artificial intelligence is often overhyped—and here's why that's dangerous". MIT Technology Review. Archived from the original on 11 March 2023. Retrieved 11 March 2023.
  17. ^ Basen, Ira (21 February 2020). "Is AI overhyped? Researchers weigh in on technology's promise and problems". Canadian Broadcasting Corporation. Archived from the original on 11 March 2023. Retrieved 11 March 2023.
  18. ^ a b AI in myth:
  19. ^ McCorduck (2004), pp. 17–25.
  20. ^ a b McCorduck (2004), pp. 340–400.
  21. ^ Berlinski (2000).
  22. ^ AI's immediate precursors:
  23. ^ Russell & Norvig (2009), p. 16.
  24. ^ Manyika (2022), p. 9.
  25. ^ Manyika (2022), p. 10.
  26. ^ Crevier (1993), pp. 47–49.
  27. ^ a b Russell & Norvig (2003), p. 17.
  28. ^ Dartmouth workshop: The proposal:
  29. ^ Russell & Norvig (2003), p. 18.
  30. ^ Successful Symbolic AI programs:
  31. ^ AI heavily funded in the 1960s:
  32. ^ Howe (1994).
  33. ^ Newquist (1994), p. 86.
  34. ^ Simon (1965, p. 96) quoted in Crevier (1993, p. 109)
  35. ^ Minsky (1967, p. 2) quoted in Crevier (1993, p. 109)
  36. ^ Lighthill (1973).
  37. ^ Expert systems:
  38. ^ Nilsson (1998), p. 7.
  39. ^ McCorduck (2004), pp. 454–462.
  40. ^ Moravec (1988).
  41. ^ a b Brooks (1990).
  42. ^ a b Developmental robotics:
  43. ^ Revival of connectionism:
  44. ^ Formal and narrow methods adopted in the 1990s:
  45. ^ McKinsey (2018).
  46. ^ MIT Sloan Management Review (2018); Lorica (2017)
  47. ^ a b c d UNESCO (2021).
  48. ^ Lanier, Jaron (20 April 2023). "Annals of Artificial Intelligence - There Is No A.I. - There are ways of controlling the new technology—but first we have to stop mythologizing it". The New Yorker. Archived from the original on 23 April 2023. Retrieved 24 April 2023.
  49. ^ Bogdan, Dennis (2 February 2023). "Comment - In the Age of A.I., Major in Being Human - David Brooks". The New York Times. Archived from the original on 3 February 2023. Retrieved 24 April 2023.
  50. ^ Problem solving, puzzle solving, game playing and deduction:
  51. ^ Uncertain reasoning:
  52. ^ a b c Intractability and efficiency and the combinatorial explosion:
  53. ^ a b c Psychological evidence of the prevalence of sub-symbolic reasoning and knowledge:
  54. ^ Knowledge representation and knowledge engineering:
  55. ^ Russell & Norvig (2003), pp. 320–328.
  56. ^ a b c Representing categories and relations: Semantic networks, description logics, inheritance (including frames and scripts):
  57. ^ a b Representing events and time: Situation calculus, event calculus, fluent calculus (including solving the frame problem):
  58. ^ a b Causal calculus:
  59. ^ a b Representing knowledge about knowledge: Belief calculus, modal logics:
  60. ^ a b Default reasoning, frame problem, default logic, non-monotonic logics, circumscription, closed world assumption, abduction: (Poole et al. place abduction under "default reasoning". Luger et al. place this under "uncertain reasoning").
  61. ^ a b Breadth of commonsense knowledge:
  62. ^ Smoliar & Zhang (1994).
  63. ^ Neumann & Möller (2008).
  64. ^ Kuperman, Reichley & Bailey (2006).
  65. ^ McGarry (2005).
  66. ^ Bertini, Del Bimbo & Torniai (2006).
  67. ^ a b Turing (1950).
  68. ^ Solomonoff (1956).
  69. ^ Russell & Norvig (2003), pp. 649–788.
  70. ^ Learning:
  71. ^ Reinforcement learning:
  72. ^ The Economist (2016).
  73. ^ Jordan & Mitchell (2015).
  74. ^ Natural language processing (NLP):
  75. ^ Applications of NLP:
  76. ^ Modern statistical approaches to NLP:
  77. ^ Vincent (2019).
  78. ^ Machine perception:
  79. ^ Speech recognition:
  80. ^ Object recognition:
  81. ^ Computer vision:
  82. ^ MIT AIL (2014).
  83. ^ Affective computing:
  84. ^ Waddell (2018).
  85. ^ Poria et al. (2017).
  86. ^ The Society of Mind: Moravec's "golden spike": Multi-agent systems, hybrid intelligent systems, agent architectures, cognitive architecture:
  87. ^ Domingos (2015), Chpt. 9.
  88. ^ Artificial brain as an approach to AGI: A few of the people who make some form of the argument:
  89. ^ Search algorithms:
  90. ^ Forward chaining, backward chaining, Horn clauses, and logical deduction as search:
  91. ^ State space search and planning:
  92. ^ Moving and configuration space:
  93. ^ Uninformed searches (breadth first search, depth-first search and general state space search):
  94. ^ Heuristic or informed searches (e.g., greedy best first and A*):
  95. ^ Tecuci (2012).
  96. ^ Optimization searches:
  97. ^ Genetic programming and genetic algorithms:
  98. ^ Artificial life and society based learning:
  99. ^ Logic:
  100. ^ Satplan:
  101. ^ Explanation based learning, relevance based learning, inductive logic programming, case based reasoning:
  102. ^ Propositional logic:
  103. ^ First-order logic and features such as equality:
  104. ^ Fuzzy logic:
  105. ^ Abe, Jair Minoro; Nakamatsu, Kazumi (2009). "Multi-agent Systems and Paraconsistent Knowledge". Knowledge Processing and Decision Making in Agent-Based Systems. Studies in Computational Intelligence. Vol. 170. Springer Berlin Heidelberg. pp. 101–121. doi:10.1007/978-3-540-88049-3_5. eISSN 1860-9503. ISBN 978-3-540-88048-6. ISSN 1860-949X. Archived from the original on 9 February 2023. Retrieved 2 August 2022.
  106. ^ Stochastic methods for uncertain reasoning:
  107. ^ Bayesian networks:
  108. ^ Domingos (2015), chapter 6.
  109. ^ Bayesian inference algorithm:
  110. ^ Domingos (2015), p. 210.
  111. ^ Bayesian learning and the expectation-maximization algorithm:
  112. ^ Bayesian decision theory and Bayesian decision networks:
  113. ^ a b c Stochastic temporal models: Dynamic Bayesian networks: Hidden Markov model: Kalman filters:
  114. ^ Decision theory and decision analysis:
  115. ^ Information value theory:
  116. ^ Markov decision processes and dynamic decision networks:
  117. ^ Game theory and mechanism design:
  118. ^ Statistical learning methods and classifiers:
  119. ^ Decision tree:
  120. ^ K-nearest neighbor algorithm:
  121. ^ Kernel methods such as the support vector machine: Gaussian mixture model:
  122. ^ Domingos (2015), p. 152.
  123. ^ Naive Bayes classifier:
  124. ^ a b Neural networks:
  125. ^ Classifier performance:
  126. ^ Backpropagation: Paul Werbos' introduction of backpropagation to AI: Automatic differentiation, an essential precursor:
  127. ^ Competitive learning, Hebbian coincidence learning, Hopfield networks and attractor networks:
  128. ^ Feedforward neural networks, perceptrons and radial basis networks:
  129. ^ Schulz & Behnke (2012).
  130. ^ Deep learning:
  131. ^ Deng & Yu (2014), pp. 199–200.
  132. ^ Ciresan, Meier & Schmidhuber (2012).
  133. ^ Habibi (2017).
  134. ^ Fukushima (2007).
  135. ^ Recurrent neural networks, Hopfield nets:
  136. ^ Schmidhuber (2015).
  137. ^ Werbos (1988); Robinson & Fallside (1987); Williams & Zipser (1994)
  138. ^ Goodfellow, Bengio & Courville (2016); Hochreiter (1991)
  139. ^ Hochreiter & Schmidhuber (1997); Gers, Schraudolph & Schmidhuber (2002)
  140. ^ Russell & Norvig (2009), p. 1.
  141. ^ European Commission (2020), p. 1.
  142. ^ CNN (2006).
  143. ^ Targeted advertising:
  144. ^ Lohr (2016).
  145. ^ Smith (2016).
  146. ^ Rowinski (2013).
  147. ^ Frangoul (2019).
  148. ^ Brown (2019).
  149. ^ "Artificial intelligence, immune to fear or favour, is helping to make China's foreign policy | South China Morning Post". 25 March 2023. Archived from the original on 25 March 2023. Retrieved 26 March 2023.
  150. ^ McCorduck (2004), pp. 480–483.
  151. ^ Markoff (2011).
  152. ^ Google (2016); BBC (2016)
  153. ^ Solly (2019).
  154. ^ Bowling et al. (2015).
  155. ^ Sample (2017).
  156. ^ Anadiotis (2020).
  157. ^ Heath (2020).
  158. ^ Aletras et al. (2016).
  159. ^ Verma, Pranshu; Schaul, Kevin. "See why AI like ChatGPT has gotten so good, so fast". Washington Post. Retrieved 28 May 2023.
  160. ^ "Will AI-generated images create a new crisis for fact-checkers? Experts are not so sure". Reuters Institute for the Study of Journalism. 11 April 2023. Retrieved 28 May 2023.
  161. ^ Novak, Matt. "That Viral Image Of Pope Francis Wearing A White Puffer Coat Is Totally Fake". Forbes. Retrieved 28 May 2023.
  162. ^ "Trump shares deepfake photo of himself praying as AI images of arrest spread online". The Independent. 24 March 2023. Retrieved 28 May 2023.
  163. ^ Oremus, Will; Harwell, Drew; Armus, Teo (22 May 2023). "A tweet about a Pentagon explosion was fake. It still went viral". Washington Post. ISSN 0190-8286. Retrieved 28 May 2023.
  164. ^ Kolirin, Lianne (18 April 2023). "Artist rejects photo prize after AI-generated image wins award". CNN. Retrieved 28 May 2023.
  165. ^ "Going Nowhere Fast? Smart Traffic Lights Can Help Ease Gridlock". 18 May 2022. Archived from the original on 22 December 2022. Retrieved 22 December 2022.
  166. ^ "Intellectual Property and Frontier Technologies". WIPO. Archived from the original on 2 April 2022. Retrieved 30 March 2022.
  167. ^ a b "WIPO Technology Trends 2019 – Artificial Intelligence" (PDF). WIPO. 2019. Archived (PDF) from the original on 9 October 2022.
  168. ^ a b Turing (1950), p. 1.
  169. ^ Turing's original publication of the Turing test in "Computing machinery and intelligence": Historical influence and philosophical implications:
  170. ^ Turing (1950), Under "The Argument from Consciousness".
  171. ^ Russell & Norvig (2021), chpt. 2.
  172. ^ Russell & Norvig (2021), p. 3.
  173. ^ Maker (2006).
  174. ^ McCarthy 1999.
  175. ^ Minsky (1986).
  176. ^ "Artificial intelligence - Google Search". www.google.com. Archived from the original on 1 December 2022. Retrieved 5 November 2022.
  177. ^ Nilsson (1983), p. 10.
  178. ^ Haugeland (1985), pp. 112–117.
  179. ^ Physical symbol system hypothesis: Historical significance:
  180. ^ Moravec's paradox:
  181. ^ Dreyfus' critique of AI: Historical significance and philosophical implications:
  182. ^ Crevier (1993), p. 125.
  183. ^ Langley (2011).
  184. ^ Katz (2012).
  185. ^ Neats vs. scruffies, the historic debate: A classic example of the "scruffy" approach to intelligence: A modern example of neat AI and its aspirations:
  186. ^ Russell & Norvig (2003), pp. 25–26.
  187. ^ Pennachin & Goertzel (2007).
  188. ^ a b Roberts (2016).
  189. ^ a b Russell & Norvig (2003), p. 947.
  190. ^ Chalmers (1995).
  191. ^ Dennett (1991).
  192. ^ Horst (2005).
  193. ^ Searle (1999).
  194. ^ Searle (1980), p. 1.
  195. ^ Searle's Chinese room argument: Discussion:
  196. ^ Robot rights:
  197. ^ Evans (2015).
  198. ^ McCorduck (2004), pp. 19–25.
  199. ^ Henderson (2007).
  200. ^ Omohundro (2008).
  201. ^ Vinge (1993).
  202. ^ Russell & Norvig (2003), p. 963.
  203. ^ Transhumanism:
  204. ^ AI as evolution:
  205. ^ Ford & Colvin (2015); McGaughey (2022)
  206. ^ IGM Chicago (2017).
  207. ^ Arntz, Gregory & Zierahn (2016), p. 33.
  208. ^ Lohr (2017); Frey & Osborne (2017); Arntz, Gregory & Zierahn (2016, p. 33)
  209. ^ E McGaughey, 'Will Robots Automate Your Job Away? Full Employment, Basic Income, and Economic Democracy' (2022) 51(3) Industrial Law Journal 511–559
  210. ^ Morgenstern (2015).
  211. ^ Mahdawi (2017); Thompson (2014)
  212. ^ Harari (2018).
  213. ^ Weaponized AI:
  214. ^ Urbina, Fabio; Lentzos, Filippa; Invernizzi, Cédric; Ekins, Sean (7 March 2022). "Dual use of artificial-intelligence-powered drug discovery". Nature Machine Intelligence. 4 (3): 189–191. doi:10.1038/s42256-022-00465-9. PMC 9544280. PMID 36211133. S2CID 247302391.
  215. ^ CNA (2019).
  216. ^ Goffrey (2008), p. 17.
  217. ^ Lipartito (2011, p. 36); Goodman & Flaxman (2017, p. 6)
  218. ^ Larson & Angwin (2016).
  219. ^ Berdahl, Carl Thomas; Baker, Lawrence; Mann, Sean; Osoba, Osonde; Girosi, Federico (7 February 2023). "Strategies to Improve the Impact of Artificial Intelligence on Health Equity: Scoping Review". JMIR AI. 2: e42936. doi:10.2196/42936. ISSN 2817-1705. S2CID 256681439. Archived from the original on 21 February 2023. Retrieved 21 February 2023.
  220. ^ Dockrill, Peter (27 June 2022). "Robots With Flawed AI Make Sexist And Racist Decisions, Experiment Shows". Science Alert. Archived from the original on 27 June 2022.
  221. ^ Cellan-Jones (2014).
  222. ^ Bostrom (2014); Müller & Bostrom (2014); Bostrom (2015)
  223. ^ Rubin (2003).
  224. ^ Müller & Bostrom (2014).
  225. ^ Leaders' concerns about the existential risks of AI:
  226. ^ Funding to mitigate risks of AI:
  227. ^ Leaders who argue the benefits of AI outweigh the risks:
  228. ^ Arguments that AI is not an imminent risk:
  229. ^ Brooks (2014).
  230. ^ "Artificial intelligence and copyright". www.wipo.int. Archived from the original on 24 May 2022. Retrieved 27 May 2022.
  231. ^ Hugenholtz, P. Bernt; Quintais, João Pedro (October 2021). "Copyright and Artificial Creation: Does EU Copyright Law Protect AI-Assisted Output?". IIC - International Review of Intellectual Property and Competition Law. 52 (9): 1190–1216. doi:10.1007/s40319-021-01115-0. ISSN 0018-9855. S2CID 244184811.
  232. ^ Yudkowsky (2008).
  233. ^ a b Anderson & Anderson (2011).
  234. ^ AAAI (2014).
  235. ^ Wallach (2010).
  236. ^ Russell (2019), p. 173.
  237. ^ Regulation of AI to mitigate risks:
  238. ^ Kissinger, Henry (1 November 2021). "The Challenge of Being Human in the Age of AI". The Wall Street Journal. Archived from the original on 4 November 2021. Retrieved 4 November 2021.
  239. ^ "Governance of superintelligence". openai.com. Retrieved 27 May 2023.
  240. ^ Buttazzo (2001).
  241. ^ Anderson (2008).
  242. ^ McCauley (2007).
  243. ^ Galvan (1997).

AI textbooks

These were the four most widely used AI textbooks in 2008:

Later editions.

The two most widely used textbooks in 2021. Open Syllabus: Explorer. Archived 7 October 2021 at the Wayback Machine.

History of AI

Other sources

Further reading

  • Autor, David H., "Why Are There Still So Many Jobs? The History and Future of Workplace Automation" (2015) 29(3) Journal of Economic Perspectives 3.
  • Boden, Margaret, Mind As Machine, Oxford University Press, 2006.
  • Cukier, Kenneth, "Ready for Robots? How to Think about the Future of AI", Foreign Affairs, vol. 98, no. 4 (July/August 2019), pp. 192–98. George Dyson, historian of computing, writes (in what might be called "Dyson's Law") that "Any system simple enough to be understandable will not be complicated enough to behave intelligently, while any system complicated enough to behave intelligently will be too complicated to understand." (p. 197.) Computer scientist Alex Pentland writes: "Current AI machine-learning algorithms are, at their core, dead simple stupid. They work, but they work by brute force." (p. 198.)
  • Domingos, Pedro, "Our Digital Doubles: AI will serve our species, not control it", Scientific American, vol. 319, no. 3 (September 2018), pp. 88–93.
  • Gopnik, Alison, "Making AI More Human: Artificial intelligence has staged a revival by starting to incorporate what we know about how children learn", Scientific American, vol. 316, no. 6 (June 2017), pp. 60–65.
  • Halpern, Sue, "The Human Costs of AI" (review of Kate Crawford, Atlas of AI: Power, Politics, and the Planetary Costs of Artificial Intelligence, Yale University Press, 2021, 327 pp.; Simon Chesterman, We, the Robots?: Regulating Artificial Intelligence and the Limits of the Law, Cambridge University Press, 2021, 289 pp.; Kevin Roose, Futureproof: 9 Rules for Humans in the Age of Automation, Random House, 217 pp.; Erik J. Larson, The Myth of Artificial Intelligence: Why Computers Can't Think the Way We Do, Belknap Press / Harvard University Press, 312 pp.), The New York Review of Books, vol. LXVIII, no. 16 (21 October 2021), pp. 29–31. "AI training models can replicate entrenched social and cultural biases. [...] Machines only know what they know from the data they have been given. [p. 30.] [A]rtificial general intelligence–machine-based intelligence that matches our own–is beyond the capacity of algorithmic machine learning... 'Your brain is one piece in a broader system which includes your body, your environment, other humans, and culture as a whole.' [E]ven machines that master the tasks they are trained to perform can't jump domains. AIVA, for example, can't drive a car even though it can write music (and wouldn't even be able to do that without Bach and Beethoven [and other composers on which AIVA is trained])." (p. 31.)
  • Johnston, John (2008) The Allure of Machinic Life: Cybernetics, Artificial Life, and the New AI, MIT Press.
  • Koch, Christof, "Proust among the Machines", Scientific American, vol. 321, no. 6 (December 2019), pp. 46–49. Christof Koch doubts the possibility of "intelligent" machines attaining consciousness, because "[e]ven the most sophisticated brain simulations are unlikely to produce conscious feelings." (p. 48.) According to Koch, "Whether machines can become sentient [is important] for ethical reasons. If computers experience life through their own senses, they cease to be purely a means to an end determined by their usefulness to... humans. Per GNW [the Global Neuronal Workspace theory], they turn from mere objects into subjects... with a point of view.... Once computers' cognitive abilities rival those of humanity, their impulse to push for legal and political rights will become irresistible—the right not to be deleted, not to have their memories wiped clean, not to suffer pain and degradation. The alternative, embodied by IIT [Integrated Information Theory], is that computers will remain only supersophisticated machinery, ghostlike empty shells, devoid of what we value most: the feeling of life itself." (p. 49.)
  • Marcus, Gary, "Am I Human?: Researchers need new ways to distinguish artificial intelligence from the natural kind", Scientific American, vol. 316, no. 3 (March 2017), pp. 58–63. A stumbling block to AI has been an incapacity for reliable disambiguation. An example is the "pronoun disambiguation problem": a machine has no way of determining to whom or what a pronoun in a sentence refers. (p. 61.)
  • Marcus, Gary, "Artificial Confidence: Even the newest, buzziest systems of artificial general intelligence are stymied by the same old problems", Scientific American, vol. 327, no. 4 (October 2022), pp. 42–45.
  • E McGaughey, 'Will Robots Automate Your Job Away? Full Employment, Basic Income, and Economic Democracy' (2022) 51(3) Industrial Law Journal 511, part 2(3) Archived 24 May 2018 at the Wayback Machine.
  • Musser, George, "Artificial Imagination: How machines could learn creativity and common sense, among other human qualities", Scientific American, vol. 320, no. 5 (May 2019), pp. 58–63.
  • Myers, Courtney Boyd ed. (2009). "The AI Report" Archived 29 July 2017 at the Wayback Machine. Forbes June 2009
  • Raphael, Bertram (1976). The Thinking Computer. W.H. Freeman and Co. ISBN 978-0716707233. Archived from the original on 26 July 2020. Retrieved 22 August 2020.
  • Scharre, Paul, "Killer Apps: The Real Dangers of an AI Arms Race", Foreign Affairs, vol. 98, no. 3 (May/June 2019), pp. 135–44. "Today's AI technologies are powerful but unreliable. Rules-based systems cannot deal with circumstances their programmers did not anticipate. Learning systems are limited by the data on which they were trained. AI failures have already led to tragedy. Advanced autopilot features in cars, although they perform well in some circumstances, have driven cars without warning into trucks, concrete barriers, and parked cars. In the wrong situation, AI systems go from supersmart to superdumb in an instant. When an enemy is trying to manipulate and hack an AI system, the risks are even greater." (p. 140.)
  • Serenko, Alexander (2010). "The development of an AI journal ranking based on the revealed preference approach" (PDF). Journal of Informetrics. 4 (4): 447–59. doi:10.1016/j.joi.2010.04.001. Archived (PDF) from the original on 4 October 2013. Retrieved 24 August 2013.
  • Serenko, Alexander; Michael Dohan (2011). "Comparing the expert survey and citation impact journal ranking methods: Example from the field of Artificial Intelligence" (PDF). Journal of Informetrics. 5 (4): 629–49. doi:10.1016/j.joi.2011.06.002. Archived (PDF) from the original on 4 October 2013. Retrieved 12 September 2013.
  • Simonite, Tom (29 December 2014). "2014 in Computing: Breakthroughs in Artificial Intelligence". MIT Technology Review. Archived from the original on 2 January 2015.
  • Sun, R. & Bookman, L. (eds.), Computational Architectures: Integrating Neural and Symbolic Processes. Kluwer Academic Publishers, Needham, MA. 1994.
  • Taylor, Paul, "Insanely Complicated, Hopelessly Inadequate" (review of Brian Cantwell Smith, The Promise of Artificial Intelligence: Reckoning and Judgment, MIT, 2019, ISBN 978-0262043045, 157 pp.; Gary Marcus and Ernest Davis, Rebooting AI: Building Artificial Intelligence We Can Trust, Ballantine, 2019, ISBN 978-1524748258, 304 pp.; Judea Pearl and Dana Mackenzie, The Book of Why: The New Science of Cause and Effect, Penguin, 2019, ISBN 978-0141982410, 418 pp.), London Review of Books, vol. 43, no. 2 (21 January 2021), pp. 37–39. Paul Taylor writes (p. 39): "Perhaps there is a limit to what a computer can do without knowing that it is manipulating imperfect representations of an external reality."
  • Tooze, Adam, "Democracy and Its Discontents", The New York Review of Books, vol. LXVI, no. 10 (6 June 2019), pp. 52–53, 56–57. "Democracy has no clear answer for the mindless operation of bureaucratic and technological power. We may indeed be witnessing its extension in the form of artificial intelligence and robotics. Likewise, after decades of dire warning, the environmental problem remains fundamentally unaddressed.... Bureaucratic overreach and environmental catastrophe are precisely the kinds of slow-moving existential challenges that democracies deal with very badly.... Finally, there is the threat du jour: corporations and the technologies they promote." (pp. 56–57.)

External links