
Topic outline

  • Unit 10: Reasoning Agents

    Reasoning is one of the core capabilities implicit in the Turing test for intelligence. Reasoning allows an agent to infer or deduce new information from information already known to be true. Reasoning is carried out in different "logics": formal languages of varying expressive power for describing what is known about a system, including Boolean (propositional) logic and first-order logic (FOL). FOL can describe known information about a system more compactly than propositional logic. This unit will help you understand these logics and how inference is carried out in propositional logic and FOL. Finally, we will look at uncertainty models such as Bayesian analysis, Bayesian networks, and Markov chains, which have proved better than logic-based systems at modeling uncertainty and predicting probabilistic outcomes.

    Completing this unit should take you approximately 8 hours.

    • Upon successful completion of this unit, you will be able to:

      • explain the foundations of propositional logic;
      • describe the methods of reasoning algorithms such as modus ponens and resolution over propositional logic;
      • describe the foundations of first-order logic;
      • describe the methods of reasoning algorithms such as modus ponens and resolution over first-order logic;
      • describe the foundations of reasoning under uncertainty and its importance to intelligent agents;
      • apply Bayesian analysis to predict probabilistic outcomes in problems that have uncertainty;
      • apply Bayesian networks to model probabilistic relationships between discrete variables; and
      • describe how Markov chains and hidden Markov chains relate to uncertainty reasoning.
    • 10.1: Propositional Logic

      Propositional or Boolean logic is a simple (but limited) notation for describing the knowledge associated with different problem domains. Because the notation is limited, describing complex systems in propositional logic can be tedious and verbose. As we review this material, we will encounter inference principles such as modus ponens (used in forward and backward chaining) and resolution over propositional logic.

      • The simplest language for logical reasoning is propositional logic (PL), also called boolean logic. The range of allowable constructs in PL includes symbols representing some fact or event in the world (which can be true or false) and logical connectives like not, and, or, and implications (also called "if-then statements").

      • PL can work as a simple language for describing facts (also known as truths or axioms) about a domain of discourse, and it can describe aspects of many systems and domains. Reasoning in PL happens through inference procedures, which are in general a complex topic. New facts can be inferred by repeated application of modus ponens: whenever all the antecedents of a rule are known to be true, its consequent must also be true. As you read, focus on the general ideas and examples; if you are interested, you can go into the technical details on a second, deeper reading.
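      As a concrete illustration of the ideas above, here is a minimal sketch of forward chaining over propositional rules: modus ponens is applied repeatedly until no new facts can be derived. The facts and rules are invented for the example.

```python
# Minimal forward chaining over propositional rules (illustrative sketch).
# A rule (premises, conclusion) fires by modus ponens once all of its
# premises are among the known facts.

def forward_chain(facts, rules):
    """Repeatedly apply modus ponens until no new facts can be derived."""
    known = set(facts)
    changed = True
    while changed:
        changed = False
        for premises, conclusion in rules:
            if conclusion not in known and all(p in known for p in premises):
                known.add(conclusion)  # modus ponens: premises true => conclusion true
                changed = True
    return known

# Hypothetical knowledge base: "if it rains and you are outside, you get wet",
# and "if you are wet, you are cold".
rules = [
    (["rain", "outside"], "wet"),
    (["wet"], "cold"),
]
print(forward_chain(["rain", "outside"], rules))  # derives 'wet', then 'cold'
```

      Note that the loop runs until a fixed point is reached; chaining matters here, since "cold" only becomes derivable after "wet" has been added.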

    • 10.2: First Order Logic

      First-order logic is a powerful (but fairly complex) notation for describing the knowledge associated with different problem domains. FOL is a far more expressive notation than propositional logic for describing systems, but it can be more challenging to follow. Using first-order logic, we will review inference principles such as modus ponens (forward and backward chaining) and resolution.

      • FOL is often called the language of computer science. It has far more powerful constructs than PL and is more expressive. Unlike PL, FOL describes objects and their interrelationships and incorporates the concept of quantifiers, which allow you to express properties shared (or not shared) by sets of objects.

      • In general, FOL is a complex topic, so for now focus on the high-level concepts. Like PL, FOL is used to describe the details of a system or domain, but because it has quantifiers and relations, hypotheses can be stated more compactly as "well-formed formulas" (wffs), the sentences of FOL. Because the notation is more powerful, proving statements true (or false) in FOL is computationally intensive. Deductions are made in FOL using principles such as modus ponens and resolution.
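      To make FOL inference more concrete, the following sketch implements unification, the pattern-matching step that both modus ponens and resolution rely on in FOL. The term representation (variables as strings starting with '?', compound terms as tuples) and the example predicate are our own illustrative choices, and the occurs-check of a production unifier is omitted for brevity.

```python
# Illustrative unification, the matching step at the heart of FOL inference.
# Variables are strings beginning with '?'; compound terms are tuples like
# ('Parent', '?x', 'John'); everything else is a constant.

def unify(a, b, subst=None):
    """Return a substitution making a and b identical, or None if impossible."""
    if subst is None:
        subst = {}
    if a == b:
        return subst
    if isinstance(a, str) and a.startswith('?'):
        return unify_var(a, b, subst)
    if isinstance(b, str) and b.startswith('?'):
        return unify_var(b, a, subst)
    if isinstance(a, tuple) and isinstance(b, tuple) and len(a) == len(b):
        for x, y in zip(a, b):
            subst = unify(x, y, subst)
            if subst is None:
                return None
        return subst
    return None

def unify_var(var, term, subst):
    """Bind var to term, following any existing binding first."""
    if var in subst:
        return unify(subst[var], term, subst)
    return {**subst, var: term}

# Unifying Parent(?x, John) with Parent(Mary, ?y) binds ?x=Mary and ?y=John.
print(unify(('Parent', '?x', 'John'), ('Parent', 'Mary', '?y')))
```

      Once two sentences unify, the resulting substitution lets an inference rule such as generalized modus ponens apply a universally quantified rule to a specific case.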

    • 10.3: Bayesian Reasoning and Uncertainty

      An inherent part of intelligence is the ability to handle uncertainty effectively. Specifically, we will discuss the framework of conditional probability and use Bayes' theorem as the foundation for modeling the influence of variables on outcomes. Using Bayes' rule, an agent can weigh evidence probabilistically and choose the most likely explanation or course of action.

      • Probability theory provides a robust and well-understood platform to handle uncertainty. In addition to "prior" probability, it is also useful to master conditional probability to sharpen our ability to reason about uncertain events. Can you explain how conditional probability works and how to analyze the likelihood of events with some apparent dependence on one another?

      • One of the common ways to use conditional probability is through Bayes' Theorem. The definition of conditional probability is used in Bayes' Theorem to render inferences in many situations where events are causally linked.
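      The following sketch works through Bayes' theorem on a hypothetical diagnostic test; all of the numbers are invented for illustration.

```python
# A worked example of Bayes' theorem: P(H | E) = P(E | H) * P(H) / P(E),
# where P(E) is obtained by summing over the hypothesis being true and false.

def posterior(prior, likelihood, false_positive_rate):
    """P(hypothesis | positive evidence) via Bayes' theorem."""
    evidence = likelihood * prior + false_positive_rate * (1 - prior)
    return likelihood * prior / evidence

# Hypothetical test: 1% base rate, 90% sensitivity, 5% false-positive rate.
p = posterior(prior=0.01, likelihood=0.90, false_positive_rate=0.05)
print(round(p, 3))  # 0.154: even after a positive test, the hypothesis is unlikely
```

      The result illustrates why priors matter: with a rare condition, most positive results come from the much larger pool of false positives.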

    • 10.4: Modeling Causality with Bayesian Networks

      A Bayesian network can model causal relationships probabilistically. Given certain evidence, Bayesian analysis can probabilistically predict the explanation for the evidence. Markov chains and hidden Markov chains are formal ways to model uncertainty in dynamic systems that change state in specific ways.

      • A Bayesian network is an easy-to-understand graphical notation for representing the conditional interdependence of variables within a system. This simple graphical formalism leverages conditional probability distributions to describe the relationships between variables. How can Bayesian networks compute the probabilities of specific events given other known facts? Humans use this same kind of reasoning to make decisions in uncertain environments.

      • Markov chains are one of the most common formalisms for describing event probabilities in a system where the next state depends only on the current state, not on how that state was reached.

      • In hidden Markov chains, the system's behavior depends on latent (or hidden) variables. These models have many applications in contemporary AI. For now, focus on grasping the high-level themes and ideas; if the subject interests you, you can dive deeper into the technical details. The examples are particularly instructive.
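      As a concrete illustration of Bayesian-network reasoning, the following sketch performs inference by enumeration on a toy rain/sprinkler/wet-grass network. The network structure and all conditional probability tables are hypothetical, and brute-force enumeration is feasible only because the network is tiny.

```python
from itertools import product

# Toy Bayesian network: Rain -> Sprinkler, and WetGrass depends on both.
# All conditional probability tables below are invented for illustration.

def joint(r, s, w):
    """P(Rain=r, Sprinkler=s, WetGrass=w) via the chain rule over the network."""
    p_r = 0.2 if r else 0.8                        # P(Rain)
    p_s_true = 0.01 if r else 0.4                  # sprinkler rarely runs in the rain
    p_s = p_s_true if s else 1 - p_s_true
    p_w_true = {(True, True): 0.99, (True, False): 0.80,
                (False, True): 0.90, (False, False): 0.0}[(r, s)]
    p_w = p_w_true if w else 1 - p_w_true
    return p_r * p_s * p_w

def prob_rain_given_wet():
    """P(Rain=true | WetGrass=true), summing out the sprinkler variable."""
    num = sum(joint(True, s, True) for s in (True, False))
    den = sum(joint(r, s, True) for r, s in product((True, False), repeat=2))
    return num / den

print(round(prob_rain_given_wet(), 3))  # 0.358 with these made-up numbers
```

      This is the "explanation given evidence" pattern described above: observing wet grass raises the probability of rain well above its 0.2 prior, even though the sprinkler is a competing explanation.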
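      The Markov property itself can be illustrated with a tiny two-state weather chain; the transition probabilities are invented for the example.

```python
# A two-state Markov chain (hypothetical weather model): the distribution over
# the next state depends only on the current state -- the Markov property.

TRANSITIONS = {                      # P(next | current)
    "sunny": {"sunny": 0.8, "rainy": 0.2},
    "rainy": {"sunny": 0.4, "rainy": 0.6},
}

def step(dist):
    """Advance the state distribution one step under the transition model."""
    out = {state: 0.0 for state in TRANSITIONS}
    for cur, p in dist.items():
        for nxt, t in TRANSITIONS[cur].items():
            out[nxt] += p * t
    return out

dist = {"sunny": 1.0, "rainy": 0.0}
for _ in range(50):                  # iterating approaches the stationary distribution
    dist = step(dist)
print(dist)                          # approximately 2/3 sunny, 1/3 rainy
```

      A hidden Markov chain adds one more layer: the state itself is not observed directly, only noisy evidence emitted from it, so the same transition machinery is combined with an observation model.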