In this lab, we dive into the Forward Algorithm, a core component of probabilistic sequence models, especially Hidden Markov Models (HMMs). The algorithm computes the probability of an observed sequence given a model’s parameters, which is crucial in applications like speech recognition, bioinformatics, and NLP.
Introduction
Hidden Markov Models (HMMs) are statistical models where the system being modeled is assumed to be a Markov process with hidden states. The Forward Algorithm efficiently calculates the total probability of an observed sequence by summing over all possible hidden state sequences.
Problem Setup
Let:
- \( N \): number of hidden states
- \( T \): length of observation sequence
- \( A \): state transition probabilities
- \( B \): observation/emission probabilities
- \( \pi \): initial state distribution
- \( O = o_1, o_2, …, o_T \): sequence of observations
The goal is to compute:
\[ P(O \mid \lambda) = \sum_{Q} P(O, Q \mid \lambda) \]
where \( Q = q_1, q_2, \dots, q_T \) ranges over all \( N^T \) possible hidden state sequences. Evaluating this sum by brute force costs on the order of \( T N^T \) operations; the Forward Algorithm computes the same quantity in \( O(N^2 T) \) time using dynamic programming.
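To make the gap concrete, here is a deliberately naive sketch that enumerates every hidden state sequence. The function name is my own, but it follows the same conventions as the implementation later in this lab (A, B, pi as NumPy arrays, observations as a list of symbol indices) and is useful only as a reference to test against on tiny inputs:

```python
import itertools
import numpy as np

def brute_force_probability(A, B, pi, observations):
    """Sum P(O, Q | lambda) over all N^T hidden state sequences Q.

    Exponential in T -- only for checking the Forward Algorithm on tiny inputs.
    """
    N = A.shape[0]
    T = len(observations)
    total = 0.0
    for Q in itertools.product(range(N), repeat=T):  # every state sequence
        p = pi[Q[0]] * B[Q[0], observations[0]]      # initial state and emission
        for t in range(1, T):
            p *= A[Q[t - 1], Q[t]] * B[Q[t], observations[t]]
        total += p
    return total
```

With \( N = 2 \) states and \( T = 3 \) observations this enumerates only \( 2^3 = 8 \) sequences, but at \( T = 100 \) the enumeration is hopeless.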
Algorithm Overview
Steps:
- Initialization: \( \alpha_1(i) = \pi_i \cdot b_i(o_1) \)
- Recursion: \( \alpha_t(j) = \left[ \sum_{i=1}^{N} \alpha_{t-1}(i) \cdot a_{ij} \right] \cdot b_j(o_t) \)
- Termination: \( P(O \mid \lambda) = \sum_{i=1}^{N} \alpha_T(i) \)

Here \( \alpha_t(i) = P(o_1, \dots, o_t, q_t = i \mid \lambda) \) is the probability of observing the first \( t \) symbols and ending in state \( i \) at time \( t \).
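Collecting the recursion over all \( j \) gives a compact vector form:

\[ \alpha_t = (\alpha_{t-1} A) \odot B_{:, o_t} \]

where \( \odot \) is element-wise multiplication and \( B_{:, o_t} \) is the column of emission probabilities for symbol \( o_t \). Each time step is therefore an \( O(N^2) \) matrix-vector product, which is where the overall \( O(N^2 T) \) cost comes from.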
Implementation
```python
import numpy as np

def forward_algorithm(A, B, pi, observations):
    """Compute P(O | lambda) and the full alpha matrix for an HMM."""
    N = A.shape[0]         # Number of states
    T = len(observations)  # Length of observation sequence
    alpha = np.zeros((T, N))

    # Initialization: alpha_1(i) = pi_i * b_i(o_1)
    alpha[0, :] = pi * B[:, observations[0]]

    # Recursion: alpha_t(j) = sum_i alpha_{t-1}(i) * a_ij * b_j(o_t)
    for t in range(1, T):
        for j in range(N):
            alpha[t, j] = np.sum(alpha[t - 1] * A[:, j]) * B[j, observations[t]]

    # Termination: P(O | lambda) = sum_i alpha_T(i)
    probability = np.sum(alpha[T - 1, :])
    return probability, alpha
```
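One practical caveat: the raw \( \alpha \) values shrink geometrically with \( t \), so on long sequences they underflow to zero in floating point. A standard remedy is to normalize \( \alpha_t \) at each step and accumulate the logs of the normalizers \( c_t \), since \( P(O \mid \lambda) = \prod_t c_t \). A minimal sketch, assuming the same A, B, pi conventions as above (the function name and the vectorized recursion are mine, not part of the lab's reference code):

```python
def forward_algorithm_scaled(A, B, pi, observations):
    """Scaled forward pass: returns log P(O | lambda) without underflow."""
    log_prob = 0.0
    alpha = pi * B[:, observations[0]]   # unscaled alpha_1
    for t in range(len(observations)):
        if t > 0:
            # vector form of the recursion: alpha_t = (alpha_{t-1} A) * B[:, o_t]
            alpha = (alpha @ A) * B[:, observations[t]]
        c = alpha.sum()                  # normalizer c_t
        log_prob += np.log(c)            # log P(O | lambda) = sum_t log c_t
        alpha = alpha / c                # scaled alpha_t sums to 1
    return log_prob
```

On the short example below this agrees with np.log(probability); the difference only shows up once \( T \) is large enough for the unscaled version to underflow.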
Example Usage
```python
# Define model parameters
A = np.array([[0.7, 0.3],
              [0.4, 0.6]])        # state transition probabilities
B = np.array([[0.1, 0.4, 0.5],
              [0.6, 0.3, 0.1]])   # emission probabilities for 3 symbols
pi = np.array([0.6, 0.4])         # initial state distribution

# Observations, coded as emission-symbol indices (0, 1, or 2)
observations = [0, 1, 2]

# Run the forward algorithm
probability, alpha_matrix = forward_algorithm(A, B, pi, observations)
print("Probability of observing sequence:", probability)
print("Alpha matrix:\n", alpha_matrix)
```
Visualization
```python
import matplotlib.pyplot as plt
import seaborn as sns

def visualize_alpha(alpha):
    """Plot the alpha matrix as a heatmap: rows are time steps, columns are states."""
    plt.figure(figsize=(10, 4))
    sns.heatmap(alpha, annot=True, fmt=".4f", cmap="Blues")
    plt.title("Forward Probabilities (Alpha Matrix)")
    plt.xlabel("States")
    plt.ylabel("Time Step")
    plt.show()

visualize_alpha(alpha_matrix)
```
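Because the \( \alpha \) values decay geometrically over time, the later rows of the heatmap wash out for longer sequences; passing np.log10(alpha) to the same function is one simple way to keep them readable.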
Time and Space Complexity
| Step | Time Complexity | Space Complexity |
|---|---|---|
| Initialization | \( O(N) \) | \( O(NT) \) |
| Recursion | \( O(N^2 T) \) | \( O(NT) \) |
| Termination | \( O(N) \) | \( O(1) \) |
- \( N \): Number of states
- \( T \): Length of observation sequence
Applications
- Speech Recognition: Predicting spoken words from audio input.
- Gene Prediction: Modeling DNA sequences.
- POS Tagging: Inferring grammatical structures from sentence tokens.
- Time Series Analysis: Anomaly detection in sensor data.
Conclusion
The Forward Algorithm is a foundational method in probabilistic modeling for sequences. It elegantly leverages dynamic programming to solve a problem that would otherwise require exponential time. Mastering this algorithm is essential for understanding more advanced HMM tasks like training and decoding.