Matrix Chain Multiplication

It is a Method under Dynamic Programming in which previous output is taken as input for next.

Here, Chain means one matrix's column is equal to the second matrix's row [always].

In general:

If A = ⌊a_ij⌋ is a p x q matrix 
   B = ⌊b_ij⌋ is a q x r matrix
   C = ⌊c_ij⌋ is a p x r matrix

Then

Given following matrices {A₁,A₂,A₃,...A_n} and we have to perform the matrix multiplication, which can be accomplished by a series of matrix multiplications

A₁ xA₂ x,A₃ x.....x A_n

Matrix Multiplication operation is associative in nature rather commutative. By this, we mean that we have to follow the above matrix order for multiplication but we are free to parenthesize the above multiplication depending upon our need.

In general, for 1≤ i≤ p and 1≤ j ≤ r

It can be observed that the total entries in matrix 'C' is 'pr' as the matrix is of dimension p x r Also each entry takes O (q) times to compute, thus the total time to compute all possible entries for the matrix 'C' which is a multiplication of 'A' and 'B' is proportional to the product of the dimension p q r.

It is also noticed that we can save the number of operations by reordering the parenthesis.

Example1: Let us have 3 matrices, A₁,A₂,A₃ of order (10 x 100), (100 x 5) and (5 x 50) respectively.

Three Matrices can be multiplied in two ways:

A₁,(A₂,A₃): First multiplying(A₂ and A₃) then multiplying and resultant withA₁.
(A₁,A₂),A₃: First multiplying(A₁ and A₂) then multiplying and resultant withA₃.

No of Scalar multiplication in Case 1 will be:

No of Scalar multiplication in Case 2 will be:

To find the best possible way to calculate the product, we could simply parenthesis the expression in every possible fashion and count each time how many scalar multiplication are required.

Matrix Chain Multiplication Problem can be stated as "find the optimal parenthesization of a chain of matrices to be multiplied such that the number of scalar multiplication is minimized".

Number of ways for parenthesizing the matrices:

There are very large numbers of ways of parenthesizing these matrices. If there are n items, there are (n-1) ways in which the outer most pair of parenthesis can place.

(A₁) (A₂,A₃,A₄,................A_n)
Or (A₁,A₂)  (A₃,A₄ .................A_n)
Or (A₁,A₂,A₃)  (A₄ ...............A_n)
........................

Or(A₁,A₂,A₃.............A_n-1) (A_n)

It can be observed that after splitting the kth matrices, we are left with two parenthesized sequence of matrices: one consist 'k' matrices and another consist 'n-k' matrices.

Now there are 'L' ways of parenthesizing the left sublist and 'R' ways of parenthesizing the right sublist then the Total will be L.R:

Also p (n) = c (n-1) where c (n) is the nth Catalon number

c (n) = DAA Matrix Chain Multiplication

On applying Stirling's formula we have

c (n) = Ω DAA Matrix Chain Multiplication

Which shows that 4ⁿ grows faster, as it is an exponential function, then n^1.5.

Development of Dynamic Programming Algorithm

Characterize the structure of an optimal solution.
Define the value of an optimal solution recursively.
Compute the value of an optimal solution in a bottom-up fashion.
Construct the optimal solution from the computed information.

Dynamic Programming Approach

Let A_i,j be the result of multiplying matrices i through j. It can be seen that the dimension of A_i,j is p_i-1 x p_j matrix.

Dynamic Programming solution involves breaking up the problems into subproblems whose solution can be combined to solve the global problem.

At the greatest level of parenthesization, we multiply two matrices

A_1.....n=A_1....k x A_k+1....n)

Thus we are left with two questions:

How to split the sequence of matrices?
How to parenthesize the subsequence A_1.....k andA_k+1......n?

One possible answer to the first question for finding the best value of 'k' is to check all possible choices of 'k' and consider the best among them. But that it can be observed that checking all possibilities will lead to an exponential number of total possibilities. It can also be noticed that there exists only O (n² ) different sequence of matrices, in this way do not reach the exponential growth.

Step1: Structure of an optimal parenthesization: Our first step in the dynamic paradigm is to find the optimal substructure and then use it to construct an optimal solution to the problem from an optimal solution to subproblems.

Let A_i....j where i≤ j denotes the matrix that results from evaluating the product

A_i A_i+1....A_j.

If i < j then any parenthesization of the product A_i A_i+1 ......A_j must split that the product between A_k and A_k+1 for some integer k in the range i ≤ k ≤ j. That is for some value of k, we first compute the matrices A_i.....k & A_k+1....j and then multiply them together to produce the final product A_i....j. The cost of computing A_i....k plus the cost of computing A_k+1....j plus the cost of multiplying them together is the cost of parenthesization.

Step 2: A Recursive Solution: Let m [i, j] be the minimum number of scalar multiplication needed to compute the matrixA_i....j.

If i=j the chain consist of just one matrix A_i....i=A_i so no scalar multiplication are necessary to compute the product. Thus m [i, j] = 0 for i= 1, 2, 3....n.

If i<j we assume that to optimally parenthesize the product we split it between A_k and A_k+1 where i≤ k ≤j. Then m [i,j] equals the minimum cost for computing the subproducts A_i....k and A_k+1....j+ cost of multiplying them together. We know A_i has dimension p_i-1 x p_i, so computing the product A_i....k and A_k+1....jtakes p_i-1 p_k p_j scalar multiplication, we obtain

m [i,j] = m [i, k] + m [k + 1, j] + p_i-1  p_k p_j

There are only (j-1) possible values for 'k' namely k = i, i+1.....j-1. Since the optimal parenthesization must use one of these values for 'k' we need only check them all to find the best.

So the minimum cost of parenthesizing the product A_i A_i+1......A_j becomes

To construct an optimal solution, let us define s [i,j] to be the value of 'k' at which we can split the product A_i A_i+1 .....A_j To obtain an optimal parenthesization i.e. s [i, j] = k such that

m [i,j] = m [i, k] + m [k + 1, j] + p_i-1  p_k p_j

Next TopicMatrix Chain Multiplication Example

← prev next →

For Videos Join Our Youtube Channel: Join Now

Feedback

Send your Feedback to [email protected]

Help Others, Please Share

Learn Latest Tutorials

Splunk

SPSS

Swagger

Transact-SQL

Tumblr

ReactJS

Regex

Reinforcement Learning

R Programming

RxJS

React Native

Python Design Patterns

Python Pillow

Python Turtle

Keras

Preparation

Aptitude

Reasoning

Verbal Ability

Interview Questions

Company Questions

Trending Technologies

Artificial Intelligence

AWS

Selenium

Cloud Computing

Hadoop

ReactJS

Data Science

Angular 7

Blockchain

Git

Machine Learning

DevOps

B.Tech / MCA

DBMS

Data Structures

DAA

Operating System

Computer Network

Compiler Design

Computer Organization

Discrete Mathematics

Ethical Hacking

Computer Graphics

Software Engineering

Web Technology

Cyber Security

Automata

C Programming

C++

Java

.Net

Python

Programs

Control System

Data Mining

Data Warehouse

^{Like/Subscribe us for latest updates or newsletter}

DAA Tutorial

Asymptotic Analysis

Recurrence

Analysis of Sorting

Divide and Conquer

Sorting

Lower bound Theory

Sorting in Linear Time

Hashing

Binary Search Trees

Red Black Tree

Dynamic Programming

Greedy Algorithm

Backtracking

MST

Shortest Path

All-Pairs Shortest Paths

Maximum Flow

Sorting Networks

Complexity Theory

Approximation Algo

String Matching

Misc