Open In App

Z algorithm (Linear time pattern searching Algorithm)

Last Updated : 25 Feb, 2025
Comments
Improve
Suggest changes
Like Article
Like
Report

This algorithm efficiently locates all instances of a specific pattern within a text in linear time. If the length of the text is “n” and the length of the pattern is “m,” then the total time taken is O(m + n), with a linear auxiliary space. It is worth noting that the time and auxiliary space of this algorithm is the same as the KMP algorithm, but this particular algorithm is simpler to comprehend. In this approach, we create a Z array as part of the process.

What is Z Array?

For a string str[0..n-1], Z array is of same length as string. An element Z[i] of Z array stores length of the longest substring starting from str[i] which is also a prefix of str[0..n-1]. The first entry of Z array is meaning less as complete string is always prefix of itself. 

Example:
Index 0 1 2 3 4 5 6 7 8 9 10 11
Text a a b c a a b x a a a z
Z values X 1 0 0 3 1 0 0 2 2 1 0

More Examples:
str = “aaaaaa”
Z[] = {x, 5, 4, 3, 2, 1}
str = “aabaacd”
Z[] = {x, 1, 0, 2, 1, 0, 0}
str = “abababab”
Z[] = {x, 0, 6, 0, 4, 0, 2, 0}

How is Z array helpful in Searching Pattern in Linear time? 
The idea is to concatenate pattern and text, and create a string “P$T” where P is pattern, $ is a special character should not be present in pattern and text, and T is text. Build the Z array for concatenated string. In Z array, if Z value at any point is equal to pattern length, then pattern is present at that point. 

Example:
Pattern P = “aab”, Text T = “baabaa”
The concatenated string is = “aab$baabaa”
Z array for above concatenated string is {x, 1, 0, 0, 0, 3, 1, 0, 2, 1}.
Since length of pattern is 3, the value 3 in Z array indicates presence of pattern.

How to construct Z array? 
     A Simple Solution is to run two nested loops, the outer loop goes to every index and the inner loop finds length of the longest prefix that matches the substring starting at the current index. The time complexity of this solution is O(n2).
      We can construct Z array in linear time. 

The idea is to maintain an interval [L, R] which is the interval with max R such that [L,R] is prefix substring (substring which is also prefix).
Steps for maintaining this interval are as follows –
1) If i > R then there is no prefix substring that starts before i and ends after i, so we reset L and R and compute new [L,R] by comparing str[0..] to str[i..] and get Z[i] (= R-L+1).
2) If i <= R then let K = i-L, now Z[i] >= min(Z[K], R-i+1) because str[i..] matches with str[K..] for atleast R-i+1 characters (they are in [L,R] interval which we know is a prefix substring). Now two sub cases arise –
a) If Z[K] < R-i+1 then there is no prefix substring starting at str[i] (otherwise Z[K] would be larger) so Z[i] = Z[K] and interval [L,R] remains same.
b) If Z[K] >= R-i+1 then it is possible to extend the [L,R] interval thus we will set L as i and start matching from str[R] onwards and get new R then we will update interval [L,R] and calculate Z[i] (=R-L+1).

The algorithm runs in linear time because we never compare character less than R and with matching we increase R by one so there are at most T comparisons. In mismatch case, mismatch happen only once for each i (because of which R stops), that’s another at most T comparison making overall linear complexity.

Below is the implementation of Z algorithm for pattern searching. 

C++
// A C++ program that implements Z algorithm for pattern searching#include<iostream>usingnamespacestd;voidgetZarr(stringstr,intZ[]);// prints all occurrences of pattern in text using Z algovoidsearch(string&text,string&pattern){// Create concatenated string "P$T"stringconcat=pattern+"$"+text;intl=concat.length();// Construct Z arrayintZ[l];getZarr(concat,Z);// now looping through Z array for matching conditionfor(inti=0;i<l;++i){// if Z[i] (matched region) is equal to pattern// length we got the patternif(Z[i]==pattern.length())cout<<"Pattern found at index "<<i-pattern.length()-1<<endl;}}// Fills Z array for given string str[]voidgetZarr(stringstr,intZ[]){intn=str.length();intL,R,k;// [L,R] make a window which matches with prefix of sL=R=0;for(inti=1;i<n;++i){// if i>R nothing matches so we will calculate.// Z[i] using naive way.if(i>R){L=R=i;// R-L = 0 in starting, so it will start// checking from 0'th index. For example,// for "ababab" and i = 1, the value of R// remains 0 and Z[i] becomes 0. For string// "aaaaaa" and i = 1, Z[i] and R become 5while(R<n&&str[R-L]==str[R])R++;Z[i]=R-L;R--;}else{// k = i-L so k corresponds to number which// matches in [L,R] interval.k=i-L;// if Z[k] is less than remaining interval// then Z[i] will be equal to Z[k].// For example, str = "ababab", i = 3, R = 5// and L = 2if(Z[k]<R-i+1)Z[i]=Z[k];// For example str = "aaaaaa" and i = 2, R is 5,// L is 0else{// else start from R and check manuallyL=i;while(R<n&&str[R-L]==str[R])R++;Z[i]=R-L;R--;}}}}// Driver programintmain(){stringtext="GEEKS FOR GEEKS";stringpattern="GEEK";search(text,pattern);return0;}
Java
// A Java program that implements Z algorithm for pattern// searchingclassGFG{// prints all occurrences of pattern in text using// Z algopublicstaticvoidsearch(Stringtext,Stringpattern){// Create concatenated string "P$T"Stringconcat=pattern+"$"+text;intl=concat.length();intZ[]=newint[l];// Construct Z arraygetZarr(concat,Z);// now looping through Z array for matching conditionfor(inti=0;i<l;++i){// if Z[i] (matched region) is equal to pattern// length we got the patternif(Z[i]==pattern.length()){System.out.println("Pattern found at index "+(i-pattern.length()-1));}}}// Fills Z array for given string str[]privatestaticvoidgetZarr(Stringstr,int[]Z){intn=str.length();// [L,R] make a window which matches with // prefix of sintL=0,R=0;for(inti=1;i<n;++i){// if i>R nothing matches so we will calculate.// Z[i] using naive way.if(i>R){L=R=i;// R-L = 0 in starting, so it will start// checking from 0'th index. For example,// for "ababab" and i = 1, the value of R// remains 0 and Z[i] becomes 0. For string// "aaaaaa" and i = 1, Z[i] and R become 5while(R<n&&str.charAt(R-L)==str.charAt(R))R++;Z[i]=R-L;R--;}else{// k = i-L so k corresponds to number which// matches in [L,R] interval.intk=i-L;// if Z[k] is less than remaining interval// then Z[i] will be equal to Z[k].// For example, str = "ababab", i = 3, R = 5// and L = 2if(Z[k]<R-i+1)Z[i]=Z[k];// For example str = "aaaaaa" and i = 2, R is 5,// L is 0else{// else start from R and check manuallyL=i;while(R<n&&str.charAt(R-L)==str.charAt(R))R++;Z[i]=R-L;R--;}}}}// Driver programpublicstaticvoidmain(String[]args){Stringtext="GEEKS FOR GEEKS";Stringpattern="GEEK";search(text,pattern);}}// This code is contributed by PavanKoli.
Python
# Python3 program that implements Z algorithm# for pattern searching# Fills Z array for given string str[]defgetZarr(string,z):n=len(string)# [L,R] make a window which matches# with prefix of sl,r,k=0,0,0foriinrange(1,n):# if i>R nothing matches so we will calculate.# Z[i] using naive way.ifi>r:l,r=i,i# R-L = 0 in starting, so it will start# checking from 0'th index. For example,# for "ababab" and i = 1, the value of R# remains 0 and Z[i] becomes 0. For string# "aaaaaa" and i = 1, Z[i] and R become 5whiler<nandstring[r-l]==string[r]:r+=1z[i]=r-lr-=1else:# k = i-L so k corresponds to number which# matches in [L,R] interval.k=i-l# if Z[k] is less than remaining interval# then Z[i] will be equal to Z[k].# For example, str = "ababab", i = 3, R = 5# and L = 2ifz[k]<r-i+1:z[i]=z[k]# For example str = "aaaaaa" and i = 2, # R is 5, L is 0else:# else start from R and check manuallyl=iwhiler<nandstring[r-l]==string[r]:r+=1z[i]=r-lr-=1# prints all occurrences of pattern # in text using Z algodefsearch(text,pattern):# Create concatenated string "P$T"concat=pattern+"$"+textl=len(concat)# Construct Z arrayz=[0]*lgetZarr(concat,z)# now looping through Z array for matching conditionforiinrange(l):# if Z[i] (matched region) is equal to pattern# length we got the patternifz[i]==len(pattern):print("Pattern found at index",i-len(pattern)-1)# Driver Codeif__name__=="__main__":text="GEEKS FOR GEEKS"pattern="GEEK"search(text,pattern)# This code is contributed by# sanjeev2552
C#
// A C# program that implements Z // algorithm for pattern searching usingSystem;classGFG{// prints all occurrences of // pattern in text using Z algo publicstaticvoidsearch(stringtext,stringpattern){// Create concatenated string "P$T" stringconcat=pattern+"$"+text;intl=concat.Length;int[]Z=newint[l];// Construct Z array getZarr(concat,Z);// now looping through Z array// for matching condition for(inti=0;i<l;++i){// if Z[i] (matched region) is equal // to pattern length we got the pattern if(Z[i]==pattern.Length){Console.WriteLine("Pattern found at index "+(i-pattern.Length-1));}}}// Fills Z array for given string str[] privatestaticvoidgetZarr(stringstr,int[]Z){intn=str.Length;// [L,R] make a window which // matches with prefix of s intL=0,R=0;for(inti=1;i<n;++i){// if i>R nothing matches so we will // calculate. Z[i] using naive way. if(i>R){L=R=i;// R-L = 0 in starting, so it will start // checking from 0'th index. For example, // for "ababab" and i = 1, the value of R // remains 0 and Z[i] becomes 0. For string // "aaaaaa" and i = 1, Z[i] and R become 5 while(R<n&&str[R-L]==str[R]){R++;}Z[i]=R-L;R--;}else{// k = i-L so k corresponds to number // which matches in [L,R] interval. intk=i-L;// if Z[k] is less than remaining interval // then Z[i] will be equal to Z[k]. // For example, str = "ababab", i = 3, // R = 5 and L = 2 if(Z[k]<R-i+1){Z[i]=Z[k];}// For example str = "aaaaaa" and // i = 2, R is 5, L is 0 else{// else start from R and // check manually L=i;while(R<n&&str[R-L]==str[R]){R++;}Z[i]=R-L;R--;}}}}// Driver Code publicstaticvoidMain(string[]args){stringtext="GEEKS FOR GEEKS";stringpattern="GEEK";search(text,pattern);}}// This code is contributed by Shrikant13
Javascript
// A JavaScript program that implements Z algorithm for// pattern searching// prints all occurrences of pattern in text using algofunctionsearch(text,pattern){// Create concatenated string "P$T"letconcat=pattern+"$"+text;letl=concat.length;letZ=newArray(l);// Construct Z arraygetZarr(concat,Z);// now looping through Z array for matching conditionfor(leti=0;i<l;++i){// if Z[i] (matched region) is equal to pattern// length we got the patternif(Z[i]==pattern.length){console.log("Pattern found at index "+(i-pattern.length-1));}}}// Fills Z array for given string str[]functiongetZarr(str,Z){letn=str.length;// [L,R] make a window which matches with// prefix of sletL=0,R=0;for(leti=1;i<n;++i){// if i>R nothing matches so we will calculate.// Z[i] using naive way.if(i>R){L=R=i;// R-L = 0 in starting, so it will start// checking from 0'th index. For example,// for "ababab" and i = 1, the value of R// remains 0 and Z[i] becomes 0. For string// "aaaaaa" and i = 1, Z[i] and R become 5while(R<n&&str[R-L]==str[R])R++;Z[i]=R-L;R--;}else{// k = i-L so k corresponds to number which// matches in [L,R] interval.letk=i-L;// if Z[k] is less than remaining interval// then Z[i] will be equal to Z[k].// For example, str = "ababab", i = 3, R = 5// and L = 2if(Z[k]<R-i+1)Z[i]=Z[k];// For example str = "aaaaaa" and i = 2, R is 5,// L is 0else{// else start from R and check manuallyL=i;while(R<n&&str[R-L]==str[R])R++;Z[i]=R-L;R--;}}}}// Driver programlettext="GEEKS FOR GEEKS";letpattern="GEEK";search(text,pattern);

Output
Pattern found at index 0 Pattern found at index 10 


Time Complexity: O(m+n), where m is length of pattern and n is length of text.
Auxiliary Space: O(n)
 



Next Article
Article Tags :
Practice Tags :

Similar Reads

close