Hidden Markov Models

Monday, May 30, 2011

Nowadays, many applications use Hidden Markov Models (HMMs) to solve crucial issues such as bioinformatics, speech recognition, musical analysis, digital signal processing, data mining, financial applications, time series analysis and many others. HMMs are probabilistic models which are very useful to model sequence behaviours or discrete time series events. Formally it models Markov processes with hidden states, like an extension for Markov Chains. For computer scientists, is a state machine with probabilistic transitions where each state can emit a value with a given probability.

For better understanding HMMs, I will illustrate how it works with "The Fair Bet Casino" problem. Imagine you are in a casino where you can bet on coins tosses, tossed by a dealer. A coin toss can have two outcomes: head (H) or tail (T). Now suppose that the coin dealer has two coins, a fair (F) which outputs both H and T with 1/2 probabilities and a biased coin (B) which outputs H with probability 3/4 and T with 1/4. Using probability language we say:

P(H|F) = 1/2
P(T|F) = 1/2
P(H|B) = 3/4
P(T|B) = 1/4

Now imagine that the dealer changes the coin in a way you can't see, but you know that he does it with a 1/10 probability. So thinking the coin tosses as a sequence of events we can say:

P(F_i+1|F_i) = 9/10
P(B_i+1|F_i) = 1/10
P(B_i+1|B_i) = 9/10
P(F_i+1|B_i) = 1/10

We can model it using a graph to illustrate the process:

HMM for "The Fair Bet Casino" problem

That's a HMM! It isn't any rocket science. Is just important to add a few remarks. We call the set of all possible emissions of the Markov process as the alphabet Σ ({H, T} in our problem). For many of computational method involving HMMs you will also need a initial state distribution π. For our problem we may assume that the we have equal probability for each coin.

Now comes in our mind what we can do with the model in our hands. There are lot's of stuff to do with it, such as: given a sequence of results, when the dealer used the biased coin or even generate a random sequence with a coherent behaviour when compared to the model.

There is a nice library called ghmm (available for C and Python) which handles HMMs and already gives us the most famous and important HMM algorithms. Unfortunately the python wrapper is not pythonic. Let's model our problem in python to have some fun:





import ghmm



# setting 0 for Heads and 1 for Tails as our Alphabet 

sigma = ghmm.IntegerRange(0, 2)



# transition matrix: rows and columns means origin and destiny states

transitions_probabilities = [

    [0.9, 0.1], # 0: fair state

    [0.1, 0.9], # 1: biased state

]



# emission matrix: rows and columns means states and symbols respectively

emissions_probabilities = [

    [0.5, 0.5], # 0: fair state emissions probabilities

    [0.75, 0.25], # 1: biased state emissions probabilities

]



# probability of initial states

pi = [0.5, 0.5] # equal probabilities for 0 and 1



hmm = ghmm.HMMFromMatrices(

sigma,

    # you can model HMMs with others emission probability distributions

    ghmm.DiscreteDistribution(sigma),

    transitions_probabilities,

    emissions_probabilities,

    pi

)

>>> print hmm
DiscreteEmissionHMM(N=2, M=2)



  state 0 (initial=0.50)

    Emissions: 0.50, 0.50

    Transitions: ->0 (0.90), ->1 (0.10)



  state 1 (initial=0.50)

    Emissions: 0.75, 0.25

    Transitions: ->0 (0.10), ->1 (0.90)

Now that we have our HMM object on the hand we can play with it. Suppose you have the given sequence of coin tosses and you would like to distinguish which coin was being used at a given state:



tosses = [1, 1, 1, 0, 1, 0, 0, 1, 1, 0, 0, 1, 0, 1, 0, 0, 0, 0, 1, 1, 0,   1, 0, 1, 0, 0, 0, 1, 0, 0, 0, 0, 0, 1, 0, 0, 1, 0, 0, 0, 1, 0, 1]

The viterbi algorithm can be used to trace the most probable states at each coin toss according to the HMM distribution:



# not as pythonic is could be :-/

sequence = ghmm.EmissionSequence(sigma, tosses)



viterbi_path, _ = hmm.viterbi(sequence)



>>> print viterbi_path

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,  1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1]

Nice! But sometimes is interesting to have the probability of each state on the point instead of only the most probable one. To have that, you must use the posterior or forward algorithms to have more detailed information.



states_probabilities = hmm.posterior(sequence)



>>> print

states_probabilities
[[0.8407944139086141, 0.1592055860913865], [0.860787703168127, 0.13921229683187356], ... ]

The posterior method result, returns the list of probabilities at each state, for example, in the first index we have [0.8407944139086141, 0.1592055860913865]. That means that we have ~0.84 probability of chance that the dealer is using the fair coin and ~0.16 for the biased coin. We also can plot a graph to show the behaviour of the curve of the probability of the dealer being using the fair coin (I used matplotlib for the graphs).

Probability of being a fair coin over time

This is only a superficial example of what can HMMs do. It's worthy give a look at it if you want do some sequence or time series analysis in any domain. I hope this post presented and cleared what are HMM and how they can be used to analyse data.

Marcel Caraciolo

I am a brazilian data scientist, entrepreneur, python hacker and technology consultant. Nowadays I work with data-centric applications, specially in machine learning, recommender systems and bioinformatics. I am also interested in distributed computing, high performance and data visualization, educational and bioinformatics ventures.

Until 2013 I was the co-founder of two companies Atepassar.com, a social network for students in Brazil and co-founder of PyCursos, a on-line startup for python training and on-line courses. In 2014, I assumed a new position at Genomika Diagnósticos, a brazilian genetics tests laboratory, as CTO.

My Github Repos

9 comments:

UnknownNovember 25, 2015 at 3:06 AM
Embedded system training: Wiztech Automation Provides Excellent training in embedded system training in Chennai - IEEE Projects - Mechanical projects in Chennai. Wiztech provide 100% practical training, Individual focus, Free Accommodation, Placement for top companies. The study also includes standard microcontrollers such as Intel 8051, PIC, AVR, ARM, ARMCotex, Arduino, etc.

Embedded system training in chennai
Embedded Course training in chennai
Matlab training in chennai
Android training in chennai
LabVIEW training in chennai
Robotics training in chennai
Oracle training in chennai
Final year projects in chennai
Mechanical projects in chennai
ece projects in chennai

ciitnoidaApril 24, 2018 at 11:29 PM
Best B.Tech College in Noida
maheshOctober 3, 2018 at 2:04 AM
Gaining Python certifications will validate your skills and advance your career.
python certification
amarDecember 20, 2018 at 1:34 AM
Good Post........
AI Training in Bangalore
Machine Learning Training in Bagalore
Python Training in Bangalore
AnonymousMay 24, 2019 at 3:02 AM
Very awesome!!! When I seek for this I found this website at the top of all blogs in search engine.
machine learning course in bangalore
AnonymousJune 4, 2019 at 1:54 AM
Excellent Blog! I would like to thank for the efforts you have made in writing this post. I am hoping the same best work from you in the future as well. I wanted to thank you for this websites! Thanks for sharing. Great websites! Now please do visit our website which will be very helpful.
machine learning course bangalore
jamesJune 26, 2019 at 10:46 PM
Amazing content.
Data Mining Service Providers in Bangalore
AnonymousJuly 11, 2020 at 3:01 AM
python training in bangalore | python online taining
aws training in bangalore | aws online training
artificial intelligence training in bangalore | artificial intelligence online training
machine learning training in bangalore | machine learning online training
data science training in bangalore | data science online training

jane hollySeptember 20, 2020 at 4:09 AM
This professional hacker is absolutely reliable and I strongly recommend him for any type of hack you require. I know this because I have hired him severally for various hacks and he has never disappointed me nor any of my friends who have hired him too, he can help you with any of the following hacks:

-Phone hacks (remotely)
-Credit repair
-Bitcoin recovery (any cryptocurrency)
-Make money from home (USA only)
-Social media hacks
-Website hacks
-Erase criminal records (USA & Canada only)
-Grade change
-funds recovery

Email: onlineghosthacker247@ gmail .com

Artificial Intelligence in Motion

Pages