Knowing the 'hidden' class re.Scanner in Python and how to use it: Building a simple Tweet Extractor

Wednesday, September 8, 2010

Hi all,

I've written a short post on my personal blog, Mobideia (yes, I have one, though it talks more about my daily life, personal projects and geek stuff), about the 'hidden' Scanner class in the re module, Python's module for working with regular expressions.

The practical example is a simple Tweet Extractor that identifies URLs, usernames (@joao), hashtags (#django), retweets (RT) and words. It may be very useful if you want to play with Natural Language Processing (NLP) and text mining on Twitter.
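To give a taste of the idea, here is a minimal sketch of such an extractor. It is not the code from the linked post: the patterns, the token labels and the extract_tokens helper are my own assumptions for illustration. Keep in mind that re.Scanner is undocumented, so its behavior is not guaranteed to stay the same across Python versions.

```python
import re

# Each lexicon entry pairs a pattern with a callback that receives
# (scanner, matched_text). A callback of None drops the match.
# Order matters: the first pattern that matches at the current position wins.
# The patterns and labels below are illustrative assumptions.
tweet_scanner = re.Scanner([
    (r"https?://\S+", lambda scanner, text: ("URL", text)),
    (r"RT\b", lambda scanner, text: ("RETWEET", text)),
    (r"@\w+", lambda scanner, text: ("USERNAME", text)),
    (r"#\w+", lambda scanner, text: ("HASHTAG", text)),
    (r"\w+", lambda scanner, text: ("WORD", text)),
    (r"\s+", None),  # skip whitespace
    (r".", None),    # skip any other single character, e.g. punctuation
])


def extract_tokens(tweet):
    """Return a list of (label, text) pairs found in the tweet."""
    # scan() returns the collected tokens plus any unmatched tail of the input.
    tokens, remainder = tweet_scanner.scan(tweet)
    return tokens


if __name__ == "__main__":
    sample = "RT @joao: loving #django http://example.com/x now"
    for label, text in extract_tokens(sample):
        print(label, text)
```

Putting the more specific patterns (URL, RT, @username, #hashtag) before the generic word pattern is what keeps them from being swallowed as plain words.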

To avoid duplicating the whole text here, I've placed the links to the post below:

Portuguese Version

English Version (by Google Translate):

I hope you enjoy!


Marcel Caraciolo


  1. Just wanted to say thanks for this and all the other uploads. Much appreciated!


