Fajri Abdillah

Scrapy Dragonball Multiverse

Dragonball is one of my favorite manga. Dragonball Multiverse is a doujinshi, not drawn by Akira Toriyama, but it is still awesome. So I wanted to crawl every single page so that I can read it offline.


  • Python

    Writing good Python code is fun, because the enforced indentation keeps it tidy and readable

  • Scrapy & XPATH

    Confusing at first, but it went well after some trial and error with XPath expressions, just to capture a string inside an HTML element
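
The XPath trial and error mentioned above can be sketched roughly as follows. This is only an illustration, not the crawler's actual code: it uses the limited XPath subset in Python's standard library (`xml.etree.ElementTree`) instead of Scrapy's selectors, so it runs without installing anything. The markup, the `comic` id, the `rel="next"` link, and the `extract_page` helper are all assumptions made up for the example, not the real structure of the site.

```python
import xml.etree.ElementTree as ET

# Hypothetical, well-formed markup standing in for one comic page.
# The real site's HTML will differ; ids and paths here are invented.
SAMPLE_PAGE = """
<html>
  <body>
    <div id="content">
      <img id="comic" src="/images/pages/page-0001.png" alt="Page 1" />
      <a rel="next" href="/page-2.html">Next</a>
    </div>
  </body>
</html>
""".strip()


def extract_page(markup):
    """Pull the comic image URL and the next-page link out of the markup."""
    root = ET.fromstring(markup)

    # ".//tag[@attr='value']" finds the first matching descendant element.
    image = root.find(".//img[@id='comic']")
    next_link = root.find(".//a[@rel='next']")

    return {
        "image_url": image.get("src") if image is not None else None,
        "next_url": next_link.get("href") if next_link is not None else None,
    }


result = extract_page(SAMPLE_PAGE)
print(result)
```

A real page is rarely well-formed XML, which is one reason a crawler would rely on Scrapy's HTML-aware selectors rather than `ElementTree`; the XPath expressions themselves carry over with little change.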




  • Python 2.6
  • Ubuntu Server 12.04
  • Apache 2
  • Scrapy 0.16
  • Redis


Lesson learned

Scrapy is a solid web crawling framework, and a fast one. If Dragonball is your favorite, well then your childhood was awesome.