rss
logo

I provide consulting and custom development for Natural Language Processing, Information Extraction and Search solutions.Self Picture


 learn more   get in touch 

Logo - I Build Search
Aug 05
2008

RediffBlog Crawler digg

Wouldn’t it be great if you could save your rediffblog to disk? You could then browse through your posts and comments even when you werent connected to the internet. You could give your whole blog to a friend, move to another blog host, or even delete your blog but save a copy of it.

That is where BlogCrawler comes in. BlogCrawler enables you to save your entire blog to disk along with all the comments!

Installation and Uninstallation

Download BlogCrawler 0.1
(Requires the .NET 2.0 Framework)

To install, double click the setup.exe file and follow the instructions. BlogCrawler comes with an uninstaller that you can access either through the Start Menu or through the Add/Remove Programs section of the the Control Panel.

User Guide

Setting up your blog for crawling

The first step is to add unique tags before and after your post in the Template. This tells BlogCrawler where each post starts and ends. Use the two strings <!-- BlogCrawler Start --> and <!-- BlogCrawler End --> between <rediffBlog> and </rediffBlog>.

Remember to save your template and publish your blog after you’re done

Running BlogCrawler

  1. Now, fire up BlogCrawler.
  2. Click on Next to go to Step 2. Enter the fields.

  3. Enter the start and end strings.

  4. Wait while your blog is downloaded.

  5. Thats it! You’re done. Open the containing folder to see your posts with comments :-)

Sample Output


View Sample Page

Troubleshooting

In case of issues, feel free to get in touch: pravinp -at- gmail -dot- com

Awards

Softpedia clean

Leave a Reply

XHTML: You can use these tags: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong> <pre lang="" line="" escaped="">

Latest Articles

Apr
07

Palindromic sub-sequences in python

This bit of python code returns all palindromic subsequences in the input string.

[Read More]
Feb
19

Join a list of integers in Python

How do you run a string join on a list of integers in Python? After googling for about 10 mins, I gave up and did this. I am sure there is a better way of doing it!

[Read More]

Featured Projects

Deebot

Deebot

Deeb0t is an IRC chat bot capable of making meaningful conversation with other users. It also responds to commands issued by its owner.

[Read More]

Document Tagger

Document Tagger

DocTagger lets you automatically classify text documents. Use this as a starting point to write apps that can sort through volumes of unorganized data.

[Read More]

This page and its contents are copyright © 2010, Pravin Paratey.