I provide consulting and custom development for Natural Language Processing, Information Extraction and Search solutions.


I Build Search

Dev Stuff

PHP classes for NLP

A set of classes to perform Natural Language Processing tasks. Some of the code is ported from the NLTK project.

Document Tagger

DocTagger lets you automatically classify text documents. Use this as a starting point to write apps that can sort through volumes of unorganized data.

Yammy

Decode and view any archived Yahoo Messenger conversation. No password required!

Yux

An alternate Yahoo Messenger client for *nix systems that attempts to match the look and feel of the official Windows client.

Yahoo Packet Analyzer

This tool will show you the raw YMSG (Yahoo Messenger) packets passing through your machine. Developers and non-developers can use this to understand the YMSG protocol better.

Indic to English Transliterator

Transliteration is the process of converting a word from one language to another while retaining its phonetic characteristics. This application lets you convert a word from any major Indian language (currently supports Hindi, Marathi, Sanskrit and Bengali) to English.

Deebot

Deeb0t is an IRC chat bot capable of making meaningful conversation with other users. It also responds to commands issued by its owner.

Plugins

A collection of WordPress plugins that I've developed.

Themes

A collection of WordPress templates that I've designed over the years.

RediffBlog Crawler

BlogCrawler enables you to save your entire rediffblogs blog to disk along with all the comments!

Latest Articles

Feb 19

Join a list of integers in Python

How do you run a string join on a list of integers in Python? After googling for about 10 mins, I gave up and did this. I am sure there is a better way of doing it!
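
For reference, one common way to do it (a minimal sketch, not necessarily the approach the article settles on): `str.join` only accepts strings, so each integer is converted with `str` first. The helper name `join_ints` and the sample list are made up for illustration.

```python
def join_ints(numbers, sep=","):
    """Join integers into a single string.

    str.join raises TypeError on non-string items, so every
    number is converted with str() before joining.
    """
    return sep.join(str(n) for n in numbers)


# Hypothetical sample data, just to show the call.
numbers = [1, 2, 3, 10]
joined = join_ints(numbers)
print(joined)  # 1,2,3,10
```

`sep.join(map(str, numbers))` is an equivalent one-liner if you don't need a helper.
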

Jan 21

Writing a spider in 10 mins using Scrapy

I came across Scrapy a few days back and have grown to really love it. This tutorial will illustrate how you can write a simple spider using Scrapy to scrape data off Paul Smith. All this in 10 minutes.

Featured Projects

Document Tagger

DocTagger lets you automatically classify text documents. Use this as a starting point to write apps that can sort through volumes of unorganized data.

Deebot

Deeb0t is an IRC chat bot capable of making meaningful conversation with other users. It also responds to commands issued by its owner.

This page and its contents are copyright © 2010, Pravin Paratey.