Return Styles: Pseud0ch, Terminal, Valhalla, NES, Geocities, Blue Moon. Entire thread

BBS Find - the world4ch bbs searcher

Name: Anonymous 2009-01-02 22:38

http://bbsfind.appspot.com/

Sup /prog/
So I was recently approved for a google appspot, and decided to make this. It is a simple webapp which allows you to search for threads based upon title. It uses the subject.txt of the different text boards on dis.4chan.org.

At the moment, it can only search /prog/, but it isn't hard to extend it to all 19 of the current text boards.

Some notes:
* The datastore is using a subjec.txt from just before new year
* At the moment I am scraping the threads locally into CSV format, then uploading them to the datastore.
* Appengine does not currently support threads or cron-like background tasks, which makes automated updating a pain. There are a few hacks around this, though.
(see http://stackoverflow.com/questions/149307/background-tasks-on-appengine for an explanation of the problem)
* You may see some duplicate results, this is because I cancelled a CSV import one time, and had to start again. Ignore for now.

Things todo (high-low priority):
* Automated updating (possibly via http://code.google.com/p/gaeutilities/wiki/Cron). Will probably involve something like selecting the latest "last post time" field, then checking subject.txt for threads created after that time.
* Support for all text boards
* Search posts as well as threads. Not really sure what the best way to do this is though
* Order by title/date in asc/desc order (should be really easy, since all of the functionality is already in Query)

Name: Anonymous 2009-01-02 22:42

site:dis.4chan.org inurl:prog whatever

Name: Anonymous 2009-01-02 22:45

yes, you can use google's special keywords to search, however I found it occasionally missed stuff. This was more a learning exercise more then anything else.

Name: Anonymous 2009-01-02 22:48

>>2
I also plan to add some neat stuff, like a meter of how popular a thread is, etc.

I just realized that searching for "prog" yields the max number of records (1000). This is because I was storing board by name, and so all records have prog in its keyword list. haha

Name: Anonymous 2009-01-02 22:56

alias bbsfind='curl -s http://dis.4chan.org/prog/subject.txt | grep'

Name: Anonymous 2009-01-02 22:58

okay yes we get it there are easier ways to do this smoke a blunt or something

Name: Anonymous 2009-01-02 23:23

Name: Anonymous 2009-01-03 0:01

./progscrape.py; sqlite3 prog.db

Name: Anonymous 2009-01-03 0:05

Xello Harn.

Name: Anonymous 2009-01-03 1:22

>>1
You are part of the 3% of /prog/ that actually programs.  congrats!

Name: Anonymous 2009-01-03 2:11

>>10
3%? I won't settle for more than 1%!

Name: slowpoke 2009-01-03 6:15

Name: Anonymous 2009-08-16 17:21

that questions a The left

Name: Anonymous 2013-08-08 16:56

why it only searches 2008 and before?

Name: sage 2013-08-08 17:01

>>16
* The datastore is using a subjec.txt from just before new year

Name: Anonymous 2013-08-08 17:44

Name: Anonymous 2013-08-08 17:53

>>17
Of 2008?

Name: Anonymous 2013-08-08 19:28

>>19
Have you realized why you're a retard yet?

Name: Anonymous 2013-08-08 21:13

>>20
no, i too dumb

Newer Posts
Don't change these.
Name: Email:
Entire Thread Thread List