tinman spent a few years mucking around industry before going back to school for a Masters. Currently not enjoying the weather in North England..
He wrote Perl that looked suspiciously like C code in 1998, while working as an intern, and has been trying to cure that bad habit ever since.
Something that always confuses me is that Java has so many different idioms for writing and reading from files. You have FileWriter/FileReader, FileInputStream, and BufferedWriters and Readers.. and I'm sure I've missed out a few (like Properties and so forth).
So, instead of spending time figuring out that the proper call (in most instances) is BufferedWriter wraps a FileWriter wraps a String which is a filename, I'm just going to write it down somewhere permanent.. Maybe a tattoo is in order ?
Hmm, indexing is kind of sort of done. Lucene is pretty darn fast, whew. My unofficial and informally obtained statistics say that 3GB of text (in 3k files) takes around 7-8 minutes to index. I should, ideally, find some spare time to install PLucene and see how well it does, in the pure index-speed arena.
I can now finally get on with the actual research. The base system is almost done.