aboutsummaryrefslogtreecommitdiff
AgeCommit message (Collapse)Author
2009-10-13notmuch-index-message: Avoid crashing when a message has no references.Carl Worth
It's obviously an innocent-enough message, and the right thing is so easy to do.
2009-10-13notmuch-index-message: Read message filenames from stdinCarl Worth
This allows for indexing an arbitrary number of messages with a single invocation rather than just a single message on the command line.
2009-10-13Move index_file out from main() into its own function.Carl Worth
This is a step toward having a program that will index many messages with a single invocation.
2009-10-13notmuch-index-message: Index References as well.Carl Worth
We're basically matching sup now! (As long as one uses sup with my special notmuch_index.rb file).
2009-10-13Minor code re-ordering for clarity.Carl Worth
Pull the "constant" source_id value out from among several calls that set a value based on the Message ID.
2009-10-13notmuch-index-message: Don't index the "re:" prefix in subjects.Carl Worth
Getting closer to sup results all the time.
2009-10-13notmuch-index-message: Don't index the line introducing a quote.Carl Worth
We identify it based on a trailing ':' on the line before a quote begins. At this point the database-dump diff between sup and notmuch is getting very, very small, (at least for our one test message).
2009-10-13notmuch-index-message: Don't index quoted lines and signatures.Carl Worth
At this point, we're achieving a result that is *very* close to what sup does. The only difference is that we are still indexing the "excerpts from message ..." line, and we are not yet indexing references.
2009-10-13notmuch-index-message: Separate gen_terms_body into its own functionCarl Worth
This one is complex enough to deserve its own treament.
2009-10-13notmuch-index-message: Add code to actually create a Xapian indexCarl Worth
Most of this code is fairly clean and works well. One part is fairly painful---namely extracting the body of an email message from libgmime. Currently, I'm just extracting the offset to the end of the headers, and then separately opening the message. Surely there's a better way. Anyway, with that the results are looking very similar to sup-sync now, (as verified by xapian-dump). The only substantial difference I'm seeing now is that sup does not seem to index quoted portions of messages nor signatures. I'm not actually sure whether I want to follow sup's lead in that or not.
2009-10-13Rename g_mime_test to notmuch-index-messageCarl Worth
In preparation for actually creating a Xapian index from the message, (not that we're doing that quite yet).
2009-10-13xapian-dump: Add a little mor indentationCarl Worth
Just to make it easier to visually identify where one document ends and the next begins.
2009-10-13Include document data in the dump.Carl Worth
At the same time, I've started hacking up sup with a new NotmuchIndex class in the place of the previous XapianIndex class. The new class stores only the source_info field in the document data, (rather than a serialized ruby hash with a bunch of data that can be found in the original message). Eventually, I plan to replace source_info with a relative filename for the message, (or even a list of filenames for when multiple messages in the database share a common message ID).
2009-10-13xapian-dump: Add support to unserialize values.Carl Worth
The interface for this is cheesy, (bare integer value numbers on the command line indicating that unserialization is desired for those value numbers). But this at least lets us print sup databases with human-readable output for the date values.
2009-10-13Add .gitignore file to ignore compiled binaries.Carl Worth
2009-10-13xapian-dump: Add values to the dump as well.Carl Worth
2009-10-13xapian-dump: Fix to dump all terms for each document ID.Carl Worth
2009-10-13xapian-dump: Actually dump document IDsCarl Worth
It's not a complete tool yet, but it at least does something now.
2009-10-13Remove unused variable.Carl Worth
Compiling with -Wall considered useful.
2009-10-13Add the beginnings of a xapian-dump program.Carl Worth
This will (when it is finished) make a much more reliable way to ensure that notmuch's sync program behaves identically to sup-sync. It doesn't actually do anything yet.
2009-10-13Add sup-compatible prefixes and achieve sup-compatible print outputCarl Worth
What I've done here is to instrument sup-sync to print the text and terms objects it constructs just before indexing a message. Then I've made my g_mime_test program achieve (nearly) identical output for an example email message, (just missing the body text). Next we can start shoving this data into a Xapian index.
2009-10-13Initial commit of a test program to form the basis of notmuch.Carl Worth
Basically just playing with some simple code using libgmime to parse an email message.