notmuch/index.cc, branch master

notmuch/index.cc, branch master thread-based email index, search, and tagging https://git.notmuchmail.org/git/notmuch/atom?h=master 2009-11-10T00:24:03Z libify: Move library sources down into lib directory. 2009-11-10T00:24:03Z Carl Worth cworth@cworth.org 2009-11-10T00:12:28Z urn:sha1:146549321044615d9aef2b30cedccda9c49f3f38 A "make" invocation still works from the top-level, but not from down inside the lib directory yet. index: Don't bother indexing quoted portions of messages (and signatures). 2009-10-28T22:41:42Z Carl Worth cworth@cworth.org 2009-10-28T22:41:42Z urn:sha1:56218ddbb4a72fdec534773f2bd4e85aec914ae9 Our old notmuch-index-message.cc code had this, but I originally left it out when adding indexing back in. I was concerned primarily with mistakenly detecting signature markers and omitting important text, (for example, I often do long lines of "----" as section separators). But now I see that there's a performance benefit to skippint the quotations, (about 120 files/sec. instead of 95 files/sec.). I mitigated the bogus signature checking by recognizing nothing other than the all-time classic "-- ". index: Store "Full Name <user@example.com>" addressses in the database 2009-10-28T20:09:08Z Carl Worth cworth@cworth.org 2009-10-28T20:09:08Z urn:sha1:3a91df21caddd952fe9a3e3ba8128e781a3f6ec5 We put these is as a separate term so that they can be extracted. We don't actually need this for searching, since typing an email address in as a search term will already trigger a phrase search that does exactly what's wanted. Add full-text indexing using the GMime library for parsing. 2009-10-28T19:50:10Z Carl Worth cworth@cworth.org 2009-10-28T17:42:07Z urn:sha1:f9bbd7baa07110c7f345c8413e2426d00382cb1c This is based on the old notmuch-index-message.cc from early in the history of notmuch, but considerably cleaned up now that we have some experience with Xapian and know just what we want to index, (rather than just blindly trying to index exactly what sup does). This does slow down notmuch_database_add_message a *lot*, but I've got some ideas for getting some time back.