Ticket #37: pyndexter-xapian.diff

File pyndexter-xapian.diff, 0.6 kB (added by Dirk Datzert <dummy@habmalnefrage.de>, 2 years ago)

Patch for the xapian indexer to check the word length and skip terms longer than FLINT_BTREE_MAX_KEY_LEN (240)

  • pyndexter-0.2/pyndexter/indexers/xapian.py

    old new
    48  48
    49  49        doc.add_term('Q' + uri)
    50  50
        51        FLINT_BTREE_MAX_KEY_LEN = 240
        52
    51  53        for word in self.words.finditer(content):
    52  54            term = self.stemmer.stem_word(word.group().lower())
        55            if len(term) > FLINT_BTREE_MAX_KEY_LEN:
        56                continue
    53  57            doc.add_posting(term, word.start())
    54  58
    55  59        self.db.replace_document('Q' + uri, doc)
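
The length check in the patch can be tried outside the indexer with a small standalone sketch. It is only an illustration, not the pyndexter code: WORD_RE and stem() are hypothetical stand-ins for self.words and self.stemmer.stem_word, and indexable_postings() is a made-up helper name. One deliberate difference from the patch is that the sketch measures the UTF-8 encoded length instead of len(term), on the assumption that the backend's key limit applies to bytes rather than characters.

```python
import re

# Value taken from the patch; assumed to be a safe bound for the backend's key size.
FLINT_BTREE_MAX_KEY_LEN = 240

# Hypothetical stand-in for self.words in the indexer.
WORD_RE = re.compile(r"\w+")


def stem(word):
    """Hypothetical stand-in for self.stemmer.stem_word; returns the word unchanged."""
    return word


def indexable_postings(content, max_len=FLINT_BTREE_MAX_KEY_LEN):
    """Yield (term, position) pairs, skipping terms longer than max_len bytes."""
    for match in WORD_RE.finditer(content):
        term = stem(match.group().lower())
        # Assumption: the limit is on the encoded term, so count bytes, not characters.
        if len(term.encode("utf-8")) > max_len:
            continue
        yield term, match.start()
```

Each yielded pair corresponds to one doc.add_posting(term, position) call in the patched loop. Over-long terms are dropped silently, which matches the behaviour of the patch.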