Bez popisu

dstromberg 9606ba0024 Better duck typing for hashed objects před 13 roky
Makefile 0cf2a846ea clean rule removes seek.txt and array.txt, which are graphing temporaries před 13 roky
README 2c101e4dba Few minor README improvements před 13 roky
bloom_filter_mod.py 9606ba0024 Better duck typing for hashed objects před 13 roky
count-bits 5fb9908837 Larger blocks. Give a percentage of set/clear před 13 roky
gen-performance-graph 3c8ee83155 Misc graph formatting changes, including manually computing maximum y před 13 roky
test-bloom-filter 996179887e Basic graphing added - or rather, text file output for subsequent graphing před 13 roky
this-pylint c763fde8d6 Initial checkin před 14 roky

README


This bloom filter implementation:
1) Has a constructor that accepts a maximum comfortable number of members and maximum appropriate error (false positive) rate, and
derives the fiddly bits from that; most bloom filter modules ask the enduser to specify the fiddly bits themselves.
2) Has a nice test suite, including checks for error rate.
3) Is in pure Python that'll run on CPython 2.x, CPython 3.x, PyPy or Jython.
4) Has a pair of simple, fast hash functions that give a good error rate - they're better than many of the alternatives,
They're not Murmur or Jenkins, but the tests strongly suggest that they're working well.
5) Passes pylint and pep8.
6) Supports adding elements, testing for membership, and'ing sets and or'ing sets.

The code is derived from http://code.activestate.com/recipes/577686-bloom-filter/ and inherits that code's license.

For more about Bloom Filters:
http://en.wikipedia.org/wiki/Bloom_filter
http://spyced.blogspot.com/2009/01/all-you-ever-wanted-to-know-about.html