bug-gperf
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Ann: Working on mph algos


From: Reini Urban
Subject: Ann: Working on mph algos
Date: Sat, 19 Feb 2022 13:57:36 +0100

I just ported the NetBSD nbperf code to linux, and found it quite useful.
3 new minimal perfect hash algos, with different space-time tradeoffs.

1. CHM with order preserving keyword array, which is esp. useful for range searches.
e.g. you search for the first hit, and can immediately get the next 10 keywords also.
2. efficiently generate minimal perfect hashes, esp. with >15.000 keywords. BPZ is
extremely space efficient, but if false-positives can be queried, the original list must be stored also.

Of course it's much slower than the default gperf algo, but for large lists gperf is not well
suited. Also the ordering property of CHM is nice.

Working here, merging it into main, search and output.
https://github.com/rurban/gperf/commits/nbperf

The original code: https://github.com/rurban/nbperf (BSD3 Licensed)
I'll bug assign@gnu for the assignment.
Do we need permission from the original author also, or is BSD enough?
--
Reini Urban

reply via email to

[Prev in Thread] Current Thread [Next in Thread]