fastest-levenshtein vs natural vs levenshtein-edit-distance | String Similarity Measurement Libraries

Package	Downloads	Stars	Size	Issues	Publish	License

fastest-levenshtein	18,200,448	742	21.3 kB	1	-	MIT
natural	308,974	10,857	13.8 MB	78	7 months ago	MIT
levenshtein-edit-distance	206,261	72	12.4 kB	0	-	MIT

Package

Downloads

Stars

Size

Issues

Publish

License

fastest-levenshtein

18,200,448

742

21.3 kB

MIT

natural

308,974

10,857

13.8 MB

7 months ago

MIT

levenshtein-edit-distance

206,261

12.4 kB

MIT

Performance

fastest-levenshtein:
fastest-levenshtein is optimized for speed, making it the fastest option among the three. It uses efficient algorithms to minimize computational overhead, making it suitable for high-frequency operations where performance is paramount.
natural:
natural may not be as fast as the other two for pure distance calculations, as it focuses on a broader set of NLP functionalities. However, it is efficient for tasks that require multiple NLP features.
levenshtein-edit-distance:
levenshtein-edit-distance offers a balance between performance and simplicity. While not as fast as fastest-levenshtein, it provides reasonable performance for most applications without the complexity of optimization.

Functionality

fastest-levenshtein:
fastest-levenshtein specializes solely in calculating Levenshtein distance, providing a focused and efficient solution for string similarity measurement.
natural:
natural is a versatile library that includes various NLP features such as tokenization, stemming, and classification, in addition to string similarity measures, making it suitable for comprehensive text analysis.
levenshtein-edit-distance:
levenshtein-edit-distance is dedicated to Levenshtein distance calculations, offering a straightforward API that is easy to integrate into projects without additional overhead.

Ease of Use

fastest-levenshtein:
fastest-levenshtein has a simple API that allows for quick implementation, making it user-friendly for developers looking for a straightforward solution to string distance calculations.
natural:
natural, while feature-rich, may require a steeper learning curve due to its extensive functionalities. However, it provides thorough documentation to assist users in navigating its features.
levenshtein-edit-distance:
levenshtein-edit-distance is designed with clarity in mind, providing an intuitive interface that is easy to understand and use, especially for beginners or educational purposes.

Use Cases

fastest-levenshtein:
best suited for applications requiring real-time performance, such as search engines, autocomplete features, or any scenario where rapid string comparison is essential.
natural:
perfect for applications that require a combination of string similarity and other NLP tasks, such as chatbots, text analysis tools, or any project that benefits from a broader NLP toolkit.
levenshtein-edit-distance:
ideal for projects where simplicity and clarity are prioritized, such as educational tools or basic applications needing string comparison without complex requirements.

Community and Support

fastest-levenshtein:
has a smaller community but is focused on performance, which may appeal to developers prioritizing speed over extensive features.
natural:
boasts a larger community and extensive documentation, making it easier to find resources, tutorials, and support for a wide range of NLP tasks.
levenshtein-edit-distance:
has a moderate user base, providing adequate community support and documentation for basic usage.

Usage

Node

const {distance, closest} = require('fastest-levenshtein') // Print levenshtein-distance between 'fast' and 'faster' console.log(distance('fast', 'faster')) //=> 2 // Print string from array with lowest edit-distance to 'fast' console.log(closest('fast', ['slow', 'faster', 'fastest'])) //=> 'faster'

Deno

import {distance, closest} from 'https://deno.land/x/fastest_levenshtein/mod.ts' // Print levenshtein-distance between 'fast' and 'faster' console.log(distance('fast', 'faster')) //=> 2 // Print string from array with lowest edit-distance to 'fast' console.log(closest('fast', ['slow', 'faster', 'fastest'])) //=> 'faster'

Benchmark

I generated 500 pairs of strings with length N. I measured the ops/sec each library achieves to process all the given pairs. Higher is better.

Test Target	N=4	N=8	N=16	N=32	N=64	N=128	N=256	N=512	N=1024
fastest-levenshtein	44423	23702	10764	4595	1049	291.5	86.64	22.24	5.473
js-levenshtein	21261	10030	2939	824	223	57.62	14.77	3.717	0.934
leven	19688	6884	1606	436	117	30.34	7.604	1.929	0.478
fast-levenshtein	18577	6112	1265	345	89.41	22.70	5.676	1.428	0.348
levenshtein-edit-distance	22968	7445	1493	409	109	28.07	7.095	1.789	0.445

Test Target

N=4

N=8

N=16

N=32

N=64

N=128

N=256

N=512

N=1024

fastest-levenshtein

44423

23702

10764

4595

1049

291.5

86.64

22.24

5.473

js-levenshtein

21261

10030

2939

824

223

57.62

14.77

3.717

0.934

leven

19688

6884

1606

436

117

30.34

7.604

1.929

0.478

fast-levenshtein

18577

6112

1265

345

89.41

22.70

5.676

1.428

0.348

levenshtein-edit-distance

22968

7445

1493

409

109

28.07

7.095

1.789

0.445

Relative Performance

This image shows the relative performance between fastest-levenshtein and js-levenshtein (the 2nd fastest). fastest-levenshtein is always a lot faster. y-axis shows "times faster".