nearley vs antlr4 vs pegjs vs jison | Parsing Libraries for JavaScript Comparison

Package	Downloads	Stars	Size	Issues	Publish	License

nearley	3,874,335	3,690	-	198	5 years ago	MIT
antlr4	1,285,087	18,068	3.09 MB	1,024	a year ago	BSD-3-Clause
pegjs	678,045	4,885	-	116	9 years ago	MIT
jison	75,124	4,376	-	161	8 years ago	MIT

Grammar Definition

nearley:
Nearley grammars are written in a BNF-style DSL and parsed with the Earley algorithm, so they can describe any context-free language, including ambiguous and left-recursive grammars. For ambiguous input, nearley returns every valid parse rather than silently picking one.
antlr4:
ANTLR4 grammars are written in a dedicated grammar syntax with EBNF-style operators, and a single file can hold both lexer and parser rules. Its error reporting, together with editor tooling such as syntax highlighting for grammar files, makes grammars easier to develop and debug.
pegjs:
PEG.js grammars are Parsing Expression Grammars (PEGs): alternatives are tried in order and the first one that matches wins, so a grammar is never ambiguous. Recursive rules are simple to express and the generated parser backtracks when an alternative fails, but left-recursive rules are not supported.
jison:
Jison uses a BNF-like syntax for defining grammars, which is straightforward and easy to understand. It is designed for simplicity, making it accessible for developers who may not have extensive experience with parsing.
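
To make the difference between PEG alternatives and BNF alternatives concrete, here is a minimal sketch of PEG-style ordered choice. The helpers `lit`, `seq`, `choice`, and `matchesAll` are hypothetical and written only for this illustration; they are not part of the PEG.js API. The point is that the order of alternatives changes what the grammar accepts, which is not the case for the BNF-style grammars used by nearley or Jison.

```javascript
// Minimal PEG-style combinators (hypothetical helpers, not the PEG.js API).
// A parser takes (input, pos) and returns the new position, or -1 on failure.
const lit = (s) => (input, pos) => (input.startsWith(s, pos) ? pos + s.length : -1);

// Sequence: run each parser in turn, failing as soon as one fails.
const seq = (...ps) => (input, pos) => {
  for (const p of ps) {
    pos = p(input, pos);
    if (pos < 0) return -1;
  }
  return pos;
};

// Ordered choice: commit to the first alternative that succeeds.
const choice = (...ps) => (input, pos) => {
  for (const p of ps) {
    const next = p(input, pos);
    if (next >= 0) return next;
  }
  return -1;
};

// True only if the parser consumes the whole input.
const matchesAll = (p, input) => p(input, 0) === input.length;

// start <- ("a" / "ab") "c"
const shortFirst = seq(choice(lit("a"), lit("ab")), lit("c"));
// start <- ("ab" / "a") "c"
const longFirst = seq(choice(lit("ab"), lit("a")), lit("c"));

console.log(matchesAll(shortFirst, "abc")); // false: "a" wins, then "c" fails on "b"
console.log(matchesAll(longFirst, "abc"));  // true
console.log(matchesAll(longFirst, "ac"));   // true
```

In the first grammar the alternative "ab" can never be reached, because "a" always matches first and the choice is not revisited once it has succeeded.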

Error Handling

nearley:
Nearley is a streaming parser: input is fed in chunks, and an error is raised as soon as a chunk cannot be parsed, so the failure can be caught and reported at the point where it occurs. This early, localized feedback is useful when debugging complex grammars.
antlr4:
ANTLR4 provides robust error handling mechanisms, including error recovery strategies and detailed error messages, which help developers identify and fix issues in their grammars effectively.
pegjs:
PEG.js has a straightforward error reporting mechanism that provides feedback on parsing errors. However, it may require additional effort to implement advanced error recovery strategies.
jison:
Jison offers basic error handling capabilities, allowing developers to define custom error messages. However, it may not be as comprehensive as ANTLR4 in terms of recovery strategies.
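
Whichever library is used, a readable error message usually comes down to mapping the failure offset back to a line and column and listing what the parser expected. The sketch below shows that step only. `describeError` is a hypothetical helper, and the offset and expected-token list are assumed to come from whatever error object the chosen parser provides; none of this is an API of the four libraries.

```javascript
// Hypothetical helper: turn a failure offset into a "line:column" message.
// `offset` and `expected` are assumed to be supplied by the parser's error object.
function describeError(input, offset, expected) {
  const before = input.slice(0, offset);
  const line = before.split("\n").length;                      // 1-based line number
  const column = offset - (before.lastIndexOf("\n") + 1) + 1;  // 1-based column
  const found = offset < input.length ? JSON.stringify(input[offset]) : "end of input";
  return `Syntax error at ${line}:${column}: expected ${expected.join(" or ")}, found ${found}`;
}

const source = "let x = 1\nlet y = ;\n";
console.log(describeError(source, 18, ["number", "identifier"]));
// Syntax error at 2:9: expected number or identifier, found ";"
```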

Performance

nearley:
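Nearley's Earley parser accepts any context-free grammar and copes well with ambiguous input. That generality has a cost: Earley parsing is cubic in the input length in the worst case, although it runs considerably faster on unambiguous grammars, so performance depends heavily on how the grammar is written.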
antlr4:
ANTLR4 is optimized for performance and can handle large input sizes efficiently. Its generated parsers are designed for speed, making it suitable for performance-critical applications.
pegjs:
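PEG.js generates parsers that are efficient for most use cases, but parse time can grow sharply on grammars that force the parser to backtrack over the same input repeatedly. An optional result cache mitigates this at the cost of extra memory.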
jison:
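Jison generates table-driven, bottom-up parsers (LALR(1) by default), which are generally performant for small to medium-sized grammars. Very large or complex grammars are more likely to run into grammar conflicts that have to be resolved by restructuring the rules.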
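
The backtracking cost mentioned for PEG parsers can be seen in a small hand-written example. This is a sketch under stated assumptions, not code generated by PEG.js: `makeParser` is a hypothetical helper that implements the two rules in the comments as a recursive-descent parser and counts how often rule `E` runs, with and without a memo table (the idea behind packrat parsing).

```javascript
// Count how often rule E is invoked, with and without a memo table.
// makeParser is a hypothetical helper written for this illustration.
function makeParser(input, useCache) {
  let calls = 0;
  const cache = new Map(); // position -> result of E at that position

  // E <- T "+" E / T
  function E(pos) {
    if (useCache && cache.has(pos)) return cache.get(pos);
    calls++;
    let result = -1;
    const t = T(pos);
    if (t >= 0 && input[t] === "+") {
      const rest = E(t + 1);
      if (rest >= 0) result = rest;
    }
    if (result < 0) result = T(pos); // backtrack: parse T again from the same position
    if (useCache) cache.set(pos, result);
    return result;
  }

  // T <- "(" E ")" / "a"
  function T(pos) {
    if (input[pos] === "(") {
      const e = E(pos + 1);
      return e >= 0 && input[e] === ")" ? e + 1 : -1;
    }
    return input[pos] === "a" ? pos + 1 : -1;
  }

  return { parse: () => E(0), calls: () => calls };
}

// Ten levels of nesting: ((((((((((a))))))))))
const nested = "(".repeat(10) + "a" + ")".repeat(10);
const plain = makeParser(nested, false);
const cached = makeParser(nested, true);
console.log(plain.parse(), plain.calls());   // 21 2047
console.log(cached.parse(), cached.calls()); // 21 11
```

Without the cache, every nesting level doubles the work because the first alternative fails only after the parenthesized term has been parsed in full. With the cache, each position is parsed once.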

Community and Ecosystem

nearley:
Nearley has the highest download count of the four packages in the table above and is valued for its flexibility. It has good documentation and examples, although its last release was five years ago and it has fewer learning resources than ANTLR4.
antlr4:
ANTLR4 has a large and active community, with extensive documentation, tutorials, and examples available. It also supports multiple target languages, making it versatile for various projects.
pegjs:
PEG.js has a moderate community presence, with sufficient documentation and examples. Its last release was nine years ago, however, and its ecosystem is not as extensive as ANTLR4's.
jison:
Jison has a smaller community compared to ANTLR4, but it is still supported by a dedicated user base. Documentation is available, though it may not be as extensive as ANTLR4's.

Learning Curve

nearley:
Nearley has a moderate learning curve, especially for developers familiar with parsing concepts. Its flexibility can be both an advantage and a challenge for newcomers.
antlr4:
ANTLR4 has a steeper learning curve due to its comprehensive feature set and complex grammar definitions. However, once mastered, it offers powerful capabilities for parsing.
pegjs:
PEG.js is relatively easy to learn, especially for those familiar with JavaScript. Its grammar definitions are intuitive, making it accessible for developers of all skill levels.
jison:
Jison is easy to learn and suitable for beginners, making it a good choice for those new to parsing. Its straightforward syntax allows for quick implementation.

nearley

nearley is a simple, fast and powerful parsing toolkit.

nearley is a streaming parser with support for catching errors gracefully and providing all parsings for ambiguous grammars. It is compatible with a variety of lexers (we recommend moo). It comes with tools for creating tests, railroad diagrams and fuzzers from your grammars, and has support for a variety of editors and platforms. It works in both node and the browser.

Unlike most other parser generators, nearley can handle any grammar you can define in BNF (and more!). In particular, while most existing JS parsers such as PEGjs and Jison choke on certain grammars (e.g. left recursive ones), nearley handles them easily and efficiently by using the Earley parsing algorithm.
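
To illustrate why left recursion is not a problem for an Earley parser, here is a minimal Earley recognizer. It is a sketch of the textbook algorithm under simplifying assumptions (single-character terminals, no empty rules, no parse trees), not nearley's implementation. The function name `earleyRecognize` and the grammar format are made up for this example.

```javascript
// Minimal Earley recognizer (a sketch: no parse trees, no empty rules).
// grammar maps a rule name to a list of alternatives; each alternative is a list of symbols.
// A symbol that is a key of the grammar is a nonterminal, otherwise a literal character.
function earleyRecognize(grammar, start, input) {
  const isRule = (sym) => Object.prototype.hasOwnProperty.call(grammar, sym);
  // chart[i] holds items {name, rhs, dot, origin} that are valid after i characters.
  const chart = Array.from({ length: input.length + 1 }, () => []);
  const add = (i, item) => {
    const dup = chart[i].some(
      (o) => o.name === item.name && o.rhs === item.rhs && o.dot === item.dot && o.origin === item.origin
    );
    if (!dup) chart[i].push(item);
  };

  for (const rhs of grammar[start]) add(0, { name: start, rhs, dot: 0, origin: 0 });

  for (let i = 0; i <= input.length; i++) {
    // chart[i] grows while it is processed, so iterate by index.
    for (let j = 0; j < chart[i].length; j++) {
      const item = chart[i][j];
      const next = item.rhs[item.dot];
      if (next === undefined) {
        // Completer: advance every item that was waiting for item.name.
        for (const parent of chart[item.origin]) {
          if (parent.rhs[parent.dot] === item.name) add(i, { ...parent, dot: parent.dot + 1 });
        }
      } else if (isRule(next)) {
        // Predictor: expand the nonterminal. add() drops duplicates, so a
        // left-recursive rule is predicted once instead of looping forever.
        for (const rhs of grammar[next]) add(i, { name: next, rhs, dot: 0, origin: i });
      } else if (i < input.length && input[i] === next) {
        // Scanner: consume one matching character.
        add(i + 1, { ...item, dot: item.dot + 1 });
      }
    }
  }
  // Accept if a completed start rule spans the whole input.
  return chart[input.length].some(
    (it) => it.name === start && it.origin === 0 && it.dot === it.rhs.length
  );
}

// Left-recursive grammar: Sum -> Sum "+" Num | Num
const sumGrammar = {
  Sum: [["Sum", "+", "Num"], ["Num"]],
  Num: [["1"], ["2"], ["3"]],
};
console.log(earleyRecognize(sumGrammar, "Sum", "1+2+3")); // true
console.log(earleyRecognize(sumGrammar, "Sum", "1+"));    // false
```

A naive recursive-descent parser would call `Sum` from `Sum` without consuming input and never terminate. Here the chart records that `Sum` has already been predicted at a position, so the recursion is cut off.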

nearley is used by a wide variety of projects.

nearley is an npm staff pick.

Contributing

Please read this document before working on nearley. If you are interested in contributing but unsure where to start, take a look at the issues labeled "up for grabs" on the issue tracker, or message a maintainer (@kach or @tjvr on GitHub).

nearley is MIT licensed.

A big thanks to Nathan Dinsmore for teaching me how to Earley, Aria Stewart for helping structure nearley into a mature module, and Robin Windels for bootstrapping the grammar. Additionally, Jacob Edelman wrote an experimental JavaScript parser with nearley and contributed ideas for EBNF support. Joshua T. Corbin refactored the compiler to be much, much prettier. Bojidar Marinov implemented postprocessors-in-other-languages. Shachar Itzhaky fixed a subtle bug with nullables.

Citing nearley

If you are citing nearley in academic work, please use the following BibTeX entry.

@misc{nearley,
  author = "Kartik Chandra and Tim Radvan",
  title  = "{nearley}: a parsing toolkit for {JavaScript}",
  year   = {2014},
  doi    = {10.5281/zenodo.3897993},
  url    = {https://github.com/kach/nearley}
}