remark-parse vs remark-rehype vs remark-retext
Markdown Processing Libraries Comparison
1 Year
remark-parseremark-rehyperemark-retextSimilar Packages:
What's Markdown Processing Libraries?

These libraries are part of the unified ecosystem, which is designed to process and transform content written in Markdown. They provide a powerful and flexible way to parse, transform, and output Markdown content into different formats, such as HTML or plain text. Each library serves a specific purpose in the Markdown processing pipeline, allowing developers to build custom workflows for content transformation, making it easier to integrate Markdown into various applications and platforms.

Package Weekly Downloads Trend
Github Stars Ranking
Stat Detail
Package
Downloads
Stars
Size
Issues
Publish
License
remark-parse12,208,8668,19319.5 kB12 years agoMIT
remark-rehype6,758,63029548.3 kB02 months agoMIT
remark-retext121,9243020.9 kB02 months agoMIT
Feature Comparison: remark-parse vs remark-rehype vs remark-retext

Parsing Capability

  • remark-parse:

    remark-parse is specifically designed to convert Markdown text into an abstract syntax tree (AST). It supports the full Markdown syntax and provides a robust parsing mechanism that can handle edge cases, ensuring that the resulting AST accurately represents the original Markdown structure.

  • remark-rehype:

    remark-rehype does not parse Markdown itself; instead, it takes an existing Markdown AST (produced by remark-parse) and converts it into HTML. It relies on the AST structure to generate valid HTML, ensuring that the output is semantically correct and adheres to HTML standards.

  • remark-retext:

    remark-retext does not focus on parsing Markdown but rather on processing the text content. It allows developers to analyze and manipulate the text for various NLP tasks, making it a complementary tool in the Markdown processing workflow.

Transformation Flexibility

  • remark-parse:

    remark-parse is primarily a parsing tool and does not perform transformations on the Markdown content itself. However, it lays the groundwork for further transformations by providing a structured AST that can be manipulated by other tools in the unified ecosystem.

  • remark-rehype:

    remark-rehype excels in transforming the Markdown AST into HTML, allowing for extensive customization of the output. Developers can easily modify the AST before conversion to add custom attributes, classes, or even additional HTML elements, making it highly flexible for various use cases.

  • remark-retext:

    remark-retext provides flexibility in processing the text content of Markdown, enabling developers to apply various NLP transformations. This can include grammar corrections, text analysis, and other modifications, allowing for a wide range of applications in content management and analysis.

Integration with Unified Ecosystem

  • remark-parse:

    As part of the unified ecosystem, remark-parse seamlessly integrates with other libraries like remark-rehype and remark-retext, allowing for a smooth workflow from Markdown parsing to HTML rendering or text processing. This integration facilitates the creation of complex content processing pipelines.

  • remark-rehype:

    remark-rehype is designed to work closely with remark-parse, enabling a straightforward transition from Markdown AST to HTML. Its integration with the unified ecosystem allows developers to leverage other tools and plugins for enhanced HTML processing.

  • remark-retext:

    remark-retext integrates with the retext ecosystem, allowing for advanced text processing capabilities. Its compatibility with other libraries in the unified ecosystem makes it easy to incorporate NLP features into Markdown workflows.

Use Cases

  • remark-parse:

    remark-parse is ideal for applications that require Markdown content to be parsed into a structured format for further manipulation, such as static site generators, content management systems, or any tool that needs to process Markdown input.

  • remark-rehype:

    remark-rehype is best suited for scenarios where Markdown content needs to be rendered as HTML, such as web applications, blogs, or any platform that displays Markdown content to users in a web format.

  • remark-retext:

    remark-retext is perfect for applications focused on text analysis or content validation, such as writing assistants, grammar checkers, or any tool that aims to enhance the quality of Markdown text through NLP techniques.

Learning Curve

  • remark-parse:

    remark-parse has a moderate learning curve, especially for developers familiar with AST concepts. Understanding how to manipulate the AST for further processing may require some initial investment in learning.

  • remark-rehype:

    remark-rehype is relatively easy to use for those already familiar with HTML and Markdown. Its primary function is straightforward, but customizing the output may require a deeper understanding of the AST structure.

  • remark-retext:

    remark-retext may have a steeper learning curve for those unfamiliar with NLP concepts. Understanding how to effectively use the library for text processing tasks can take time, but it offers powerful capabilities once mastered.

How to Choose: remark-parse vs remark-rehype vs remark-retext
  • remark-parse:

    Choose remark-parse if your primary need is to parse Markdown content into an abstract syntax tree (AST). This package is essential for breaking down Markdown into a structured format that can be further manipulated or transformed by other tools in the unified ecosystem.

  • remark-rehype:

    Select remark-rehype if you need to convert Markdown AST into HTML. This package acts as a bridge between the Markdown parsing and HTML rendering stages, allowing for additional transformations and enhancements to the HTML output, such as adding classes or attributes to elements.

  • remark-retext:

    Opt for remark-retext if your focus is on processing the text content of Markdown for natural language processing (NLP) tasks. This package integrates with the retext ecosystem, enabling features like grammar checking, sentiment analysis, or other text-based transformations.

README for remark-parse

remark-parse

Build Coverage Downloads Size Sponsors Backers Chat

remark plugin to add support for parsing from markdown.

Contents

What is this?

This package is a unified (remark) plugin that defines how to take markdown as input and turn it into a syntax tree.

See the monorepo readme for info on what the remark ecosystem is.

When should I use this?

This plugin adds support to unified for parsing markdown. If you also need to serialize markdown, you can alternatively use remark, which combines unified, this plugin, and remark-stringify.

If you just want to turn markdown into HTML (with maybe a few extensions), we recommend micromark instead. If you don’t use plugins and want to access the syntax tree, you can directly use mdast-util-from-markdown. remark focusses on making it easier to transform content by abstracting these internals away.

You can combine this plugin with other plugins to add syntax extensions. Notable examples that deeply integrate with it are remark-gfm, remark-mdx, remark-frontmatter, remark-math, and remark-directive. You can also use any other remark plugin after remark-parse.

Install

This package is ESM only. In Node.js (version 16+), install with npm:

npm install remark-parse

In Deno with esm.sh:

import remarkParse from 'https://esm.sh/remark-parse@11'

In browsers with esm.sh:

<script type="module">
  import remarkParse from 'https://esm.sh/remark-parse@11?bundle'
</script>

Use

Say we have the following module example.js:

import rehypeStringify from 'rehype-stringify'
import remarkGfm from 'remark-gfm'
import remarkParse from 'remark-parse'
import remarkRehype from 'remark-rehype'
import {unified} from 'unified'

const doc = `
# Mercury

**Mercury** is the first planet from the [Sun](https://en.wikipedia.org/wiki/Sun)
and the smallest planet in the Solar System.
`

const file = await unified()
  .use(remarkParse)
  .use(remarkGfm)
  .use(remarkRehype)
  .use(rehypeStringify)
  .process(doc)

console.log(String(file))

…then running node example.js yields:

<h1>Mercury</h1>
<p><strong>Mercury</strong> is the first planet from the <a href="https://en.wikipedia.org/wiki/Sun">Sun</a>
and the smallest planet in the Solar System.</p>

API

This package exports no identifiers. The default export is remarkParse.

unified().use(remarkParse)

Add support for parsing from markdown.

Parameters

There are no parameters.

Returns

Nothing (undefined).

Examples

Example: support GFM and frontmatter

We support CommonMark by default. Non-standard markdown extensions can be enabled with plugins.

This example shows how to support GFM features (autolink literals, footnotes, strikethrough, tables, tasklists) and frontmatter (YAML):

import rehypeStringify from 'rehype-stringify'
import remarkFrontmatter from 'remark-frontmatter'
import remarkGfm from 'remark-gfm'
import remarkParse from 'remark-parse'
import remarkRehype from 'remark-rehype'
import {unified} from 'unified'

const doc = `---
layout: solar-system
---

# Hi ~~Mars~~Venus!
`

const file = await unified()
  .use(remarkParse)
  .use(remarkFrontmatter)
  .use(remarkGfm)
  .use(remarkRehype)
  .use(rehypeStringify)
  .process(doc)

console.log(String(file))

Yields:

<h1>Hi <del>Mars</del>Venus!</h1>

Example: turning markdown into a man page

Man pages (short for manual pages) are a way to document CLIs (example: type man git-log in your terminal). They use an old markup format called roff. There’s a remark plugin, remark-man, that can serialize as roff.

This example shows how to turn markdown into man pages by using unified with remark-parse and remark-man:

import remarkMan from 'remark-man'
import remarkParse from 'remark-parse'
import {unified} from 'unified'

const doc = `
# titan(7) -- largest moon of saturn

Titan is the largest moon…
`

const file = await unified().use(remarkParse).use(remarkMan).process(doc)

console.log(String(file))

Yields:

.TH "TITAN" "7" "September 2023" "" ""
.SH "NAME"
\fBtitan\fR - largest moon of saturn
.P
Titan is the largest moon…

Syntax

Markdown is parsed according to CommonMark. Other plugins can add support for syntax extensions. If you’re interested in extending markdown, more information is available in micromark’s readme.

Syntax tree

The syntax tree used in remark is mdast.

Types

This package is fully typed with TypeScript. It exports the additional type Options (which is currently empty).

Compatibility

Projects maintained by the unified collective are compatible with maintained versions of Node.js.

When we cut a new major release, we drop support for unmaintained versions of Node. This means we try to keep the current release line, remark-parse@^11, compatible with Node.js 16.

Security

As markdown can be turned into HTML and improper use of HTML can open you up to cross-site scripting (XSS) attacks, use of remark can be unsafe. When going to HTML, you will combine remark with rehype, in which case you should use rehype-sanitize.

Use of remark plugins could also open you up to other attacks. Carefully assess each plugin and the risks involved in using them.

For info on how to submit a report, see our security policy.

Contribute

See contributing.md in remarkjs/.github for ways to get started. See support.md for ways to get help. Join us in Discussions to chat with the community and contributors.

This project has a code of conduct. By interacting with this repository, organization, or community you agree to abide by its terms.

Sponsor

Support this effort and give back by sponsoring on OpenCollective!

Vercel

Motif

HashiCorp

GitBook

Gatsby

Netlify

Coinbase

ThemeIsle

Expo

Boost Note

Markdown Space

Holloway


You?

License

MIT © Titus Wormer