Content Filtering

next-markdown-mirror aggressively filters non-content elements to give AI agents clean, focused Markdown. Here's how the filtering works and how to customize it.

Built-in filtering

By default, these elements are stripped before conversion:

Tags removed

nav, footer, header, form, script, style, noscript, iframe, svg, canvas, template

Selectors removed

Selector	Reason
`[data-md-skip]`	Explicit opt-out attribute
`[aria-hidden="true"]`	Hidden from assistive tech
`.sr-only`	Screen-reader only content
`.visually-hidden`	Visually hidden content
`button`	Interactive element
`input`	Form element
`select`	Form element
`textarea`	Form element

Images filtered

Icons and small images (smaller than 50x50px based on width/height attributes) are removed to reduce noise.

The `data-md-skip` attribute

The simplest way to exclude a specific element from Markdown output is to add data-md-skip to it:

<div class="sidebar" data-md-skip>
  <!-- This entire div will be excluded from Markdown -->
</div>

This works regardless of any other configuration. It's useful for excluding elements that are part of the main content area but shouldn't appear in the Markdown output (e.g., share buttons, related posts widgets).

contentSelectors

The contentSelectors option controls which part of the page is treated as the main content. Selectors are tried in order — the first match is used.

Default selectors:

['main', 'article', '[role="main"]', '.content', '.post', '#content', '#main']

If none match, the full <body> is used.

Example: target a specific container

export const GET = createMarkdownHandler({
  baseUrl: process.env.NEXT_PUBLIC_SITE_URL!,
  contentSelectors: ['.docs-content', '.blog-post', 'article'],
});

The converter will look for .docs-content first, then .blog-post, then article. This is useful when your site has different layouts for different page types.

excludeSelectors

Add additional CSS selectors for elements to remove. These are applied on top of the built-in exclusions.

export const GET = createMarkdownHandler({
  baseUrl: process.env.NEXT_PUBLIC_SITE_URL!,
  excludeSelectors: [
    '.sidebar',
    '.table-of-contents',
    '.breadcrumbs',
    '.comments-section',
    '.newsletter-signup',
  ],
});

includeSelectors

Force-include elements that would otherwise be excluded by the built-in filters. Include selectors take priority over exclude selectors.

export const GET = createMarkdownHandler({
  baseUrl: process.env.NEXT_PUBLIC_SITE_URL!,
  excludeSelectors: ['.sidebar'],
  includeSelectors: ['.sidebar .code-example'],
});

In this example, the sidebar is excluded, but any .code-example inside the sidebar is preserved.

Combining selectors

The filtering pipeline runs in this order:

contentSelectors — find the main content area
Built-in exclusions — remove known non-content tags and selectors
excludeSelectors — remove additional elements you specified
includeSelectors — re-include elements that were excluded but should be kept

export const GET = createMarkdownHandler({
  baseUrl: process.env.NEXT_PUBLIC_SITE_URL!,
  // Only look at the article content
  contentSelectors: ['article.blog-post'],
  // Remove extras within the article
  excludeSelectors: [
    '.author-bio',
    '.share-buttons',
    '.related-posts',
  ],
});

Example: documentation site

export const GET = createMarkdownHandler({
  baseUrl: process.env.NEXT_PUBLIC_SITE_URL!,
  contentSelectors: ['.docs-content', 'main'],
  excludeSelectors: [
    '.toc',
    '.edit-on-github',
    '.prev-next-nav',
    '.feedback-widget',
  ],
  // Keep code blocks even if they're inside excluded areas
  includeSelectors: ['pre', '.code-group'],
});

Using filterContent directly

For standalone usage outside of Next.js, you can call filterContent directly:

import { filterContent } from 'next-markdown-mirror';

const cleaned = filterContent(html, {
  exclude: ['.sidebar', '.ads'],
  include: ['pre'],
});

See Standalone Usage for more examples.

Built-in filtering​

Tags removed​

Selectors removed​

Images filtered​

The data-md-skip attribute​

contentSelectors​

Example: target a specific container​

excludeSelectors​

includeSelectors​

Combining selectors​

Example: blog with sidebar​

Example: documentation site​

Using filterContent directly​