Tag: content-extraction

tool

CLI for Summarizing URLs, Files, and Videos

This fast command-line interface allows developers to summarise content from various sources, including remote URLs, local files, and YouTube videos. It supports advanced extraction modes, model selection, and structured JSON output for rob…

casibase/casibase cli summarization urls files
tool ★ 149

Universal Content Extraction and Summarization Tool

This utility extracts text content from a wide array of sources, including URLs, PDFs, media files, and web pages. It supports advanced features like structured data extraction and LLM-powered summarisation.

lfnovo/content-core content-extraction web-scraping document-parsing media-transcription
skill ★ 3

Codebase Normalization and Content Extraction

This skill normalizes codebases by extracting hardcoded strings into a managed content repository and subsequently patching the source files with content references. This two-phase process ensures content is centralized, translatable, and m…

Contentrain/ai content-extraction code-normalization i18n content-management
tool ★ 372,633

Advanced Web Search and Content Extraction Tool

This suite provides structured tools for advanced web research, allowing developers to perform targeted searches with filters (e.g., topic, time range) or extract clean, chunked content from specific URLs. It supports both basic and advance…

openclaw/openclaw web-search content-extraction research data-gathering
skill ★ 732

Intlayer Automatic Content Extraction Compiler

Automatically extracts translatable content from components to eliminate the need for manual content file creation. It supports configuration for Vite and Next.js environments.

aymericzip/intlayer intlayer content-extraction react nextjs
tool ★ 90

RivalSearchMCP: Comprehensive Deep Research Toolset

A multi-source research toolset providing deterministic access to web, social, news, and academic databases. It enables structured content extraction, website mapping, and persistent research workspaces without requiring API keys.

damionrashford/RivalSearchMCP mcp web-search deep-research content-extraction
tool ★ 3

Web Scraping via MCP Server

Extracts clean Markdown content, links, and metadata from URLs using Mozilla Readability. It does not execute JavaScript, so it is best suited to server-rendered pages.

ofershap/mcp-server-scraper mcp web-scraping content-extraction metadata-extraction
tool ★ 2

Markdown Content Search and Extraction

Search, navigate, and extract structured content from local Markdown files using heading-based extraction and code block identification. It supports full-text search and YAML frontmatter parsing.

ofershap/mcp-server-markdown mcp markdown content-extraction documentation-search
tool ★ 8

WebPeel: Clean Web Content Extraction

Extracts clean, structured Markdown from any URL while significantly reducing token usage for LLM agents. It handles JavaScript rendering, bot protection, and provides specialised extractors for platforms like YouTube and GitHub.

webpeel/webpeel web-scraping mcp llm content-extraction
tool ★ 1

Web Screenshot and Content Extraction Tool

This tool provides programmatic access to web rendering services, enabling agents to capture pixel-perfect screenshots or extract structured content (Markdown, HTML, plain text) from any URL. It supports advanced features like device emulat…

User0856/snaprender-integrations web-scraping screenshot content-extraction api-tool