Tag: content-extraction
CLI for Summarizing URLs, Files, and Videos
This fast command-line interface allows developers to summarise content from various sources, including remote URLs, local files, and YouTube videos. It supports advanced extraction modes, model selection, and structured JSON output for rob…
Universal Content Extraction and Summarization Tool
This utility extracts text content from a wide array of sources, including URLs, PDFs, media files, and web pages. It supports advanced features like structured data extraction and LLM-powered summarisation.
Codebase Normalization and Content Extraction
This skill normalizes codebases by extracting hardcoded strings into a managed content repository and subsequently patching the source files with content references. This two-phase process ensures content is centralized, translatable, and m…
Advanced web search and content extraction tool
This suite provides structured tools for advanced web research, allowing developers to perform targeted searches with filters (e.g., topic, time range) or extract clean, chunked content from specific URLs. It supports both basic and advance…
Intlayer Automatic Content Extraction Compiler
Automatically extracts translatable content from components to eliminate the need for manual content file creation. It supports configuration for Vite and Next.js environments.
RivalSearchMCP: Comprehensive Deep Research Toolset
A multi-source research toolset providing deterministic access to web, social, news, and academic databases. It enables structured content extraction, website mapping, and persistent research workspaces without requiring API keys.
Web Scraping via MCP Server
Extracts clean Markdown content, links, and metadata from URLs using Mozilla Readability. It is optimised for server-rendered pages but does not execute JavaScript.
Markdown Content Search and Extraction
Search, navigate, and extract structured content from local markdown files using heading-based extraction and code block identification. It supports full-text search and YAML frontmatter parsing.
WebPeel: Clean Web Content Extraction
Extracts clean, structured markdown from any URL while significantly reducing token usage for LLM agents. It handles JavaScript rendering, bot protection, and provides specialised extractors for platforms like YouTube and GitHub.
web screenshot and content extraction tool
This tool provides programmatic access to web rendering services, enabling agents to capture pixel-perfect screenshots or extract structured content (Markdown, HTML, plain text) from any URL. It supports advanced features like device emulat…