Tag: web-scraping
Zero-API-Key Web Search and Verification Tool
This utility provides source-backed web search, advanced page browsing, and evidence-aware claim verification without requiring API keys. It supports multi-engine SERP results and includes a Web Unlocker to access geo-blocked or rate-limite…
Source-backed web search and evidence checking tool
This tool provides robust, source-backed web search and evidence verification without requiring API keys. It supports multi-engine SERP retrieval, deep page browsing with unlocker capabilities, and structured claim reporting for factual gro…
OpenCLI: Turn Websites into Command Line Tools
OpenCLI transforms any website or Electron application into a functional command-line interface. It provides AI-powered discovery and interaction, allowing developers to automate complex web tasks using familiar CLI syntax while reusing exi…
Advanced B2B Account Qualification and Research
This agent qualifies B2B accounts by performing deep research across multiple vectors, including Apollo data, website scraping, job postings, and ad intelligence. It first runs a mandatory industry gate check before compiling structured dat…
Systematic Comparative Project Analysis Across Repositories
This skill systematically assesses a target project or concept by comparing it against the foundational context of all repositories loaded from the truth layer. It generates structured reports covering competitive positioning, potential par…
Open WebSearch: Advanced Live Web Retrieval Skill
This skill provides comprehensive, multi-source web retrieval, managing complex setup via local CLI/daemon or workspace MCP tools. It intelligently prioritises direct URL fetching, focused searches, and GitHub READMEs while adhering to stri…
Comparative project and market analysis skill
This skill performs systematic competitive and partnership analysis by comparing a target project or content resource against the foundational context of all internal repositories. It utilizes web scraping and structured frameworks to gener…
CLI for Advanced Browser Automation and Testing
This CLI enables agents to programmatically interact with websites using the Chrome DevTools Protocol (CDP). It supports full web automation, including element snapshotting, complex authentication flows, and robust command chaining for reli…
Headlessly analyze Skool communities for revenue signals
This skill allows for the headless reading and analysis of Skool communities, enabling the discovery of customer pain points and potential acquisition leads. It uses direct HTTP reads via the MCP connector, ensuring the user's active browse…
Fast Library for Parsing HTML and XML
This library provides a fast and elegant solution for parsing and manipulating complex HTML and XML structures. It allows developers to easily traverse and modify the Document Object Model (DOM) within a Node.js environment.
Summarise content from URLs, videos, and files
This CLI utility provides comprehensive summarisation and transcription capabilities across multiple formats, including web URLs, local documents, and YouTube videos. It supports various output lengths and includes flags for raw extraction …
Securely fetch and clean web content for LLMs
This utility fetches URLs, providing clean, markdown-formatted content alongside structured metadata and external links. It includes advanced injection safety scanning and handles common web obstacles like paywalls and bot blocks.
Neural Web Search and Research Tool
A neural web search tool providing semantic discovery, cited answer generation, and web content extraction. It supports domain-filtered searches, similarity discovery, and fetching full text from URLs for deep research.
Web Scraping via MCP Server
Extracts clean Markdown content, links, and metadata from URLs using Mozilla Readability. It is optimised for server-rendered pages but does not execute JavaScript.
GPT Researcher Autonomous Deep Research Agent
An autonomous agent that performs deep web and local research to generate detailed, cited reports using a planner-executor-publisher architecture. It supports custom retrievers, MCP data sources, and parallelised agent execution.
WebPeel: Clean Web Content Extraction
Extracts clean, structured markdown from any URL while significantly reducing token usage for LLM agents. It handles JavaScript rendering, bot protection, and provides specialised extractors for platforms like YouTube and GitHub.
WebPeel Web Fetching and Search Tool
A web fetching utility that converts web pages into clean, token-efficient markdown for AI agents. It features smart escalation from HTTP requests to headless browser rendering and includes DuckDuckGo search capabilities.
Playwright Browser Automation and Screenshotting
Provides capabilities for automating a real browser instance to capture screenshots and interact with web content from the terminal.
AI-Powered Browser Automation CLI
An agentic CLI tool for automating complex web workflows, including form filling, data extraction, and navigating dynamic content. It supports both deterministic Playwright actions and autonomous AI-driven exploration.
AI-powered browser automation and extraction
An AI-driven engine that uses LLMs and computer vision to automate web workflows, extract structured data, and manage browser sessions. It integrates via Python and TypeScript SDKs, a REST API, and an MCP server.
web screenshot and content extraction tool
This tool provides programmatic access to web rendering services, enabling agents to capture pixel-perfect screenshots or extract structured content (Markdown, HTML, plain text) from any URL. It supports advanced features like device emulat…