Tag: data-ingestion

skill ★ 7,851

Manage and structure scientific research experiments

This skill sets up the foundational structure for a research experiment: directory setup, baseline file copying, and ingestion of diverse research sources. It manages all necessary bookkeeping by initializing and updating central …

Upsonic/Upsonic experiment-management workflow setup data-ingestion

tool ★ 13,608

Build AI skills from diverse knowledge sources

This tool converts diverse knowledge sources (documentation, GitHub repositories, PDFs, and videos) into structured, AI-ready skills. It provides a comprehensive workflow for scraping, enhancing, packaging, and …

yusufkaraaslan/Skill_Seekers skill-building knowledge-extraction data-ingestion vector-database

skill ★ 23

Consolidate contacts from multiple data sources

This skill aggregates contact information from diverse sources, including email, calendars, vCards, and LinkedIn exports. It deduplicates records and stores a unified, persistent contact graph in memory.

markmhendrickson/neotoma contact-management data-consolidation deduplication data-ingestion

tool ★ 111

Comprehensive Ontology Engineering and Graph Management

This tool suite covers the entire ontology lifecycle, letting developers generate, validate, and govern RDF/OWL knowledge graphs. It supports advanced data ingestion from structured sources (SQL, CSV, JSON) and provid…

fabio-rovai/open-ontologies ontology rdf owl knowledge-graph

tool ★ 20

Local Agent-BOM Inventory Validation and Ingestion

This tool validates and processes canonical agent-bom inventory JSON that has been pre-collected from sources such as CMDBs or cloud endpoints. It allows developers to perform local scanning, generate findings, and export structured …

msaad00/agent-bom agent-bom inventory-management sbom validation

skill ★ 27,411

ClickHouse Schema, Query, and Data Best Practices Review

This skill provides comprehensive, rule-based guidance for optimizing ClickHouse database interactions. It enforces best practices across schema design, query writing, and data ingestion, ensuring all recommendations are cited against speci…

langfuse/langfuse clickhouse database-optimization schema-design query-tuning