-
Content automation
20 Feb 2024Readmore -
Data integration
Airbyte is an open-source data integration platform and ELT tool designed to seamlessly integrate, transform, and load data. It offers reliable database and API replication at any scale, AI/LLM ready data, and the ability to embed connectors easily. Airbyte provides solutions for AI & LLMs, database replication, analytics, and embedding connectors. It supports various deployment models, including self-hosted, cloud, and hybrid, ensuring data security and governance.12 Apr 2025Readmore -
Web scraping
Reworkd is a platform that uses LLMs to extract web data at scale. It automatically generates and repairs Playwright scraping code for thousands of websites. Users provide feedback on issues, and Reworkd's AI instantly fixes them, eliminating the need to maintain scrapers manually. Reworkd automates the entire web data pipeline, from scanning websites to outputting data.20 Mar 2025Readmore -
AI
Ask On Data is an AI-powered, open source chat based ETL tool for data engineering. Via chat, it enables tasks like data migration, cleaning, and analysis, making it accessible for data scientists and engineers to enhance efficiency effortlessly. It is a Natural Language Processing (NLP) based GenAI powered data engineering tool with agentic capabilities, allowing users to harness the power of data without coding skills.07 Jan 2025Readmore