Show HN: DeepTable – an API that converts messy Excel files into structured data
Solves the 'harder, more general problem' of understanding the semantic structure of real-world spreadsheets, where LLMs fail on complex workbooks at scale.
View Origin LinkProduct Positioning & Context
AI Executive Synthesis
Solves the 'harder, more general problem' of understanding the semantic structure of real-world spreadsheets, where LLMs fail on complex workbooks at scale.
DeepTable addresses a pervasive and costly data ingestion problem for enterprises: transforming complex, unstructured Excel data into usable, structured formats. The explicit mention of LLMs failing on 'complex workbooks at scale' highlights a significant gap this solution aims to fill. The ability to convert messy spreadsheets into 'SQL-ready relational tables with full cell-level provenance' is a critical value proposition for data engineering, analytics, and compliance. This directly impacts data quality, automation, and operational efficiency, reducing manual data cleaning efforts. Market implications are substantial for any organization relying on Excel for data exchange or storage, particularly in finance, supply chain, or operations. This product targets a fundamental data integration challenge, offering a robust, scalable solution where existing methods fall short. The API model facilitates seamless integration into existing data pipelines.
We tried to build an Excel error checker. To achieve that, we needed to actually understand the semantic structure of a spreadsheet first. So we built that, and it turned out to be the harder, more general problem.The core issue: most real-world spreadsheets aren't relational tables. Merged cells, multi-level headers, multiple tables per sheet, totals mixed in with data. You can't just dump them to CSV and call it done. LLMs handle the easy cases but fall apart on complex workbooks at scale.Our approach uses an agent-guided compilation pipeline that produces SQL-ready relational tables with full cell-level provenance. This demo visualizes what we do: https://storage.googleapis.com/deeptable-public/deeptable_an...We have a handful of early customers but honestly don't know yet whether this is a real market or a niche problem. We're posting this to hear from people who've dealt with arbitrary spreadsheet ingestion. Whether you solved it, gave up, or are still living with the pain.If you want to try it on your own files, email me (see my profile for my email) and I'll give you API access.
Community Voice & Feedback
No active discussions extracted yet.
Related Early-Stage Discoveries
Discovery Source
Hacker News Aggregated via automated community intelligence tracking.
Tech Stack Dependencies
No direct open-source NPM package mentions detected in the product documentation.
Media Tractions & Mentions
No mainstream media stories specifically mentioning this product name have been intercepted yet.
Deep Research & Science
No direct peer-reviewed scientific literature matched with this product's architecture.
Market Trends