Midship (YC S24) just launched their document extraction platform that turns any unstructured document into clean, standardized business data— and exports to Excel, APIs, ERPs, and business systems. The problem: Professionals today waste 50% of their time on manual data entry. Here's how Midship solves it: • Extracts data from any document type • View all your extractions in Midship's structured data interface • Built-in audit interface to verify every data point • Export data where you need - your Excel templates, API integrations, etc. While others simply pull raw data from documents, Midship transforms extracted data into the format you use for work. Whether column names don't match, source documents show monthly values but you need annual amounts, or layouts are inconsistent - you'll get clean, standardized data. Try out their document extraction playground with your own documents at https://lnkd.in/gxapzNcn. Congrats Aahel Iyer, Kieran Taylor, and Max Maio on the launch! 🚀 https://lnkd.in/gc4vKvZu
Ivo Mbi Kubam this is a tool to keep in mind. Especially we we set up BI systems for local businesses in camer
I see nothing around ensuring data accuracy??? Teams dont have people or time to QA data. Extracted data is 100% useless if its not 100% accurate. Then how do you action the new data? In ERPs designed 20 years ago that are not built for 1000’s of data points from 1000’s of documents. Requires a purpose built front end. Nonetheless I applaud the Midship team for getting in the document extraction game as these are rough waters!
Congratulations on the launch, Midship team! It’s always great to see new energy in the document processing space. The focus on transforming unstructured data into standardized business-ready formats has been a core offering for many IDP vendors, including our solution at RaccoonDoc, mainly through advanced post-processing capabilities. While Midship’s features align with the market's current standards, I’m excited to see how you continue to innovate and address the complex and evolving needs of businesses in this domain. Welcome to the IDP ecosystem!
Really impressed by Midship's approach to solving the document extraction challenge! The focus on delivering standardized, business-ready data rather than just raw extraction is brilliant. As someone exploring AI applications, I'm particularly intrigued by how you handle data transformation and normalization. @Midship team, curious about your handling of edge cases - how does your system manage documents with non-standard formatting or when dealing with domain-specific jargon? The built-in audit interface suggests a thoughtful 'human-in-the-loop' approach to maintaining accuracy. The ability to map directly to existing Excel templates is a game-changer for adoption. Would love to learn more about the ML architecture behind your transformation pipeline! 📊🔄 #AI #DataAutomation
Max Maio Looks great, but I would like to see some more complex cases showcased in the demo... What about documents with multiple tables ? What happens when rows of tables are not one to one mapped ? When there are several tables in parallel ? Or When the document itself is in another language ? Is the solution able to handle those cases ? Real world documents have been difficult to digitise due to these reasons in the past...
I released a similar site 2 years ago 🙂 https://walheda.com I wish I had the chance to get investors, but I am not located at the right place it seems :)
Interesting, Mathis Gineste have a look ! Would it be sold also as a tech brick ?
Congrats Midship !!!!!
President @ Transcend Consulting Inc. | Where AI Innovation Meets Supply Chain Opportunity
4wCould this unstructured data then be dropped into something like Claude to create graphs, charts and reports? So awesome! 🚀