3 comments

  • eigenvalue 7 days ago ago

    For a real-world test of this, I just tried it with this document featured in this other submission today about OCR:

    https://news.ycombinator.com/item?id=42443022

    My message from that discussion:

    Out of curiosity, I tried submitting the first 200 pages of the PDF he used to my new tool that I also submitted today to Show HN, ( fixmydocuments.com ), and it generated the following without any further interaction besides submitting the PDF file:

    https://fixmydocuments.com/api/hosted/m-moires-de-saint-simo...

    I think it's not a bad result, and any minor imperfections could be revised easily in the markdown. My feature to turn the document into presentation slides got a bit confused because of the French language, so some slides ended up getting translated into English. But again, it wouldn't be hard to revise the slide contents using ChatGPT or Claude to make them all either French or English:

    https://fixmydocuments.com/api/hosted/m-moires-de-saint-simo...

  • roh26it 7 days ago ago

    Looks solid, going to try it out.

    I'm going to be __that guy__, but just ask - is the functionality set similar to llamaparse or is this llamaparse + llm?

    • eigenvalue 7 days ago ago

      This does a lot more… the optimization state is incredibly elaborate. And some of the derived document type generation is complex enough to be a standalone app in my opinion. It’s really a suite of tools for generating new types of documents from your original document without any additional input required from the user besides the document itself.