Creating EPUB from Scanned PDF with MinerU and LLMs

As a book reader, I read over a hundred books each year and collect much more. The preferred format is absolutely EPUB, however, I can’t always get books in EPUB/MOBI especially for rare or old books. Usually, they are available in PDF if at all. Some of these PDFs are manually scanned in a barely readable condition. I wouldn’t blame on them since I’ve been doing that before and know that is not easy. What I need is a tool to convert the not so readable book into a readable one with OCR and LLM, that is MinerU. ...

September 16, 2025 · 4 min · Jun

My AI Learning Materials and News Feeds (Updated)

Forewords This is the 3rd time updating the lists, where I added a section of Prompt Engineering upon requests since it is a good starting point for those not into self-hosting nor theory. The term “Prompt Engineering” is a bit controversial but people will get better understanding after learning the listed materials. As a lifelong self-learner, I use all sorts of methods to learn new things, and AI is what I’m currently into. Although I’ve been using AI tools since 2022, my background wasn’t focused on AI. So, like everyone else, I had to do some “AI For Dummies” level study in order to get more involved. Below is a list of learning materials that I find very helpful for myself to get started with and might also be helpful for someone else in the same situation. ...

July 8, 2025 · 6 min · Jun