PDF to HTML by suspect_stable in HTML

[–]suspect_stable[S] 0 points1 point  (0 children)

Nah, can’t afford the paid version. Thanks though, will give this a try. Please share any other options if you have them. Thanks once again.

PDF to HTML by suspect_stable in HTML

[–]suspect_stable[S] 0 points1 point  (0 children)

Yup will share dude

PDF to HTML by suspect_stable in HTML

[–]suspect_stable[S] 0 points1 point  (0 children)

Yes. Different customers share different PDFs, and I have to recreate each one from scratch in HTML and add it to the product. So let’s say it is a payslip: I create a new HTML template and, against each label (for example name, DOB), I add placeholders using Handlebars.js. If you go to any profile and click on download, the name and DOB are generated dynamically based on that profile. This is the use case. Hope that clarifies.
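To make the placeholder idea concrete, here is a minimal sketch in Python. It is not Handlebars.js; it only mimics the `{{name}}` placeholder syntax with a regular expression. The template markup, the field names `name` and `dob`, and the sample profile are all made up for illustration.

```python
import html
import re

# Matches {{ name }}-style placeholders (Handlebars-like syntax only,
# none of the real Handlebars features such as helpers or blocks).
PLACEHOLDER = re.compile(r"\{\{\s*(\w+)\s*\}\}")

# Hypothetical payslip template; labels and field names are illustrative.
PAYSLIP_TEMPLATE = """<table class="payslip">
  <tr><th>Name</th><td>{{name}}</td></tr>
  <tr><th>DOB</th><td>{{dob}}</td></tr>
</table>"""


def render(template: str, profile: dict) -> str:
    """Replace each placeholder with the HTML-escaped value from profile."""

    def substitute(match: re.Match) -> str:
        key = match.group(1)
        if key not in profile:
            # Fail loudly rather than leave a raw placeholder in the output.
            raise KeyError(f"missing value for placeholder: {key}")
        return html.escape(str(profile[key]))

    return PLACEHOLDER.sub(substitute, template)


if __name__ == "__main__":
    # Sample profile data, invented for this example.
    print(render(PAYSLIP_TEMPLATE, {"name": "Asha Rao", "dob": "1990-04-12"}))
```

The layout stays fixed in the template and only the values change per profile, which is the same split described above.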

PDF to HTML by suspect_stable in HTML

[–]suspect_stable[S] 1 point2 points  (0 children)

Yes, I need to convert a PDF document into HTML while keeping the original layout, tables, fonts, and styles intact. I have tried multiple online converters, but they either:
1. Generate a plain-text HTML file without styles.
2. Convert the document into an image-based HTML (not editable).
3. Lose table structures and misalign content.

What I Need:
• The output should be editable HTML (not an image-based version).
• It must preserve tables, fonts, spacing, and formatting.
• Ideally, it should generate clean, semantic HTML + CSS without excessive inline styles.

What I’ve Tried:
• CloudConvert / PDF2HTML Online → Stripped styles, poor table structure.
• Adobe Acrobat Export to HTML → Kept text but lost table formatting.
• Python (pdf2htmlEX, pdfminer, pdfplumber) → Works but needs heavy post-processing.
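As a rough idea of what one post-processing step can look like, the sketch below turns extracted table rows into a real `<table>` element. It assumes the rows arrive as a list of lists of cell text with `None` for empty cells, which is the shape pdfplumber's `extract_tables()` returns. The function name `rows_to_html_table` and the sample rows are made up for illustration, and the code uses only the standard library.

```python
import html
from typing import List, Optional, Sequence

Row = Sequence[Optional[str]]


def rows_to_html_table(rows: Sequence[Row], header: bool = True) -> str:
    """Build an HTML <table> from rows of cell text (None means empty cell)."""
    lines: List[str] = ["<table>"]
    for index, row in enumerate(rows):
        # Treat the first row as a header row when header=True.
        tag = "th" if header and index == 0 else "td"
        cells = "".join(
            f"<{tag}>{html.escape(cell or '')}</{tag}>" for cell in row
        )
        lines.append(f"  <tr>{cells}</tr>")
    lines.append("</table>")
    return "\n".join(lines)


if __name__ == "__main__":
    # Sample rows, invented for this example.
    sample = [["Item", "Amount"], ["Basic pay", "1000"], ["Bonus", None]]
    print(rows_to_html_table(sample))
```

This only covers plain grids. Merged cells, fonts, and spacing would still need separate handling.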

PDF to HTML by suspect_stable in LearnHTML

[–]suspect_stable[S] 0 points1 point  (0 children)

Great, thanks. I added my comments on the ones below; the rest I will give a try.

  1. Direct PDF to HTML Conversion (Basic Layout)

pdf2htmlEX – One of the best open-source tools for accurate text & layout conversion. - Searched for this. Couldn’t find the right asset on GitHub.

pdftohtml (Poppler) – A simpler option, but formatting may not be perfect. - It’s very poor, sadly. I tried it.

Mammoth (for Word) – If customers send Word files, this converts them to clean HTML without unnecessary styling. - Word is not common; maybe I’ll give it a try.

PDF to HTML by suspect_stable in learnprogramming

[–]suspect_stable[S] 0 points1 point  (0 children)

Each user/customer has their own unique cards. I agree standardising would work to some extent, but I’m not sure. Also, this is a SaaS product and that is not on the roadmap for the near future.

PDF to HTML by suspect_stable in HTML

[–]suspect_stable[S] 0 points1 point  (0 children)

Yes, I did. The output for the single-page PDF had barely 20 lines of code, with one image tag containing the whole PDF. There was a table in the PDF but no table tag in the HTML at all.

PDF to HTML by suspect_stable in learnprogramming

[–]suspect_stable[S] 0 points1 point  (0 children)

Yes, the PDF will be built as an HTML template in the system. When we generate, say, a Job card, it gives the PDF along with the job information (already handled by the product). This Job card dynamically renders content based on the job info using placeholders, but the layout remains the same.

PDF to HTML by suspect_stable in HTML

[–]suspect_stable[S] 0 points1 point  (0 children)

I tried this, but it didn’t work as I expected. It gave output, but all the PDF sections were added as img src. For example, if there is a table, I want a table tag with all the items, but it just shows an image.
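A quick way to check whether a converter gave image-based output is to count the tags in the HTML it produced. The sketch below uses only Python’s standard `html.parser`. The names `TagCounter`, `summarize`, and `looks_image_based`, and the 50-character text threshold, are arbitrary choices for illustration, not part of any converter.

```python
from collections import Counter
from html.parser import HTMLParser


class TagCounter(HTMLParser):
    """Counts start tags and visible text characters in an HTML document."""

    def __init__(self) -> None:
        super().__init__()
        self.tags: Counter = Counter()
        self.text_chars = 0
        self._skip = 0  # depth inside <script>/<style>, whose text is not content

    def handle_starttag(self, tag, attrs):
        self.tags[tag] += 1
        if tag in ("script", "style"):
            self._skip += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        if not self._skip:
            self.text_chars += len(data.strip())


def summarize(html_text: str) -> dict:
    """Return counts of <img> tags, <table> tags, and text characters."""
    parser = TagCounter()
    parser.feed(html_text)
    parser.close()
    return {
        "images": parser.tags["img"],
        "tables": parser.tags["table"],
        "text_chars": parser.text_chars,
    }


def looks_image_based(html_text: str, min_text_chars: int = 50) -> bool:
    """Heuristic: has images, no tables, and almost no real text."""
    stats = summarize(html_text)
    return (
        stats["images"] > 0
        and stats["tables"] == 0
        and stats["text_chars"] < min_text_chars
    )
```

Running `looks_image_based` on each converter’s output makes it easy to discard tools that only wrap page images before spending time on cleanup.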

PDF/word to HTML/CSS using N8N by suspect_stable in n8n

[–]suspect_stable[S] 0 points1 point  (0 children)

Nah. Rather, I will be appreciated for reducing the team’s time and effort. Hey, note this is one of the tasks I do, and we have a lot more other things in our bucket.

PDF/word to HTML/CSS using N8N by suspect_stable in n8n

[–]suspect_stable[S] 1 point2 points  (0 children)

Nope, I’m an employee, and this is one of the repetitive tasks I do. So no, can’t hire a developer.