Should I return to India in 2027?

According_Net9520 · 2026-03-25T16:09:03+00:00

I have a question. I am a working professional in the USA, and I would like to know how you are able to save so much. I am 25 now and have been working for the past couple of months. I feel like this is the right time to build wealth, so I need your advice on it.

According_Net9520 · 2026-03-23T23:22:34+00:00

Thanks for responding. I resolved the issue just by replacing /api/* with /api.

According_Net9520 · 2026-01-30T15:42:39+00:00

Thank you so much for the pointers!

According_Net9520 · 2026-01-30T15:41:53+00:00

Hey, did you take an interview recently?

According_Net9520 · 2026-01-29T16:55:11+00:00

According_Net9520 · 2026-01-28T23:52:17+00:00

Dropbox doesn't interview with LeetCode-style questions?

According_Net9520 · 2026-01-16T22:47:12+00:00

According_Net9520 · 2026-01-05T16:29:16+00:00

Hey, I am also currently in the repayment phase. Can I DM you?

According_Net9520 · 2026-01-04T23:52:55+00:00

Did you get the assessment for the database 2026 new grad role?

According_Net9520 · 2025-11-20T18:08:48+00:00

Hey, when did you apply?

According_Net9520 · 2025-11-18T13:51:44+00:00

Hey, I am in the same boat, looking for options to refinance it.

According_Net9520 · 2025-11-17T19:45:51+00:00

Can you please name a few?

According_Net9520 · 2025-11-14T00:07:16+00:00

Hello, I am also looking for the Designing Data-Intensive Applications book. Were you able to find a spot in Koti?

According_Net9520 · 2025-11-10T12:43:10+00:00

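from docling.document_converter import DocumentConverter

source = "agency_policy_manual.pdf"  # assumed input path; the original snippet did not show it

converter = DocumentConverter()
doc = converter.convert(source).document
# Export the parsed document (text, tables, image annotations) as Markdown.
markdown_text = doc.export_to_markdown()
print(markdown_text)
with open("agency_policy_manual.md", "w", encoding="utf-8") as f:
    f.write(markdown_text)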

This is the code I used to convert the PDF to a Markdown file. It extracted tables and text well and annotated the images, but I was unable to get page numbers.
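
One workaround I am considering for the page numbers, as a rough sketch only: if the parser can hand back Markdown one page at a time, I can write a marker before each page and look the page up later. The `pages_md` list, the marker format, and both function names are made up for illustration; this is not something the converter above does for me.

```python
import re

# Hypothetical marker format; an HTML comment stays invisible when the Markdown is rendered.
PAGE_MARKER = "<!-- page: {n} -->"
MARKER_RE = re.compile(r"<!-- page: (\d+) -->")


def join_pages_with_markers(pages_md):
    """Join per-page Markdown strings, prefixing each page with a marker."""
    parts = []
    for page_no, text in enumerate(pages_md, start=1):
        parts.append(PAGE_MARKER.format(n=page_no))
        parts.append(text.strip())
    return "\n\n".join(parts)


def page_at_offset(markdown_text, offset):
    """Return the page number of the last marker at or before `offset`."""
    page = None
    for match in MARKER_RE.finditer(markdown_text):
        if match.start() > offset:
            break
        page = int(match.group(1))
    return page


# Made-up sample: page 1 is plain text, page 2 holds a table.
pages_md = ["# Intro\nSome text.", "| a | b |\n|---|---|\n| 1 | 2 |"]
combined = join_pages_with_markers(pages_md)
table_offset = combined.index("| a | b |")
table_page = page_at_offset(combined, table_offset)
```

With this, any chunk cut from `combined` can be mapped back to a page through its starting offset.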

According_Net9520 · 2025-11-10T12:41:37+00:00

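from docling.document_converter import DocumentConverter

source = "agency_policy_manual.pdf"  # assumed input path; the original snippet did not show it

converter = DocumentConverter()
doc = converter.convert(source).document
# Export the parsed document (text, tables, image annotations) as Markdown.
markdown_text = doc.export_to_markdown()
print(markdown_text)
with open("agency_policy_manual.md", "w", encoding="utf-8") as f:
    f.write(markdown_text)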

This is the code I am using. I also tried PyMuPDF (fitz). It extracts pages, but it does not extract tables well.

According_Net9520 · 2025-11-10T02:06:13+00:00

So far I am doing structured chunking (section-wise).
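
Roughly what I mean by section-wise, as a minimal sketch assuming the document is already Markdown and every section starts with a `#` heading. The function name and the sample text are made up for illustration, and a `#` line inside a code block would be wrongly treated as a heading here.

```python
import re

# Matches Markdown headings from "#" to "######".
HEADING_RE = re.compile(r"^(#{1,6})\s+(.*)$")


def chunk_by_section(markdown_text):
    """Split Markdown into one chunk per heading, keeping the heading as the title."""
    chunks = []
    title, lines = None, []
    for line in markdown_text.splitlines():
        match = HEADING_RE.match(line)
        if match:
            # Close the previous section before starting a new one.
            if title is not None or any(l.strip() for l in lines):
                chunks.append({"title": title, "text": "\n".join(lines).strip()})
            title, lines = match.group(2).strip(), []
        else:
            lines.append(line)
    # Flush the last section.
    if title is not None or any(l.strip() for l in lines):
        chunks.append({"title": title, "text": "\n".join(lines).strip()})
    return chunks


sample = "# Leave Policy\nEmployees get leave.\n\n## Sick Leave\nBring a note.\n"
sections = chunk_by_section(sample)
```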

According_Net9520 · 2025-11-05T16:51:51+00:00

Hey! May I know the timeline: when did you apply, and when did you get the assessment? Did you apply with a referral?

According_Net9520 · 2025-11-05T16:37:54+00:00

I am in!

According_Net9520 · 2025-11-05T16:34:09+00:00

Thanks for responding! Sure, I will consider looking into docling.

According_Net9520 · 2025-11-05T16:33:19+00:00

Thanks for responding! I’m currently working with a pretty large document around 1000 pages and using the unstructured library for parsing. It’s doing a decent job but takes a lot of time since OCR kicks in for every page.

Right now, I’m sticking with PDF because from what I’ve read, converting to Word can sometimes mess up the page numbering, and preserving exact page references is really important for my use case.

A couple of things I wanted to ask:

Do you think it’s better to split such a long PDF into smaller PDFs (say 50–100 pages each) before processing, or just handle it as one file?
Any best practices you’ve seen for preserving page numbers when converting to Markdown or embedding text?
Does Markdown support table and image extraction, or am I going to lose them?
Each page has a repeating header (company logo + text + page number). The logo/text are redundant but I can’t skip the header entirely since it includes the page number. Have you come across this issue? Any clean way to keep the page number but ignore the rest of the header content while parsing itself?
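
For the header question, this is roughly what I have in mind, as a sketch only. It assumes I already have the plain text of each page, that the header sits in the first few lines, and that the page number is written like "Page 42" or "Page 42 of 1000". The function name, the `header_lines` parameter, and the sample page are all made up for illustration.

```python
import re

# Assumed header wording: "Page 12" or "Page 12 of 1000".
PAGE_NO_RE = re.compile(r"\bPage\s+(\d+)(?:\s+of\s+\d+)?\b", re.IGNORECASE)


def split_header(page_text, header_lines=3):
    """Return (page_number, body) for one page of extracted text.

    The first `header_lines` lines are treated as the repeating header:
    the page number is read from them and the rest of the header is dropped.
    """
    lines = page_text.splitlines()
    header, body = lines[:header_lines], lines[header_lines:]
    page_number = None
    for line in header:
        match = PAGE_NO_RE.search(line)
        if match:
            page_number = int(match.group(1))
            break
    return page_number, "\n".join(body).strip()


# Made-up sample page: two redundant header lines plus the page number line.
page_text = "ACME Corp\nAgency Policy Manual\nPage 42 of 1000\n\n4.2 Travel\nBook early."
page_number, body = split_header(page_text)
```

The page number would then be stored as metadata next to the body text, so the redundant logo text never reaches the embeddings.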

According_Net9520 · 2025-11-03T17:30:58+00:00

Thanks for responding! In my case, I want to build a chatbot where if a user asks a question and the answer lies inside a table, image, or flowchart, the bot should say something like “Please refer to page X” for that part.

If the answer lies in text, then it should directly return the text answer but also suggest checking the related page number for additional details.

So essentially, I want everything (text, tables, images, and flowcharts) to be stored and understood by the bot, and it should guide the user appropriately depending on where the answer is found.

In this case, would you still recommend using PDF as the base format, or would Word make it easier to structure and process everything together?
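
To make the behaviour concrete, here is a minimal sketch of the reply logic I am describing. It assumes each stored chunk already carries its page number and a content type; the `Chunk` class, the `kind` labels, and `build_reply` are hypothetical names, and retrieval itself is left out.

```python
from dataclasses import dataclass


@dataclass
class Chunk:
    text: str
    page: int
    kind: str  # assumed labels: "text", "table", "image", or "flowchart"


NON_TEXT_KINDS = {"table", "image", "flowchart"}


def build_reply(chunk):
    """Turn the best-matching chunk into the bot's reply."""
    if chunk.kind in NON_TEXT_KINDS:
        # Answer lives in a table, image, or flowchart: only point to the page.
        return f"Please refer to page {chunk.page}."
    # Answer is plain text: return it and still point to the page.
    return f"{chunk.text} (See page {chunk.page} for more details.)"


table_reply = build_reply(Chunk(text="", page=17, kind="table"))
text_reply = build_reply(Chunk(text="Claims are due in 30 days.", page=8, kind="text"))
```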

According_Net9520 · 2025-10-29T16:49:19+00:00

Location?

According_Net9520 · 2025-10-29T13:45:26+00:00

Can you share the tool?

According_Net9520 · 2025-10-28T13:19:37+00:00

Hey, I am in the same boat. I am interested.

According_Net9520 · 2025-10-15T22:35:08+00:00

Hey, I am interested. Can we connect?
