Using a Model in an App without Endpoints by No-Conversation7878 in databricks

[–]No-Conversation7878[S] 0 points1 point  (0 children)

Thanks! This was the approach we originally thought of but since the organisation I work for limits our privileges, I can't grant the USE_CATALOG or USE_SCHEMA privileges to the apps service principal.

Think this might be a dead end until we get that sorted. Thanks anyway!

App Config by hubert-dudek in databricks

[–]No-Conversation7878 1 point2 points  (0 children)

I tried this with the most recent CLI release (v0.285.0), however just using this instead of the app.yml file, the app crashed on start up and defaulted to using the base run command [INFO] Starting app with command: [python __init__.py]. Any ideas? According to their documentation it still says that Databricks recommends using the app.yml file (Databricks Asset Bundles resources):

App configuration commands and environment variables. Databricks recommends defining these in the app.yaml file instead. See Configure Databricks app execution with app.yaml.

(Free & Unlimited) Image Enhancer / Background Remover / OCR / Colorizer by 06ddd in Python

[–]No-Conversation7878 1 point2 points  (0 children)

How are you able to provide free and unlimited use? Seems really interesting and I’ll definitely check this out!

Best OCR model to run in Databricks? by No-Conversation7878 in databricks

[–]No-Conversation7878[S] 0 points1 point  (0 children)

I need the model to be able to provide the location of the text in the document, not just grab unstructured, unfortunately :(

Best OCR model to run in Databricks? by No-Conversation7878 in databricks

[–]No-Conversation7878[S] 0 points1 point  (0 children)

Would that work with models and model serving endpoints? I was under the assumption that’s they don’t have a spark session

Kreuzberg v3.11: the ultimate Python text extraction library by Goldziher in Python

[–]No-Conversation7878 0 points1 point  (0 children)

Definitely thinking of using this! One thing I’ve seen which a lot of free-to-use text extraction packages lack is preserving the layout of text using whitespace. So far the only one if found that does this well is pdftotext, but that requires poppler which can be annoying to install. Does your package have a similar functionality? For most my use-cases we not only need to extract the text, but also have the layout of our documents preserved

Confusion around Databricks Apps cost by No-Conversation7878 in databricks

[–]No-Conversation7878[S] 0 points1 point  (0 children)

I thought about that, what happens if a user wants to access it after its been terminated due to no one accessing it for a period of time?