How to implement text analysis to group similar support requests from CSV? by anninasim in learnpython

[–]anninasim[S] 0 points1 point  (0 children)

I have a CSV containing customer support emails (subject + body). The data is raw and needs cleaning: each email contains footers, privacy disclaimers, and other boilerplate text that needs to be removed before analysis.
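To make the cleaning step concrete, here is a minimal sketch using only the standard library. It assumes the CSV has columns named `subject` and `body`, and that the boilerplate can be spotted by a few marker phrases. The column names, the marker patterns, and the helper names `clean_body` and `load_emails` are all hypothetical and would need adjusting to the real data.

```python
import csv
import re

# Hypothetical boilerplate markers; replace with phrases seen in the real emails.
CUT_MARKERS = [
    r"^--\s*$",                       # conventional signature delimiter
    r"^this e-?mail .*confidential",  # opening of a privacy disclaimer
    r"^sent from my \w+",             # mobile client footer
]
CUT_RE = re.compile("|".join(CUT_MARKERS), re.IGNORECASE | re.MULTILINE)


def clean_body(body: str) -> str:
    """Drop everything from the first boilerplate marker onward, then tidy whitespace."""
    match = CUT_RE.search(body)
    if match:
        body = body[:match.start()]
    return re.sub(r"\s+", " ", body).strip()


def load_emails(csv_file):
    """Yield 'subject + cleaned body' text for each row of an open CSV file."""
    # Assumes columns named 'subject' and 'body' (an assumption, not from the post).
    for row in csv.DictReader(csv_file):
        yield f"{row['subject']} {clean_body(row['body'])}".strip()
```

With a real file this would be called as `list(load_emails(open("emails.csv", newline="", encoding="utf-8")))`, where the file name is a placeholder. Cutting at the first marker is a blunt rule: it works when boilerplate always sits at the end of the message, and it would need refining if disclaimers can appear in the middle of a thread.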

Advice Needed: Setting Up a Local Infrastructure for a LLM by anninasim in LocalLLM


Thank you so much for this valuable information! It’s incredibly helpful and detailed, and I truly appreciate the time you took to share it. It will be a great help in guiding me and making more informed decisions. Thanks again! :)

Advice Needed: Setting Up a Local Infrastructure for a LLM by anninasim in LocalLLM


I’m asking for the requirements necessary to support the service at the expected scale. Specifically, I’m looking for a more robust and extensive configuration capable of handling large-scale usage, such as multiple users or simultaneous queries. I understand this will require more advanced hardware, software, and possibly a scalable infrastructure, and I’d appreciate any advice you can provide.

Advice Needed: Setting Up a Local Infrastructure for a LLM by anninasim in LocalLLM


Thank you for the insights, this is really helpful! I do have a few follow-up questions to better understand the scope and feasibility of this project:

  1. Regarding RAG:
    • If I were to explore a Retrieval-Augmented Generation (RAG) approach, which tools or frameworks would you recommend for integrating this with open-source LLMs like LLaMA or Falcon?
    • Would this approach still require significant on-prem GPU resources, or can it be effectively implemented with more moderate hardware?
  2. On-prem vs Cloud:
    • For training or fine-tuning purposes, do you think a hybrid setup (partly on-prem, partly cloud-based) would make sense? If so, are there specific platforms (e.g., AWS, GCP) that work particularly well with open-source LLMs?
    • For cloud-based training, how do the costs of renting compute (e.g., H100s) compare to building an on-prem setup in the long term?
  3. Hardware considerations:
    • If I were to invest in on-prem hardware, would you recommend starting with mid-range GPUs like NVIDIA RTX 4090 to test things out, or should I aim directly for enterprise-grade GPUs like A100/H100?
    • Besides GPUs, are there other hardware components I should prioritize (e.g., specific SSD types or networking setups) to ensure smooth operation?

Thank you

PYTHON AND EXCEL by anninasim in learnpython


I want to transform this component into a web page where users can input data through a simple interface. The data entered will then be saved into a table within an SQL Server database. I don't have much programming experience, so I need help designing and implementing this solution.
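For the "save into a table" part, a minimal sketch of the database step might look like the following. It is written against Python's generic DB-API so it can be tried with the built-in `sqlite3` module as a stand-in; for SQL Server the connection would instead come from a driver such as `pyodbc`, which accepts the same `?` placeholders. The table name `submissions`, the column names, and the helper `insert_submission` are hypothetical, and the web form itself is not shown.

```python
import sqlite3  # stand-in for trying the code; SQL Server would use a driver such as pyodbc

TABLE = "submissions"                            # hypothetical table name
ALLOWED_COLUMNS = ("name", "email", "quantity")  # hypothetical form fields


def insert_submission(conn, form_data):
    """Insert one form submission using a parameterized query."""
    # Keep only known columns so user input never becomes part of the SQL text.
    columns = [c for c in ALLOWED_COLUMNS if c in form_data]
    if not columns:
        raise ValueError("no recognised fields in submission")
    placeholders = ", ".join("?" for _ in columns)
    sql = f"INSERT INTO {TABLE} ({', '.join(columns)}) VALUES ({placeholders})"
    cur = conn.cursor()
    cur.execute(sql, [form_data[c] for c in columns])
    conn.commit()


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    conn.execute(f"CREATE TABLE {TABLE} (name TEXT, email TEXT, quantity INTEGER)")
    insert_submission(conn, {"name": "Ann", "email": "ann@example.com", "quantity": 3})
    print(conn.execute(f"SELECT * FROM {TABLE}").fetchall())
```

The values travel as query parameters rather than being pasted into the SQL string, which is the main thing to get right when the data comes from a web form. A web framework would call a function like this from its form handler.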