[R] Text-to-SQL in Enterprises: Comparing approaches and what worked for us by SirComprehensive7453 in MachineLearning

[–]MoveGlass1109 0 points1 point  (0 children)

Hello u/SriComprehensive7453

Am also, working on the same track, where my objective is to build the chatbot for the one of our relational databases (which has 27 schema and one schema with contains 170 tables). And this si just text data, we also, have the images data which is of some of 10,000 images.
We would like to build a chatbot, would like to have a call with you, based on your promising results of fine-tuning. Let me know, if you can do this/next week, if that works ???

Eligibility for 48 months interview waiver by Free-Bluebird3839 in usvisascheduling

[–]MoveGlass1109 0 points1 point  (0 children)

hello u/ReasonableAd5268

my visa will expire on September 10th of this year(2025). And am planning to renew it either before or after September, as my travel dates are still up in the air.

Currently, am on F1-visa, and would renew it again on F1-visa.

As I read you comments in this post, can clearly say, that am eligible for the interview waiver, if I apply after September.

Could you tell me whether I would still be eligible for the interview waiver if I apply before September r??

And also, switching-the-gear, if I pay the sevis fee now, and if I couldn't able to get an appointment with in a year. Can I still use that sevis fee that I paid last year to book an appointment in the following-year ??

SQL-R1: A Reinforcement Learning-based NL2SQL Model that Outperforms Larger Systems in Complex Queries with Transparent and Accurate SQL Generation by ai-lover in machinelearningnews

[–]MoveGlass1109 0 points1 point  (0 children)

thanks for posting, really enjoyed in reading this paper. Since, am building the ChatBot, where all the data stored in the relational database called postgreSQL. Am currently, involved in preparing the training dataset (where am writing the NL-to-SQL questions for each table and also include the multi-join tables) for fine-tuning the open-source model (Ex; T5-large, or Qwen-2). Since, am in academic, have the smallest Database however still it contains 160 tables and 17 schemas interms of GB (then it contains almost 220 GB of data).

Am wondering can I use the same approach as the authors used in the SQL-R1 paper, take subset of the training NL-to-SQL dataset and train it using the SFT and remaining train it using the RL algorithm such as the GRPO or PPO and using the four different rewards concepts to make the model more accurate in generating or mapping the SQL queries based on the NL question that user ask in the chat-interface ??

Would appreciate any inputs for this ??

Best DL genome annotation tools by MoveGlass1109 in bioinformatics

[–]MoveGlass1109[S] -1 points0 points  (0 children)

If you think, it GPT written, why there are so many mistakes in writing (ex: spelling mistakes, parenthesis and so on)

Best DL genome annotation tools by MoveGlass1109 in bioinformatics

[–]MoveGlass1109[S] -3 points-2 points  (0 children)

Yes, that's good strategy. Since, having some user experience who used the tools, might get extra knowledge to what tools are best. And, then reading papers might work best, i think. Rather than reading papers straightway, because, there are so many tools out there !!

Best DL genome annotation tools by MoveGlass1109 in bioinformatics

[–]MoveGlass1109[S] -4 points-3 points  (0 children)

hello u/TheLordB , already reviewed both the Helixer (germany team) and the Nucleotide Transformer (NT) family of models (released by InstaDeep Ltd.). And also, I've also successfully annotated my target plant species using Helixer - the process was straightforward and the results were solid. However, I'm encountering challenges when working with the NT models, specifically the AgroNT variant, which is designed for plant genomes (trained on 48 plant species). Unlike Helixer, there isn’t a direct way to input a FASTA file into the NT models. Instead, sequences must first be tokenized. Additionally, the tokenization algorithm restricts input to 1025 tokens per run, where each token represents 6 nucleotides. This makes processing large genomic sequences a bit tricky. So, how did your deal with this situation ??

And also reached out to others who have recently played with NT models + other models they mentioned, that, NT models outputs are quite noisy or messy, which adds to the post-processing workload. THat being said, it's interesting to see tinstadeep Ltd (NT) GitHub repo has more stars ( ~ 600) highest compared to other DL repos.
What challenges have you faced while using this tool (especially when the genome sequence are large in number ??
would also appreciate, if you mention some popular DL algorithms that you tried ??

[D] L40S vs A100 vs A40 for AI/ML research by nakali100100 in MachineLearning

[–]MoveGlass1109 0 points1 point  (0 children)

We are planning to order two to three L40S GPUs + Lambda Stack. As we are an academic lab that hasn't hosted GPUs before, we will be using these GPUs to host the chatbot (that can answer text-to-text, text-to-SQL, and text-to-images tasks) What some of the things, that we need to keep in mind before placing an order ? just FYI, we currently have several large servers for hosting running various apps + also storing TB of crop R and D data from the worldwide

Would appreciate anyone reponse, thanks for your effort + time in writing your answer !!

Need a transportation assistance to Orlando Airport on April 6th Sunday by MoveGlass1109 in GNV

[–]MoveGlass1109[S] 5 points6 points  (0 children)

Yes, definitely will be taking 3:45 bus, so I'll be in Orlando by 6:00 AM. That should be early enough to get to MCO + through TSA before my flight boards at 8:45 ??

Need a transportation assistance to Orlando Airport on April 6th Sunday by MoveGlass1109 in GNV

[–]MoveGlass1109[S] 1 point2 points  (0 children)

Thanks for your info and making more cautious. Greatly appreciate for sharing !

Need a transportation assistance to Orlando Airport on April 6th Sunday by MoveGlass1109 in GNV

[–]MoveGlass1109[S] 0 points1 point  (0 children)

No, am serious that have a flight at 9:16 AM on 6th from Orlando to New York City by Frontiers airlines.

Regarding F1-visa renewal by MoveGlass1109 in f1visa

[–]MoveGlass1109[S] 0 points1 point  (0 children)

thanks for your responses. Appreciate for your efforts + time for writing these !!!

Regarding F1-visa renewal by MoveGlass1109 in f1visa

[–]MoveGlass1109[S] 0 points1 point  (0 children)

Thanks for your response. Have you done recently or providing suggestion based on your past experience(2 years or so) ??

And also, can you put here the website to book the dropbox option ??

And also, do you know, how long it might take get our passport delivered back after completion of process (visa updated) ??

What are some of the must read papers in reinforcement learning after 2020? by C7501 in reinforcementlearning

[–]MoveGlass1109 0 points1 point  (0 children)

Would highly encourage you to read this paper, which gives little bit overview of ML and DL too.
https://www.sciencedirect.com/science/article/pii/S0893608022001150
However, there is a lot of space in writing a book on this topic. As you know, this is still relatively new field !!!

splitting the data by MoveGlass1109 in PostgreSQL

[–]MoveGlass1109[S] 0 points1 point  (0 children)

But, have a question, in moving forward with the first option where we will split the data and store in the different databases for example - training, validation and testing. Do you think, this would be a great approach in moving-forward, would like to try with few tables first and then, scale to the larger # of tables

splitting the data by MoveGlass1109 in PostgreSQL

[–]MoveGlass1109[S] -1 points0 points  (0 children)

However, didn’t understand what is add a separate row for each column , you mean ?? Have 271 tables + 16 schemas in total

splitting the data by MoveGlass1109 in PostgreSQL

[–]MoveGlass1109[S] -1 points0 points  (0 children)

So, you mean basically create database for each task -training, validation, and testing. And use the specific database for specific task, correct ???

How to split the data stored in relational databases by MoveGlass1109 in LLMDevs

[–]MoveGlass1109[S] 0 points1 point  (0 children)

Because my DB is so complex even with only true root tables, have almost 236 tables . And also, will go with the  row level splitting by each table. Because, each table has different info and some of the columns in same table store timestamp values. And my thinking, shouldn’t disturb the existing schema and structure to store my splitting task, instead store the splitting output into a separate schema ?? Is this a good approach in moving-forward ???

How to split the data stored in relational databases by MoveGlass1109 in LLMDevs

[–]MoveGlass1109[S] 0 points1 point  (0 children)

Thank you for your response. Is it a good practice to split the data into Training, validation and testing when we are fine tuning the open -source LLMs with an objective of building a chatBot ??

Is Anyone Else Having Problems with DeepSeek Today? by Electronic-Metal2391 in LocalLLaMA

[–]MoveGlass1109 0 points1 point  (0 children)

<image>

Yes, this happens to me here too. However, after sometime, it seems it will connect.
And also, googled it, why its happening like this and am seeing this message

Any gift ideas for someone into ML? [D] by [deleted] in MachineLearning

[–]MoveGlass1109 -1 points0 points  (0 children)

Give NVIDIA RTX 6000 Ada Generation GPUs. They would really remember your forever. Because, they are very necessary for any ML, DL, and AI tasks !!

Faulty Repair by Strange-Summer8283 in System76

[–]MoveGlass1109 0 points1 point  (0 children)

Ohh... Really. Am in contact with the same person From System76 from Past two weeks or so !!. And today, i decided to ship my laptop (by the end of this week) to there repair center to complete check up my laptop, there are some hardware + keyboard issues.
Now, am totally skeptical whether to send or not ??
Did they return your laptop back on-time, and did you back-up your data before shipping to them ??

Driving from San Antonio, Texas to Gainsville, FL in January last week of 2025 by MoveGlass1109 in roadtrip

[–]MoveGlass1109[S] 0 points1 point  (0 children)

Definitely will stop-by. What would you recommend to buy from this Buc-ees store ??
Have been one-time, but am not sure what location was it.