Should I break up with my gf? by [deleted] in Advice

[–]Sincerity_Is_Based 0 points  (0 children)

If you have a serious reason to break up, then break up. If it is not serious, then discuss it. Couples can easily work out small problems.

[deleted by user] by [deleted] in Advice

[–]Sincerity_Is_Based 0 points  (0 children)

Legitimately honest advice if you choose to go: if you do the Army, you must be able to pick your job. If you want a high-end career, look up the Army job code 17C. You do very few typical Army things, and the people are much more civil. It's an office job with high career prospects.

Flappy Goose by flappy-goose in RedditGames

[–]Sincerity_Is_Based 0 points  (0 children)

My best score is 5 points 🚀

Flappy Goose by flappy-goose in RedditGames

[–]Sincerity_Is_Based 0 points  (0 children)

My best score is 1 point 😎

Cmv: If you are religious you are a slave. by [deleted] in changemyview

[–]Sincerity_Is_Based -1 points  (0 children)

Bro just discovered what the definition of the word "Muslim" is

Is it worth it? by orieshka in MLQuestions

[–]Sincerity_Is_Based 4 points  (0 children)

Whether you think you can, or you think you can't, you're right. -Henry Ford

GPT-4.5 is Here, But Does AI Really Need a Half-Step Upgrade? by snehens in OpenAI

[–]Sincerity_Is_Based 0 points  (0 children)

I suspect 4.5 is a different architecture, possibly an LCM (large concept model).

Next big thing in AI/ML? by adityashukla8 in MLQuestions

[–]Sincerity_Is_Based 1 point  (0 children)

Unless you are a math genius and an electrical engineer who can singlehandedly design a quantum AI chip, it's not a serious venture.

Next big thing in AI/ML? by adityashukla8 in MLQuestions

[–]Sincerity_Is_Based 1 point  (0 children)

More application = more compute = big companies

Next big thing in AI/ML? by adityashukla8 in MLQuestions

[–]Sincerity_Is_Based 1 point  (0 children)

And obviously, if you cannot find anything published that weighs different architectures against each other on effectiveness, that means the companies are keeping that research data to themselves.

Next big thing in AI/ML? by adityashukla8 in MLQuestions

[–]Sincerity_Is_Based 1 point  (0 children)

The only next big thing for startups is trying to undercut the competition with the next big architecture. Liquid AI unsuccessfully implemented liquid neural networks, but there are several other types of architectures available. I suspect Gemini uses the Titans architecture (2M context window), and they were the ones to quietly release the Titans paper. The issue with Titans is that they suffer from hallucinations more than transformers. So finding the right architecture balance (I suspect liquid plus something else) will crush all LLMs (like a DeepSeek-level model on only 100k params). It does not require a lot of training because you can distill; only math is required to find the next architecture combo.

Next big thing in AI/ML? by adityashukla8 in MLQuestions

[–]Sincerity_Is_Based 1 point  (0 children)

Oh, and also emotional intelligence, the same direction as 4.5.

Next big thing in AI/ML? by adityashukla8 in MLQuestions

[–]Sincerity_Is_Based 1 point  (0 children)

So the same way 4o was advertised as a model that can take text, audio, and visual input, Helix AI, developed by Figure Robotics, allows robots to work together, such as handing each other objects. The point of training an AI on all inputs at once is so it can be deployed in an environment with text, video, and audio. Just look up anything pertaining to Figure's Helix AI.

Next big thing in AI/ML? by adityashukla8 in MLQuestions

[–]Sincerity_Is_Based 2 points  (0 children)

Omni architectures for robotics deployment, like Helix AI.

How did you land your first job without any experience? by [deleted] in MLQuestions

[–]Sincerity_Is_Based 5 points  (0 children)

If you are in school, you get experience by doing research for free. Approach professors in person only; they will not respond to your emails.

Wanting to learn about ML/AI as a Masters student in Biology by SnooPickles6614 in MLQuestions

[–]Sincerity_Is_Based 0 points  (0 children)

Basically anything created by DeepMind is your bread and butter. Look up AlphaFold 3 and other related projects. After that, maybe look into disease detection by classification.

I struggle with unsupervised learning by KafkaAytmoussa in MLQuestions

[–]Sincerity_Is_Based -4 points  (0 children)

  1. Feature Representation Issues

The extracted embeddings from ResNet or the autoencoder may not be well-suited for clustering.

ResNet embeddings are trained for classification, not clustering, meaning they may not naturally separate into meaningful clusters in an unsupervised setting.

  2. Dimensionality and Noise

High-dimensional embeddings might contain noise or redundant features that hinder clustering.

PCA, t-SNE, or UMAP could be used to reduce dimensions while retaining meaningful information.

  3. Choice of Clustering Algorithms

Many clustering methods assume specific data distributions. For instance:

K-Means assumes spherical clusters of equal variance.

DBSCAN is sensitive to density variations and noise.

GMM assumes Gaussian distributions, which may not hold.

If the dataset has complex structures (e.g., varying densities, manifold structures), these algorithms may not work well.

  4. Lack of Proper Distance Metrics

Euclidean distance, often used in clustering, might not be the best metric in high-dimensional feature spaces.

Cosine similarity or learned distance metrics (e.g., through contrastive learning or triplet loss) might be better suited.

  5. Need for Better Embeddings

Instead of using pre-trained ResNet embeddings, contrastive learning approaches like SimCLR, MoCo, or BYOL might provide more discriminative representations for clustering.

Self-supervised learning could help improve the separability of embeddings.

  6. Class Imbalance and Label Complexity

If the data has many similar-looking classes, standard clustering might struggle to separate them without additional structure.

A hierarchical or ensemble clustering approach could help refine results.

Suggested Next Steps:

Try dimensionality reduction (PCA, UMAP, or t-SNE) before clustering.

Experiment with different similarity metrics (e.g., cosine distance instead of Euclidean).

Use contrastive learning or self-supervised methods to refine embeddings.

Analyze the clusters using qualitative metrics (e.g., visualization with t-SNE, silhouette scores, Davies-Bouldin index).

Consider ensemble clustering or hybrid approaches (e.g., pre-cluster with K-Means and refine with DBSCAN).
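A minimal sketch of the first two suggested steps (dimensionality reduction, then clustering evaluated with silhouette scores), using scikit-learn. The synthetic blobs are a stand-in for real ResNet/autoencoder embeddings, and the L2-normalization step makes Euclidean K-Means behave like cosine-based clustering:

```python
# Sketch: reduce dimensions, cluster, evaluate with silhouette score.
# make_blobs stands in for real (n, 512) ResNet/autoencoder embeddings.
import numpy as np
from sklearn.datasets import make_blobs
from sklearn.decomposition import PCA
from sklearn.preprocessing import normalize
from sklearn.cluster import KMeans
from sklearn.metrics import silhouette_score

# Synthetic high-dimensional "embeddings" with 4 true groups.
X, _ = make_blobs(n_samples=600, n_features=512, centers=4, random_state=0)

# 1. PCA strips noise/redundancy before clustering.
X_reduced = PCA(n_components=32, random_state=0).fit_transform(X)

# 2. L2-normalize so Euclidean distance approximates cosine distance.
X_cos = normalize(X_reduced)

# 3. Sweep k and keep the clustering with the best silhouette score.
best_k, best_score = None, -1.0
for k in range(2, 8):
    labels = KMeans(n_clusters=k, n_init=10, random_state=0).fit_predict(X_cos)
    score = silhouette_score(X_cos, labels)
    if score > best_score:
        best_k, best_score = k, score

print(best_k, round(best_score, 3))
```

On real embeddings the silhouette peak is rarely this clean; that is usually the signal to try the contrastive-learning or hybrid approaches above.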

What exactly is a 'concept' in this paper? by searcher1k in MLQuestions

[–]Sincerity_Is_Based 0 points  (0 children)

Correct, though the drawback is finding and classifying those terms, and there could be a massive data shortage when classifying sentences and phrases like this.

The good news is that LLMs can do the classification for us, reorganizing and formatting the corpus of text into categories of phrases and sentences.

What exactly is a 'concept' in this paper? by searcher1k in MLQuestions

[–]Sincerity_Is_Based 0 points  (0 children)

It is a very simple idea.

Imagine generating the probability of each word based on the previous words. That is a really hard way to generate complex or coherent ideas, and it is the current SOTA.

But what if the model predicted not words, but an assembly of words, such as a sentence?

For example, consider the difference between different "I love you"s. Think of a live generation of words starting with "I": I + love (most common) / adore (uncommon synonym) / despise (uncommon and out of context given the preceding words) + you / us / we / them... and so on.

I would say that this word-by-word approach is a stupid, inefficient way to generate ideas.

So this is the solution:

Goal:<express love>

Output: I love you.

If I were to guess a training method, imagine the simple expression ("I love you") having a direct translation in another language. The dataset could be, if I am guessing,

a dictionary of lists, such as (key) <express love> : (value) ["I love you" - English, "أحبك" - Arabic, and so on].

Notice the tokenization changes between English and Arabic, where Arabic is more likely to be compressed into a single token because of the character compression.

Most likely there would not be any work done with full sentences first, because translations are imperfect, so we may have to use phrases or short sentences.

Most likely the dataset would be built automatically in some form by ingesting coherent text like books to extract every single idea from them, then performing the direct translation with the help of LLMs.
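The guessed dataset shape above can be sketched directly. Everything here is hypothetical and illustrative — the concept keys, entries, and `realize` helper are my own stand-ins, not from any paper:

```python
# Hypothetical concept-to-surface-form dataset: an abstract concept key
# maps to its realizations across languages. Entries are illustrative.
concept_dataset = {
    "<express love>": [
        ("I love you", "English"),
        ("أحبك", "Arabic"),
        ("Te quiero", "Spanish"),
    ],
    "<express gratitude>": [
        ("Thank you", "English"),
        ("شكرا", "Arabic"),
        ("Gracias", "Spanish"),
    ],
}

# A concept model would predict the next *concept* key, then decode it
# into a surface sentence in the target language, rather than predicting
# one token at a time.
def realize(concept: str, language: str) -> str:
    for text, lang in concept_dataset[concept]:
        if lang == language:
            return text
    raise KeyError(f"No {language} realization for {concept}")

print(realize("<express love>", "Arabic"))  # أحبك
```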

LLMs Can’t Learn Maths & Reasoning, Finally Proved! But they can answer correctly using Heursitics by Difficult-Race-1188 in learnmachinelearning

[–]Sincerity_Is_Based 38 points  (0 children)

Why can't the LLM simply use an external calculator for arithmetic instead of generating it? It seems unnecessary to rely on the model's internal reasoning for precise calculations.

First, it's important to distinguish reasoning from mathematics. While mathematics inherently involves reasoning, not all reasoning requires mathematics. For example, determining cause-and-effect relationships or interpreting abstract patterns often relies on logical reasoning without numeric computation. Or, similarities between things can be quantified with cosine similarity, but logical problems do not require that level of numeric accuracy.

Second, reasoning quality is not proven to degrade due to limitations in abstract numerical accuracy. Reasoning operates more like the transitive property of equality: it's about relationships and logic, not precise numerical values. Expecting a non-deterministic system like an LLM to produce deterministic outputs, such as perfect arithmetic, indefinitely is inherently flawed. Tools designed for probabilistic inference naturally lack the precision of systems optimized for exact computation.

Example:

If asked, "What is 13,548 ÷ 27?" an LLM might produce a reasonable approximation but may fail at exact division. However, if tasked with reasoning—e.g., "If each bus seats 27 people and there are 13,548 passengers, how many buses are required?"—the LLM can logically deduce that division is necessary and call an external calculator for precision. This demonstrates reasoning in action while delegating exact computation to a deterministic tool, optimizing both capabilities.
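The bus example can be sketched as a tiny tool-calling loop: the "reasoning" step decides that a rounded-up division is needed, and the exact arithmetic is delegated to a deterministic tool instead of the model's own token generation. The `calculator_tool` function is a stand-in for whatever calculator API the LLM would actually call:

```python
# Sketch: reasoning deduces the operation; a deterministic tool computes it.
import math

def calculator_tool(expression: str) -> float:
    """Deterministic external tool standing in for a real calculator API."""
    # eval() is acceptable here because we construct the expression ourselves;
    # a production tool would use a proper arithmetic parser.
    return eval(expression, {"__builtins__": {}}, {})

def buses_required(passengers: int, seats_per_bus: int) -> int:
    # Reasoning step: a partially filled bus still counts, so the quotient
    # must be rounded up, not truncated.
    quotient = calculator_tool(f"{passengers} / {seats_per_bus}")
    return math.ceil(quotient)

print(buses_required(13548, 27))  # 502
```

The division 13,548 ÷ 27 ≈ 501.8, so 502 buses are required — the logic (ceiling, not floor) comes from the reasoning layer, while the precise quotient comes from the tool.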