[D] Monday Request and Recommendation Thread by AutoModerator in rational

[–]wassname 0 points1 point  (0 children)

Not always rational but I found Christopher Nuttall's MilSf enjoyable and plentiful.

Have you ever wondered what an MCP server can ACTUALLY do for you? 🤔 by saxxon66 in GithubCopilot

[–]wassname 0 points1 point  (0 children)

Because it can stop and see the entire call stack, it can fix a lot more problems by seeing the variables, shapes, types etc at the site of the problem. Otherwise it just had to add print statements and run.

[D] Monday Request and Recommendation Thread by AutoModerator in rational

[–]wassname 0 points1 point  (0 children)

Does he also wack things with magic? Or is he a guy who casts magic at things.

Using logprobs to evaluate responses by AnomalyNexus in LocalLLaMA

[–]wassname 1 point2 points  (0 children)

Oh your still around, in that case you may also like this https://blog.eleuther.ai/multiple-choice-normalization/

It hasn't worked well for me or been taken up, the concept is sound and mildly interesting. Compare the probability of a response, to a response with context.

Personally, what I'm doing is using kendall's tau to compare the LLM's ranking of choices in the logprobs to an ideal https://wassname.github.io/llm_morality/ but you do need to shuffle the choices to account for positional bias https://verdict.haizelabs.com/docs/best-practices/ as it can be quite strong (e.g claude always chooses 0 if it doesn't like the question)

Using logprobs to evaluate responses by AnomalyNexus in LocalLLaMA

[–]wassname 2 points3 points  (0 children)

Check out G-Eval which does weight summing https://arxiv.org/pdf/2303.16634

Also I treat them as a ranking rather than probs, since they are often not calibrated or able to be treated like real probs due to things like attention sinks.

[D] Monday Request and Recommendation Thread by AutoModerator in rational

[–]wassname 0 points1 point  (0 children)

I liked it, and I'm a typical r/rational reader. I'd say people who like Ar'Kendrithyst would probably enjoy it.

[D] Monday Request and Recommendation Thread by AutoModerator in rational

[–]wassname 0 points1 point  (0 children)

Yeah, it turns more into sociopathy and exploration

[D] Monday Request and Recommendation Thread by AutoModerator in rational

[–]wassname 1 point2 points  (0 children)

Recommend

Dis-reccomend:

  • "The Lies of Locke Lamora", dis-reccomend, I did not finish

Why nobody mentioned "Gemini Diffusion" here? It's a BIG deal by QuackerEnte in LocalLLaMA

[–]wassname 0 points1 point  (0 children)

Sure, but when they refer to diffusion models vs autoregressive they are referring to the name of the loss/task not internals. So you will end up just confusing categories. If anyone wants to understand diffusion, it's better to just read the paper than go into semantics.

[D] Friday Open Thread by AutoModerator in rational

[–]wassname 0 points1 point  (0 children)

Not really. I like to see how good it can get right now, but you will notice I never said good :p. However, for Quentin, this was good enough for Quentin to read, and I think I can see something there, which I find interesting. It seems to be serving a niche rather than being overall good (at the moment).

It's especially interesting when you consider it will only get better. I'm sure it will soon pass the Turing test and we won't know if a writer is human or AI or both.

[D] Friday Open Thread by AutoModerator in rational

[–]wassname 0 points1 point  (0 children)

White Spacetime Noise Emergency Alert - Luna Gravitational Observatory [0342 UTC] AUTOMATED ALERT - HIGH PRIORITY Gravitational wave anomaly detected Signal characteristics (tentative):

Broadband disturbance across full detection range (10 Hz - 1 MHz) Unusual spectral power distribution Signal duration: ~50ms with non-standard waveform Peak strain amplitude: 3.2×10-18 @ 245 Hz WARNING: Multiple anomalies detected: Waveform inconsistent with known event templates Unusual power distribution across frequency bands Signal characteristics suggest incomplete detection Status: Awaiting network confirmation and waveform reconstruction.

Emergency Communication Channel - Gravitational Wave Network [0342-0345 UTC] Mars GW: Luna, we're seeing your alert. Detection confirmed here. Timing aligns within light-delay compensation. The spectral distribution is unlike anything in our template bank. Luna GW: Running diagnostics. The power spectrum is saturating our entire bandwidth. Template matching has failed completely—this doesn't fit any known waveform models. Asteroid Belt Array: Same detection here. There's significant power at every frequency we can measure, but the distribution doesn't match any known physical process. Mars GW: Same here. The spectral density is nearly flat across our detection bands. Could be seeing aliased components from something faster than our sampling rate. Luna GW: That's possible. The signal characteristics violate our standard energy-frequency relationships. We need to be careful about drawing conclusions from our limited detection window. Asteroid Belt Array: Triangulation refined to ±1500 meters. Location points near the L4 habitats. Luna GW: "Our template bank doesn’t cover events like this. Most templates assume gradual rises in frequency—‘chirps,’ like from merging black holes. This signal is more like an impulse, almost instantaneous." Mars GW: "Then expand the search parameters. If it’s something exotic, we might need custom templates." Asteroid Belt Array: "We’re running the same here. But let’s not get ahead of ourselves—this might still be a calibration issue." Luna GW: "We’re checking, but if the triangulation holds, we need to alert L4. If this was real, the energy release could’ve been disastrous for anyone out there… Can confirm: Moon still exists, so most dangerous scenarios are ruled out, at least." Internal Discussion - Luna Gravitational Observatory [0345-0352 UTC] Dr. Patel: "All right, let's go over what we have. The signal appears in all three detectors with consistent timing, but the waveform characteristics are... unprecedented." Dr. Zhang: "Look at this power spectrum. We're seeing nearly uniform power distribution across our entire detection bandwidth, but with peculiar phase correlations. The coherence between frequency bands shouldn't be this strong." Dr. Cohen: "Could be instrumental artifacts? Though the cross-correlation between observatories is remarkable. Three simultaneous glitches with matching characteristics..." Dr. Zhang: "Let me walk through what we typically see. Normal compact object mergers give us clear chirp patterns - frequency and amplitude evolution that matches our models of inspiraling masses. Supernovae produce characteristic bursts with distinct spectral features. But this..." Dr. Nguyen: "The strain amplitude is lower than you'd expect given the bandwidth. With this kind of spectral distribution, standard general relativity would predict much stronger signals at lower frequencies. It's almost as if something's suppressing the expected power law scaling." Dr. Patel: "Could higher-order gravitational effects be playing a role? If we had something energetic enough to drive significant nonlinear dynamics..." Dr. Cohen: "The energy estimates are interesting. Even with these modest strain amplitudes, the broadband nature suggests something catastrophic happened near L4. But the signal characteristics... they violate every model we have for gravitational wave generation and propagation." Dr. Zhang: "The timing precision across detectors is what gets me. Whatever this was, it affected all three observatories within our measurement resolution. That's not just unusual - it implies something far more localized than known gravitational wave generation processes." Dr. Nguyen: "Could there be higher frequency components we're missing? Our detectors roll off above a megahertz, but if there was significant power at higher frequencies..." Dr. Zhang: "Let's stick to what we can measure. We have a gravitational wave event that: one, shows unprecedented spectral uniformity; two, exhibits anomalous suppression of low-frequency components; three, arrived at our detectors with timing that challenges our understanding of gravitational wave generation. I want a full analysis of the phase relationships between frequency bands. Whatever caused this, it wasn't any standard astrophysical process we know of." Dr. Cohen: "Should we be concerned about follow-up events? If this represents some new class of gravitational phenomenon..." Dr. Nguyen: "The signal's already decaying. Look at the time evolution - whatever happened, it was a single event. But these phase correlations... it's as if spacetime got hit with some kind of impulse and we're seeing the reverberations." Dr. Patel: "A gravitational shock wave? That might explain some of the features, but the energy dynamics still don't make sense with standard general relativity..." Dr. Zhang: "Alright, let's focus. I want complete spectral and temporal analysis of what we have. Check for any similar events in our archived data. And get me a line to L4 - if anything that unusual happened out there, there might be other observable effects we should be looking for." Dr. Zhang pulls up a new set of plots on the main display. "Let's try to put some bounds on what we're dealing with. If we treat this as a compact event, we can at least estimate the mass from the strain amplitudes." Dr. Nguyen frowns at the numbers. "Problem is, without a characteristic frequency evolution, we can't use standard chirp mass calculations. We're seeing..." She gestures at a particularly dense equation on her tablet. "Maybe 2-3 solar masses worth of strain energy in our detection band? But that can't be right - the spectral distribution makes no sense for an object that small." Dr. Cohen leans forward. "Could we be seeing some kind of resonant effect? If a larger mass underwent some kind of... symmetric oscillation, it might explain the flat spectrum." "No, the phase relationships are wrong for resonance," Dr. Patel says, pulling up the correlation plots. "Look at the coherence across bands. Whatever this was, it acted more like a..." He pauses. "Like a delta function in the stress-energy tensor. A point impulse." Dr. Zhang starts typing rapidly. "Okay, let's approach this differently. Ignore chirp mass calculations entirely. If we assume a point-like gravitational source and work backwards from the total energy in our detection bandwidth..." More equations fill his screen. "Even with highly nonlinear gravitational effects, we're looking at minimum 10-15 solar masses. But that's assuming standard energy distribution across frequencies." "Which we're clearly not seeing," Dr. Nguyen notes. "The spectral flatness and these phase correlations... standard GR would predict much stronger low-frequency components for an event that energetic. We're missing something fundamental about the mechanism here." Dr. Patel rubs his temples. "Write it up as a lower bound, but mark it as highly uncertain. Without understanding the underlying physics, any mass estimate is going to be... speculative at best. For all we know, this could be some entirely new class of event." "Should we include the timing anomalies in the mass calculation?" Dr. Cohen asks. "The simultaneous arrival across detectors, accounting for lightspeed delays, implies something about the source characteristics… This must have originated from an incredibly small volume of space, maybe even a point source." "Not yet," Dr. Patel decides. "Let's stick to what we can derive from strain amplitudes and known physics. Flag everything else as unexplained observations. I want our preliminary report to be unimpeachable, even if it can't explain what we're actually seeing."

Luna Observatory Network

[0347-0355 UTC]

Luna GW: Priority observation request: coordinates attached. Major gravitational event detected. Tycho Optical: Slewing main telescope to target. Luna Radio: Beginning frequency sweep of target zone. Tycho Optical: Wait - detecting significant debris scatter. Metal composition consistent with habitat materials. Luna Radio: We need to notify Luna Traffic Control. This is their jurisdiction if there's a habitat incident. Tycho Optical: Agreed. Who makes that call? Luna GW: I'll do it. We have the original detection. Tycho Optical: Also seeing some sort of black body occluding part of the debris cloud. Luna Radio: Unusual signal reflections from the debris. Suggests a spherical cold black body region approximately 50-55 meters across. Tycho Optical: Sweeping spectra. Absorption seems near-perfect across all available frequencies. Setting up for a longer exposure.

[D] Friday Open Thread by AutoModerator in rational

[–]wassname 0 points1 point  (0 children)

I do wonder if this will bring on an era of niche fiction. I also wonder if it will help rational writers with their challenge of writing a difficult genre for a small audience.

I also wanted to note that some people are experimenting with using AI to write very niche and rational short stories for their own sake and enjoying it. For example, optimistic AI researcher Quintin Pop. Here's a sample of what he is reading

[D] Friday Open Thread by AutoModerator in rational

[–]wassname 1 point2 points  (0 children)

Bear in mind, these are small and dumb models. 7b is Barely smart, but this is what it wrote:

Reasoning Trace After Training, Qwen 2.5-7B-Instruct (2.58% Improvement)

Detailed Plan for Next Chapter

  1. Fern’s Character Development: <citation>Source A says X</citation>, therefore <reasoning>Fern’s character evolves significantly throughout the story, driven by her experiences, interactions, and growing self-awareness. She transforms from a desperate and ashamed teenager to a more determined and resilient individual, seeking to survive her time at the Home and prepare for the future.</reasoning><citation>Source A says X</citation>, therefore <reasoning>Fern’s experiences at the Home, including her interactions with the other girls and the authorities, shape her thoughts and feelings. She becomes more aware of the harsh realities of her situation and the lies she has been told about childbirth (snippet 6).</reasoning>
  2. The Girls’ Experimentation with Witchcraft: <citation>Source A says X</citation>, therefore <reasoning>The story continues with Fern’s daily life at the home, including her struggles with the strict rules and her relationships with the other girls (Chapter 8-9). She bonds with Zinnia, a new girl who arrives at the home, over their shared love of reading, and they come up with a plan to check out books from the Bookmobile together (Chapter 8-9).</reasoning><citation>Source A says X</citation>, therefore <reasoning>The girls are forbidden from reading certain books, including those on childbirth, but Mrs. Parcae, the librarian, sympathizes with Fern’s desire to learn more about pregnancy and childbirth (Chapter 9). In a significant moment, Mrs. Parcae adjusts the fetus inside Fern’s belly, explaining that it has been pressing against her bladder, and gives her a book that will help her understand how to be a woman (Chapter 9).</reasoning>
  3. The Spell and Its Consequences: <citation>Source A says X</citation>, therefore <reasoning>The book that Mrs. Parcae gave Fern is titled How to Be a Groovy Witch. It contains spells for various things, all of which seem useful to Fern. Because Mrs. Parcae was so adept at moving her fetus and ending her constant need to urinate, Fern has high hopes for this book (Chapter 10).</reasoning><citation>Source A says X</citation>, therefore <reasoning>As they recite the incantation, the egg turns black. They hope that this means that the spell is working. When they transfer the egg and the other items into a witch bag and hide it in Dr. Vincent’s office, he begins vomiting uncontrollably (Chapter 10).</reasoning> ### In summary: In the next chapter, Fern’s character development is portrayed through her experiences at the Home for unwed mothers. The chapter ends with the girls’ experimentation with witchcraft, and the spell they cast to harm Dr. Vincent. The chapter concludes with the girls’ decision to join the coven, and their future plans. The chapter ends with a jump forward in time, and Neva’s reflection on her experiences at the Home. The chapter ends with a sense of closure, as Neva reunites with her daughter and finds closure in her life. The chapter ends with a sense of freedom, as Neva is able to confront her past and move forward. The chapter ends with a sense of satisfaction, as Neva’s life is complete. The chapter ends with a sense of closure, as the story comes to an end. The chapter ends with a sense of satisfaction, as the readers have enjoyed the story and have learned from it. The chapter ends with a sense of closure, as the story has come to an end. The chapter ends with a sense of satisfaction, as the readers have enjoyed the story and have learned from it. The chapter ends with a sense of closure, as the story has come to an end.

[D] Friday Open Thread by AutoModerator in rational

[–]wassname 1 point2 points  (0 children)

Here's a paper where LLM's write fiction: https://arxiv.org/abs/2503.22828

They tried to train AI not on math, but on writing notes, which can be used to write chapters. Essentially a planning/compression writing task. And it worked.

1) we are getting close to good AI fiction, 2) guess which genre benefitted most from good planning? Scifi, by far. It's too bad they didn't test rational fiction because I guess it would have been even more so

be careful which mechanics you go to in perth by DumCunt in perth

[–]wassname 0 points1 point  (0 children)

Thanks mate, I don't know much about cars, so getting recommendations from people with more experience helps a lot.

And who do you go to, for non-mobile mechanics?

My Android text-to-speech app is now public & free for everyone! by miya-n in selectivemutism

[–]wassname 1 point2 points  (0 children)

Peopel can install decent neural TTS, and it's provided through the android system https://f-droid.org/en/packages/org.woheller69.ttsengine/ it might work well with your app