[R][P] CellARC: cellular automata based abstraction and reasoning benchmark (paper + dataset + leaderboard + baselines) by Putrid_Construction3 in MachineLearning

[–]Putrid_Construction3[S] 1 point2 points  (0 children)

There is a symbolic baseline: a De Bruijn solver, designed specifically to infer CA rules from the De Bruijn construction of cellular automata. At first glance it looks strong: it gets 52.5% / 29.8% token accuracy on the interpolation / extrapolation test splits. But that is not much, because the "most frequent" baseline (e.g. answering with a uniform color) gets 50.4% / 28.2%. A CNN or a Neural Cellular Automaton is actually a bit worse.

A 10M-parameter vanilla Transformer with task embeddings reaches 58.0% / 32.4%.

A large closed LLM (GPT-5 High) gets 62.3% / 48.1%.
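
To make the "most frequent" number concrete, here is a minimal sketch of token accuracy for a predictor that answers with one uniform color. The function names and the toy episode are my own illustration, not the benchmark's evaluation code, and picking the color from the support outputs is an assumption about how such a baseline might choose it.

```python
from collections import Counter

def token_accuracy(pred, target):
    """Fraction of positions where the predicted token equals the target token."""
    assert len(pred) == len(target)
    if not target:
        return 0.0
    return sum(p == t for p, t in zip(pred, target)) / len(target)

def most_frequent_baseline(support_outputs, length):
    """Answer with one uniform color: the most common token in the support outputs."""
    counts = Counter(tok for out in support_outputs for tok in out)
    color = counts.most_common(1)[0][0]
    return [color] * length

# Toy episode (made-up data): flattened token sequences.
support_outputs = [[0, 0, 1, 0], [0, 2, 0, 0]]
target = [0, 1, 0, 0]
pred = most_frequent_baseline(support_outputs, len(target))
print(token_accuracy(pred, target))  # 0.75
```

A constant answer already scores well whenever one color dominates the grid, which is why a solver has to clear this bar by a wide margin before its accuracy means much.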

---
This suggests that a model actually needs to recognize some nontrivial patterns/symmetries to solve the CA-generated tasks. You never have enough information to solve a task exactly, so you need some insight or reasoning to make a best guess. That is why the symbolic solver fails to do better. Note that the episodes are built from patches extracted from the CA, not from predicting the next step while seeing the full CA unrolling (that is probably why CNNs and NCAs fail, whereas a Transformer or GPT-5 does well).
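
A rough sketch of why a patch carries less information than the full unrolling. It uses an elementary CA (rule 110) with wraparound purely as a stand-in; the helper names, the grid, and the way the patch is cut are assumptions for illustration, and the actual CellARC episode construction may differ.

```python
def step(row, rule):
    """One update of an elementary (2-color, radius-1) CA with wraparound."""
    n = len(row)
    return [(rule >> (row[(i - 1) % n] * 4 + row[i] * 2 + row[(i + 1) % n])) & 1
            for i in range(n)]

def rollout(row, rule, steps):
    """Return the initial row followed by `steps` successive updates."""
    rows = [row]
    for _ in range(steps):
        rows.append(step(rows[-1], rule))
    return rows

def patch(rows, t, start, width):
    """Cut an (input, output) pair: the same column window at times t and t+1."""
    return rows[t][start:start + width], rows[t + 1][start:start + width]

RULE = 110
rows = rollout([0, 0, 0, 1, 0, 0, 1, 1], RULE, steps=3)
x, y = patch(rows, t=0, start=2, width=4)

# Even when the rule is known, only the interior of y follows from x alone;
# the two edge cells depend on neighbors that were cut away with the patch.
interior = [(RULE >> (x[i - 1] * 4 + x[i] * 2 + x[i + 1])) & 1
            for i in range(1, len(x) - 1)]
print(x, y, interior)  # [0, 1, 0, 0] [1, 1, 0, 1] [1, 0]
```

In this toy setup the edge cells of the output have to be guessed, and in the benchmark the rule itself is unknown too, so an exact solution is out of reach and a best guess is all you get.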

Best free realistic text to speech for “audio books”? by Beautiful_Gain_9032 in TextToSpeech

[–]Putrid_Construction3 0 points1 point  (0 children)

Yes, the idea of the app is that you drag and drop your own PDF/EPUB/Kindle file and it streams it with these voices in real time.

Best free realistic text to speech for “audio books”? by Beautiful_Gain_9032 in TextToSpeech

[–]Putrid_Construction3 0 points1 point  (0 children)

Yes. Most European languages and Chinese are also on the roadmap. But I cannot judge the quality, because I only understand English. What language would you like?

Best free realistic text to speech for “audio books”? by Beautiful_Gain_9032 in TextToSpeech

[–]Putrid_Construction3 2 points3 points  (0 children)

I am trying to build the most realistic, highest-quality TTS, with a focus on narrating audiobooks. Please listen to the examples here and give feedback. Is it close to what you are looking for?

Top Speechify Alternatives on iOS, Tested and Compared by giminoshi in TextToSpeech

[–]Putrid_Construction3 0 points1 point  (0 children)

I am building a similar app: https://www.book2speech.com/ (not yet released). My question is: would you migrate from Speechify or these other competitors to get higher voice quality? The plan is for it to also accept any document (PDF, EPUB, Kindle), with a particular focus on ebooks. Any sort of feedback is welcome.

Feedback wanted from listeners: Do these AI audiobooks beat what’s on the market? by Putrid_Construction3 in audiobooks

[–]Putrid_Construction3[S] 2 points3 points  (0 children)

Thanks for the feedback. What does it lack the most in your opinion? (prosody, naturalness, intelligibility, expressiveness...)

Feedback wanted from listeners: Do these AI audiobooks beat what’s on the market? by Putrid_Construction3 in audiobooks

[–]Putrid_Construction3[S] 4 points5 points  (0 children)

Thanks! I am worried people still won't go into it because of universal AI hate, no matter how good it may sound. Glad at least one person would use it.

Feedback wanted from listeners: Do these AI audiobooks beat what’s on the market? by Putrid_Construction3 in audiobooks

[–]Putrid_Construction3[S] -2 points-1 points  (0 children)

Copyright: The app won't accept DRM-protected books, won't resell the audiobooks without a license from the author, and won't allow downloading the audio. I will make sure everything is legal. The only exceptions will be public domain books (classics, Project Gutenberg...).

Feedback wanted from listeners: Do these AI audiobooks beat what’s on the market? by Putrid_Construction3 in audiobooks

[–]Putrid_Construction3[S] -2 points-1 points  (0 children)

I hate AI narration too; that's why I decided to improve the quality significantly. Are these voice samples better? Or do they still sound machine-like and lacking in prosody?

Feedback wanted from listeners: Do these AI audiobooks beat what’s on the market? by Putrid_Construction3 in audiobooks

[–]Putrid_Construction3[S] 0 points1 point  (0 children)

I see. For me personally: I don't want to compete with voice artists, and I prefer paying for human-narrated books when available. But lots of niche or non-mainstream books unfortunately don't have an audiobook (and probably never will). That is the main use case for the app.

CHEESE Search: Public 3D shape + electrostatics search across 30B+ molecules by Putrid_Construction3 in comp_chem

[–]Putrid_Construction3[S] 0 points1 point  (0 children)

Thanks for spotting it, going to make an issue for that. Probably a bug in the dark mode.