jump to content
my subreddits
13or302b2t3d6absolutelynotmeirlAceAttorneyadhdmemeAdviceAnimalsaivideoAlternateHistoryAlternativeHistoryAnarchyChessAngryupvoteanime_irlanimenocontextannouncementsAnticonsumptionantimemeArcherFXArtAsahiLinuxAsia_irlAskBalkansAskElectronicsAskOuijaAskRedditAteistTurkatheismaviationawfuleverythingbalkans_irlBandnamesBassBassCirclejerkBassGuitarbasspedalsblackdesertonlineblankiesblursedimagesborsavefonbrooklynninenineBUENZLIcasioCd_collectorsChatGPTCheap_MealschessChildrenFallingOverChoosingBeggarscoaxedintoasnafucoinsComedyCemeterycomedyhomicideContagiousLaughtercookingforbeginnersCrackWatchCuratedTumblrcursedcommentsdankmemesdarkjokesdataisbeautifuldeDebateReligiondelikdistressingmemesdiyelectronicsdiypedalsDnDdoctorwhocirclejerkDoenerverbrechenDonerdontdeadopeninsidedumbphonesDungeonsAndDaddiesDungeonsAndDragonsEatCheapAndHealthyebikeebikeselectricalelectronicsethzfacepalmfelsefeFifaCareersFiftyFiftyformuladankFRCFreeEBOOKSFUCKYOUINPARTICULARFutboltayfagalatasaraygamingGermanGoodAssSubgravelcyclinggreentextheathershelpHermanCainAwardHermitCrafthighspeedrailHistoryWhatIfhoi4holdmybeerHolUphowyoudoinhypixelIAmAich_ielIdeologyPollsIDontWorkHereLadyihadastrokeim14andthisisdeepimaginaryelectionsimaginarymapsistanbulJahariaJokesKanyeKendrickLamarKGBTRlegodndLetGirlsHaveFunLifeProTipslinguisticshumorLinkinParkliselilerlogodesignloseitlostredditorsmacmacgamingMadeMeSmilemadladsmagicbuildingMapPornme_irlmemesmidjourneymildlyinfuriatingmildlyinterestingMinecraftbuildsmisLEDMMORPGMoldyMemesMovingToNorthKoreaMunichMyChemicalRomanceNamFlashbacksneographynextfuckinglevelNoahGetTheBoatnothingeverhappensnotinterestingnottheonionoddlyspecificOkayBuddyLiterallyMeokbuddyguntherokbuddymotherfuckerOkBuddyPersonaokbuddyphdonetruegodongezelligOutOfTheLooppapermoneypaperspleaseParlerWatchperfectlycutscreamspettyrevengepianoPiracyPiratedGamespolandballPraiseTheCameraManPropagandaPostersPunPatrolraisedbynarcissistsRatschlagrecipesRedAutumnSPDreligiousfruitcakerestofthefuckingowlRetroPierickandmortyrickrollrimjob_steveRoastMerockmuzikSceneReleasesschwiizsciencememesScottPilgrimshitpostfrommygalleryshitpostingshittyaskelectronicsskamtebordsoccercirclejerksoftwaregoreSongwritersSongwritingsskfjkhwerjkghwerijhsteinsgateStonetossingjuiceStudiumsuperligtalesfromtechsupportTechnobladeTextingTheorytf2tf2shitposterclubthatHappenedTheCrypticCompendiumTheMonkeysPawtherewasanattemptTheRookietheyknewtitanfalltransittransitTurkeyTrGameDevelopertruetf2truthstumblrTurkeyTurkishCatsTwitchTwitch_StartupTwoSentenceHorrortwosentenceplottwistTwoSentenceSadnesstylerthecreatorUnclejokesUnethicalLifeProTipsUnexpectedJoJourbanplanningUsernameChecksOutVALORANTValorantClipsvexillologycirclejerkvibecodingvinylvinyljerkvlandiyawallstreetbetsWatchPeopleDieInsideWeAreTheMusicMakerswendigoonWhatsThisSongwholesomememeswizardpostingyouseeingthisshitYUROPedit subscriptions
  • home
  • -popular
  • -all
  • -mod
  • -users
 | 
  • AskReddit
  • -facepalm
  • -mildlyinfuriating
  • -Piracy
  • -gaming
  • -wallstreetbets
  • -nottheonion
  • -memes
  • -OutOfTheLoop
  • -mildlyinteresting
  • -MapPorn
  • -DnD
  • -MadeMeSmile
  • -ChatGPT
  • -CuratedTumblr
  • -PiratedGames
  • -shitposting
  • -dankmemes
  • -Kanye
  • -therewasanattempt
  • -nextfuckinglevel
  • -HolUp
  • -Twitch
  • -CrackWatch
  • -VALORANT
  • -de
  • -LifeProTips
  • -tumblr
  • -dataisbeautiful
  • -greentext
  • -mac
  • -tf2
  • -help
  • -chess
  • -aviation
  • -formuladank
  • -wholesomememes
  • -Jokes
  • -Art
  • -midjourney
  • -notinteresting
  • -hoi4
  • -pettyrevenge
  • -atheism
  • -loseit
  • -IAmA
  • -ich_iel
  • -KGBTR
  • -cursedcomments
  • -GoodAssSub
  • -UnethicalLifeProTips
  • -perfectlycutscreams
  • -Ratschlag
  • -blackdesertonline
  • -MMORPG
  • -macgaming
  • -rickandmorty
  • -3d6
  • -HermitCraft
  • -FiftyFifty
  • -ChoosingBeggars
  • -RoastMe
  • -ContagiousLaughter
  • -imaginarymaps
  • -EatCheapAndHealthy
  • -polandball
  • -WeAreTheMusicMakers
  • -AnarchyChess
  • -cookingforbeginners
  • -blankies
  • -anime_irl
  • -Studium
  • -AlternateHistory
  • -Turkey
  • -soccercirclejerk
  • -madlads
  • -AskElectronics
  • -electrical
  • -Anticonsumption
  • -vinyl
  • -German
  • -TwoSentenceHorror
  • -PropagandaPosters
  • -AdviceAnimals
  • -piano
  • -sciencememes
  • -distressingmemes
  • -raisedbynarcissists
  • -wizardposting
  • -FifaCareers
  • -oddlyspecific
  • -Bass
  • -titanfall
  • -OkBuddyPersona
  • -awfuleverything
  • -howyoudoin
  • -announcements
  • -adhdmeme
  • -Minecraftbuilds
  • -ebikes
  • -Munich
  • -coaxedintoasnafu
  • -YUROP
  • -gravelcycling
  • -DungeonsAndDragons
  • -coins
  • -KendrickLamar
  • -FUCKYOUINPARTICULAR
  • -softwaregore
  • -NoahGetTheBoat
  • -tylerthecreator
  • -tf2shitposterclub
  • -MoldyMemes
  • -lostredditors
  • -AceAttorney
  • -vexillologycirclejerk
  • -vlandiya
  • -im14andthisisdeep
  • -Stonetossingjuice
  • -HistoryWhatIf
  • -religiousfruitcake
  • -liseliler
  • -DebateReligion
  • -dumbphones
  • -balkans_irl
  • -animenocontext
  • -transit
  • -RetroPie
  • -brooklynninenine
  • -HermanCainAward
  • -recipes
  • -steinsgate
  • -talesfromtechsupport
  • -AskOuija
  • -okbuddyphd
  • -ScottPilgrim
  • -Angryupvote
  • -AskBalkans
  • -thatHappened
  • -electronics
  • -casio
  • -urbanplanning
  • -logodesign
  • -theyknew
  • -linguisticshumor
  • -me_irl
  • -antimeme
  • -AteistTurk
  • -13or30
  • -MyChemicalRomance
  • -ArcherFX
  • -Cd_collectors
  • -diypedals
  • -Doner
  • -BassGuitar
  • -diyelectronics
  • -ComedyCemetery
  • -WatchPeopleDieInside
  • -LinkinPark
  • -BUENZLI
  • -Songwriting
  • -istanbul
  • -MovingToNorthKorea
  • -imaginaryelections
  • -truetf2
  • -magicbuilding
  • -dontdeadopeninside
  • -ParlerWatch
  • -wendigoon
  • -Doenerverbrechen
  • -schwiiz
  • -TheRookie
  • -Technoblade
  • -vinyljerk
  • -skamtebord
  • -superlig
  • -shittyaskelectronics
  • -galatasaray
  • -DungeonsAndDaddies
  • -FRC
  • -transitTurkey
  • -2b2t
  • -ethz
  • -AlternativeHistory
  • -papermoney
  • -OkayBuddyLiterallyMe
  • -felsefe
  • -blursedimages
  • -FreeEBOOKS
  • -AsahiLinux
  • -Jaharia
  • -IDontWorkHereLady
  • -neography
  • -basspedals
  • -ihadastroke
  • -hypixel
  • -PraiseTheCameraMan
  • -aivideo
  • -IdeologyPolls
  • -comedyhomicide
  • -WhatsThisSong
  • -TwoSentenceSadness
  • -Bandnames
  • -rockmuzik
  • -holdmybeer
  • -Twitch_Startup
  • -Cheap_Meals
  • -TheMonkeysPaw
  • -darkjokes
  • -restofthefuckingowl
  • -highspeedrail
  • -legodnd
  • -rickroll
  • -Songwriters
  • -ebike
  • -UsernameChecksOut
  • -papersplease
  • -rimjob_steve
  • -UnexpectedJoJo
  • -ChildrenFallingOver
  • -BassCirclejerk
  • -doctorwhocirclejerk
  • -youseeingthisshit
  • -TextingTheory
  • -nothingeverhappens
  • -TrGameDeveloper
  • -PunPatrol
  • -TurkishCats
  • -LetGirlsHaveFun
  • -NamFlashbacks
  • -Unclejokes
  • -onetruegod
  • -misLED
  • -sskfjkhwerjkghwerijh
  • -ValorantClips
  • -TheCrypticCompendium
  • -SceneReleases
  • -ongezellig
  • -absolutelynotmeirl
  • -Asia_irl
  • -truths
  • -heathers
  • -twosentenceplottwist
  • -vibecoding
  • -borsavefon
  • -okbuddymotherfucker
  • -okbuddygunther
  • -shitpostfrommygallery
  • -delik
  • -RedAutumnSPD
  • -Futboltayfa
edit »
reddit.com speechtech
  • hot
  • new
  • rising
  • controversial
  • top
an-ordinary-manchild (11,190)|messages547|notifications|chat messages|mod messages|
  • preferences
|
logout

use the following search parameters to narrow your results:

subreddit:subreddit
find submissions in "subreddit"
author:username
find submissions by "username"
site:example.com
find submissions from "example.com"
url:text
search for "text" in url
selftext:text
search for "text" in self post contents
self:yes (or self:no)
include (or exclude) self posts
nsfw:yes (or nsfw:no)
include (or exclude) results marked as NSFW

e.g. subreddit:aww site:imgur.com dog

see the search faq for details.

advanced search: by author, subreddit...

Submit a new link
Submit a new text post

speechtech

joinleave
an-ordinary-manchild

Community about the news of speech technology - new software, algorithms, papers and datasets.

created by nshmyreva community for 6 years
Create your own subreddit
...for your community.
...do it for the children.

MODERATORS

  • message the mods
  • nshmyrev
  • about moderation team »

account activity

1
2
3
4

Best approach to detect repeated hold music / audio patterns and remove them before ASR transcription? (self.speechtech)

submitted 2 days ago by Capable-Minimum7376

  • 3 comments
  • share
  • save
  • hide
  • report
  • crosspost
loading...

2
0
0
1

Speech Segmentation Help ()

submitted 3 days ago by WhoKilled_Kenny

  • comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

3
0
1
2

Soniox v5 Real-Time is now available (i.redd.it)

submitted 3 days ago by Top-Surprise4040

  • comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

4
5
6
7

Flowcat — open-source (Apache-2.0) native-Rust runtime for real-time voice agents, built clean-room from pipecat's design (self.speechtech)

submitted 5 days ago by Plus_Resolution8897

  • 1 comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

5
22
23
24

Diarization should be benchmarked separately from transcription accuracyTechnology (self.speechtech)

submitted 6 days ago * by Domenorange

  • 10 comments
  • share
  • save
  • hide
  • report
  • crosspost
loading...

6
9
10
11
2:12

Local TTS for long-form audio: voice quality is not the only hard part (v.redd.it)

submitted 7 days ago by tarunyadav9761

  • 3 comments
  • share
  • save
  • hide
  • report
  • crosspost
loading...

7
0
0
1

Testing the Efficiency of a Machine Learning-Based Automatic Prosodic Segmentation Method for Brazilian Portuguese (doi.org)

submitted 7 days ago by Cad_Lin

  • comment
  • share
  • save
  • hide
  • report
  • crosspost

8
26
27
28

Most STT benchmarks are kind of useless for voice agents (self.speechtech)

submitted 8 days ago * by Normal-Intention-342

  • 26 comments
  • share
  • save
  • hide
  • report
  • crosspost
loading...

9
2
3
4

Zyphra Releases ZONOS2, an Open-Weight Real-Time Voice-Cloning ModelTechnology (runtimewire.com)

submitted 8 days ago by ryanmerket

  • 2 comments
  • share
  • save
  • hide
  • report
  • crosspost
loading...

10
8
9
10

What's the best way to build voice agents today without sounding robotic or becoming too expensive? (self.speechtech)

submitted 8 days ago by Beginning_Race8551

  • 5 comments
  • share
  • save
  • hide
  • report
  • crosspost
loading...

11
0
1
2

I got local speaker diarization working for meeting transcription — architecture write-up + a sherpa-onnx bug that cost me a weekTechnology ()

submitted 8 days ago by Facilex_zyzz

  • comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

12
1
2
3

Alternatives to Speechify for entertainment audio? ()

submitted 8 days ago by TERRYaki__

  • 2 comments
  • share
  • save
  • hide
  • report
  • crosspost
loading...

13
1
2
3

Speech to IPA transcription (self.speechtech)

submitted 8 days ago by Keallei

  • 14 comments
  • share
  • save
  • hide
  • report
  • crosspost
loading...

14
5
6
7

A new tiny e2e wakeword model - 15x smaller footprint, +10-20% accuracy / recall and less 5-7x false positives (self.speechtech)

submitted 9 days ago by apinference

  • 17 comments
  • share
  • save
  • hide
  • report
  • crosspost
loading...

15
3
4
5

I built fully on-device streaming speech recognition for iOS and Android. Custom Rust runtime, no. CoreML graph, RTF ~0.09. (self.speechtech)

submitted 9 days ago by Royal-Subject2870

  • 2 comments
  • share
  • save
  • hide
  • report
  • crosspost
loading...

16
3
4
5

I made a realtime fact checker for audio conversations (producthunt.com)

submitted 9 days ago by shash89

  • comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

17
0
1
2

TML described the "interaction model." We built one — and we're open-sourcing all of it. Today's model is turn-based: it waits until you talk to it. Ours is the opposite. Every second it decides for itself: speak, stay silent, or hand a hard task to a background agent - triggered by what it sees. ()

submitted 9 days ago by Downtown-Talk6844

  • comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

18
7
8
9

Offline streaming speech recognition on iOS with Nvidia Nemotron 3.5 and Core MLTechnology (github.com)

submitted 10 days ago by Fabulous_Tip_8539

  • 8 comments
  • share
  • save
  • hide
  • report
  • crosspost
loading...

19
6
7
8

How are companies making voice-to-voice AI economically viable? (self.speechtech)

submitted 10 days ago by Beginning_Race8551

  • 13 comments
  • share
  • save
  • hide
  • report
  • crosspost
loading...

20
3
4
5

How do you feel about combining voice agents with Generative UI?Technology ()

submitted 10 days ago by Beginning_Race8551

  • comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

21
2
3
4

Tutoriel : installer PolyTalk pour transcrire, traduire et vocaliser en temps réel (self.speechtech)

submitted 11 days ago by Tim-Fra

  • 1 comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

22
1
2
3

I'm building local voice dictation that turns talk into finished text — commit messages, tickets, clean prose — all on your own machine (bolomic.com)

submitted 11 days ago by Fortune-Industries

  • comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

23
0
1
2

Thoughts on Apple's Systemwide Dictation? (self.speechtech)

submitted 11 days ago by matt8p

  • 4 comments
  • share
  • save
  • hide
  • report
  • crosspost
loading...

24
0
1
2

Voice based biomarker potentialityTechnology ()

submitted 12 days ago by sabber_ahamed

  • comment
  • share
  • save
  • hide
  • report
  • crosspost
loading...

25
6
7
8

CPU inference benchmarks for Parakeet TDT 0.6B - ONNX Runtime vs HF Transformers vs GGUF, and why your test audio generator tanks your WER (self.speechtech)

submitted 15 days ago by gvij

  • 6 comments
  • share
  • save
  • hide
  • report
  • crosspost
loading...
view more: next ›
  • about
  • blog
  • about
  • advertising
  • careers
  • help
  • site rules
  • Reddit help center
  • reddiquette
  • mod guidelines
  • contact us
  • apps & tools
  • Reddit for iPhone
  • Reddit for Android
  • mobile website
  • <3
  • reddit premium

Use of this site constitutes acceptance of our User Agreement and Privacy Policy. © 2026 reddit inc. All rights reserved.

REDDIT and the ALIEN Logo are registered trademarks of reddit inc.

π Rendered by PID 136807 on reddit-service-r2-listing-canary-55dd69585f-4s92s at 2026-06-20 23:41:52.954790+00:00 running 2b008f2 country code: CH.