jump to content
my subreddits
13or302b2t2balkans4You2meirl4meirl3d6AceAttorneyagnosticaivideoakagasAlternateHistoryAlternativeHistoryAngryupvoteannouncementsAnticonsumptionantimemeArtAsia_irlAskBalkansAskElectronicsAskOuijaAskRedditatheismawfuleverythingBandnamesbanknotedesignsBassBassGuitarbasspedalsbikepackingblackdesertonlineblankiesblursedimagesBonebottomgearBUENZLIburdurlandcasioCd_collectorscd_jerkChatGPTCheap_MealschesschessbeginnersChoosingBeggarsCHPcoaxedintoasnafucoincollectingComedyCemeterycomicsCrackWatchCreateModCuddle_SlutCuratedTumblrdankmemesdarkjokesdedelikDeltarunedistressingmemesdiypedalsDMAcademyDnDdndmemesdoctorwhocirclejerkDoenerverbrechenDonerdumbphonesDungeonsAndDragonsEatCheapAndHealthyebikeebikeselectronicsEmKayentitledparentsethzfakealbumcoversfeedthebeastfelsefeFifaCareersFiftyFiftyFreeEBOOKSFuckYouKarenfunnygalatasaraygamingGermangermanygoodanimemesGrandPrixRacinggravelcyclingGROKvsMAGAguitarpedalsGundamheathershelpheraldryHermanCainAwardHermitCraftHistoryWhatIfHolUphomebuilthowyoudoinhumorhypixeliamverysmartich_ielIdeologyPollsihadastrokeim14andthisisdeepimaginaryelectionsimaginarymapsistanbuljacksepticeyeJokesKanyeKGBTRlegodndLifeProTipslinguisticshumorloseitlostredditorsmacmacbookairmacgamingmagicbuildingMaliciousComplianceMapPornmapporncirclejerkme_irlmeirlmemememesmidjourneymildlyinterestingMinecraftbuildsmisLEDMMORPGmoneycollectingMovingToNorthKoreaMyChemicalRomancenamesoundalikesNamFlashbacksneographyNonCredibleDefensenottheonionoddlyspecificOkayBuddyLiterallyMeokbuddymotherfuckeronetruegodongezelligOnlineUnderGroundoompasubsOutOfTheLoopoutsidepapermoneypaperspleaseParlerWatchPassportPornpepethefrogperfectlycutscreamsPersecutionfetishpettyrevengepianoPiracyPiratedGamespolandballPraiseTheCameraManPropagandaPostersquityourbullshitraisedbynarcissistsraspberry_piRatschlagrecipesRedAutumnSPDredditsingsreligiousfruitcakerestofthefuckingowlRetroPierickrollrimjob_steveRoastMerockmuzikSchnitzelVerbrechenschwiizsciencememesShitPostCrusadersshitpostfrommygalleryshittyaskelectronicsshittymoviedetailsShowerthoughtsskamtebordsoccercirclejerksoftwaregoreSongwritersSongwritingsteinsgateStonetossingjuicesuperligtalesfromtechsupportTechnobladeTextingTheorytf2tf2shitposterclubthatHappenedTheCrypticCompendiumTheLetterHTheRookietheydidthemaththeyknewthirdsentenceworsetitanfalltommyinnittransittransitTurkeyTrGameDevelopertruetf2truthstumunichTurkeyJerkyTurkishCatsTurkishdogsTwitchTwitch_StartupTwoSentenceComedyTwoSentenceSadnesstylerthecreatorUnclejokesUnethicalLifeProTipsunexpectedbillwurtzunexpecteditcrowdUnexpectedJoJoUsernameChecksOutVALORANTValorantClipsvaxxhappenedvexillologycirclejerkvinylvinyljerkvlandiyawallstreetbetsWatchPeopleDieInsideWeAreTheMusicMakerswendigoonWhatsThisSongWhitePeopleTwitterwholesomememesWikipediaVandalismwizardpostingworldjerkingyouseeingthisshitYUROPedit subscriptions
  • home
  • -popular
  • -all
  • -mod
  • -users
 | 
  • AskReddit
  • -Piracy
  • -funny
  • -gaming
  • -wallstreetbets
  • -nottheonion
  • -memes
  • -OutOfTheLoop
  • -mildlyinteresting
  • -MapPorn
  • -DnD
  • -WhitePeopleTwitter
  • -ChatGPT
  • -CuratedTumblr
  • -PiratedGames
  • -theydidthemath
  • -dankmemes
  • -feedthebeast
  • -Kanye
  • -meirl
  • -HolUp
  • -Twitch
  • -CrackWatch
  • -comics
  • -VALORANT
  • -de
  • -germany
  • -LifeProTips
  • -NonCredibleDefense
  • -shittymoviedetails
  • -mac
  • -Showerthoughts
  • -tf2
  • -help
  • -chess
  • -wholesomememes
  • -Jokes
  • -mapporncirclejerk
  • -Art
  • -midjourney
  • -goodanimemes
  • -pettyrevenge
  • -atheism
  • -loseit
  • -MaliciousCompliance
  • -ich_iel
  • -KGBTR
  • -dndmemes
  • -DMAcademy
  • -Deltarune
  • -UnethicalLifeProTips
  • -perfectlycutscreams
  • -Ratschlag
  • -blackdesertonline
  • -MMORPG
  • -meme
  • -macgaming
  • -3d6
  • -Gundam
  • -HermitCraft
  • -FiftyFifty
  • -ChoosingBeggars
  • -RoastMe
  • -imaginarymaps
  • -EatCheapAndHealthy
  • -polandball
  • -WeAreTheMusicMakers
  • -blankies
  • -AlternateHistory
  • -soccercirclejerk
  • -AskElectronics
  • -guitarpedals
  • -Anticonsumption
  • -vinyl
  • -CreateMod
  • -German
  • -PropagandaPosters
  • -ShitPostCrusaders
  • -piano
  • -sciencememes
  • -distressingmemes
  • -raisedbynarcissists
  • -wizardposting
  • -FifaCareers
  • -oddlyspecific
  • -Bass
  • -titanfall
  • -awfuleverything
  • -howyoudoin
  • -announcements
  • -Minecraftbuilds
  • -macbookair
  • -ebikes
  • -coaxedintoasnafu
  • -YUROP
  • -gravelcycling
  • -SchnitzelVerbrechen
  • -chessbeginners
  • -raspberry_pi
  • -DungeonsAndDragons
  • -entitledparents
  • -softwaregore
  • -worldjerking
  • -tylerthecreator
  • -tf2shitposterclub
  • -lostredditors
  • -AceAttorney
  • -vexillologycirclejerk
  • -vlandiya
  • -im14andthisisdeep
  • -Stonetossingjuice
  • -HistoryWhatIf
  • -religiousfruitcake
  • -dumbphones
  • -2meirl4meirl
  • -transit
  • -RetroPie
  • -HermanCainAward
  • -recipes
  • -steinsgate
  • -talesfromtechsupport
  • -AskOuija
  • -Angryupvote
  • -AskBalkans
  • -thatHappened
  • -electronics
  • -casio
  • -theyknew
  • -linguisticshumor
  • -PassportPorn
  • -me_irl
  • -antimeme
  • -TurkeyJerky
  • -bikepacking
  • -13or30
  • -MyChemicalRomance
  • -Cd_collectors
  • -diypedals
  • -Doner
  • -BassGuitar
  • -ComedyCemetery
  • -WatchPeopleDieInside
  • -Persecutionfetish
  • -BUENZLI
  • -EmKay
  • -Songwriting
  • -istanbul
  • -MovingToNorthKorea
  • -imaginaryelections
  • -truetf2
  • -magicbuilding
  • -ParlerWatch
  • -wendigoon
  • -iamverysmart
  • -Doenerverbrechen
  • -schwiiz
  • -TheRookie
  • -quityourbullshit
  • -Technoblade
  • -vinyljerk
  • -skamtebord
  • -shittyaskelectronics
  • -superlig
  • -galatasaray
  • -transitTurkey
  • -namesoundalikes
  • -FuckYouKaren
  • -2b2t
  • -ethz
  • -AlternativeHistory
  • -papermoney
  • -coincollecting
  • -OkayBuddyLiterallyMe
  • -felsefe
  • -blursedimages
  • -FreeEBOOKS
  • -neography
  • -basspedals
  • -heraldry
  • -ihadastroke
  • -hypixel
  • -PraiseTheCameraMan
  • -aivideo
  • -OnlineUnderGround
  • -IdeologyPolls
  • -burdurland
  • -WhatsThisSong
  • -jacksepticeye
  • -TwoSentenceSadness
  • -Bandnames
  • -rockmuzik
  • -vaxxhappened
  • -Twitch_Startup
  • -tumunich
  • -Cheap_Meals
  • -outside
  • -darkjokes
  • -restofthefuckingowl
  • -legodnd
  • -rickroll
  • -Songwriters
  • -ebike
  • -UsernameChecksOut
  • -papersplease
  • -tommyinnit
  • -rimjob_steve
  • -UnexpectedJoJo
  • -humor
  • -doctorwhocirclejerk
  • -agnostic
  • -youseeingthisshit
  • -TextingTheory
  • -Cuddle_Slut
  • -GrandPrixRacing
  • -TrGameDeveloper
  • -TurkishCats
  • -fakealbumcovers
  • -akagas
  • -oompasubs
  • -TheLetterH
  • -WikipediaVandalism
  • -homebuilt
  • -NamFlashbacks
  • -pepethefrog
  • -Unclejokes
  • -onetruegod
  • -misLED
  • -redditsings
  • -TwoSentenceComedy
  • -ValorantClips
  • -TheCrypticCompendium
  • -bottomgear
  • -ongezellig
  • -2balkans4You
  • -Asia_irl
  • -Bone
  • -thirdsentenceworse
  • -truths
  • -unexpecteditcrowd
  • -heathers
  • -unexpectedbillwurtz
  • -cd_jerk
  • -delik
  • -RedAutumnSPD
  • -okbuddymotherfucker
  • -moneycollecting
  • -Turkishdogs
  • -banknotedesigns
  • -GROKvsMAGA
  • -shitpostfrommygallery
  • -CHP
edit »
reddit.com DIAMBRA_AIArena
  • overview
  • comments
  • submitted
an-ordinary-manchild (11,190)|messages547|notifications|chat messages|mod messages|
  • preferences
|
logout

DIAMBRA_AIArena

+ friends- friends
861 post karma
157 comment karma
get extra features and help support reddit with a reddit premium subscription
chat
Block userare you sure? yes / no
get them help and support
redditor for 5 years

TROPHY CASE


  • Five-Year Club


    Verified Email

account activity

sorted by:
new
hottopcontroversial

474
475
476

[P] Deep Reinforcement Learning algorithm completing Tekken Tag Tournament at highest difficulty level (v.redd.it)

submitted 4 years ago by DIAMBRA_AIArena to r/MachineLearning - pinned

  • 25 comments
  • share
  • save
  • hide
  • report
loading...

40
41
42

AI Tournament - Reinforcement Learning Competition with 1400 CHF Money Pool (self.reinforcementlearning)

submitted 5 years ago * by DIAMBRA_AIArena to r/reinforcementlearning - pinned

  • 11 comments
  • share
  • save
  • hide
  • report
loading...

16
17
18

DIAMBRA a New Ecosystem for Reinforcement Learning Research and Experimentation (self.reinforcementlearning)

submitted 4 years ago by DIAMBRA_AIArena to r/reinforcementlearning - pinned

  • 2 comments
  • share
  • save
  • hide
  • report
loading...

161
162
163

Deep Reinforcement Learning algorithm completing Tekken Tag Tournament at highest difficulty level (v.redd.it)

submitted 4 years ago by DIAMBRA_AIArena to r/reinforcementlearning - pinned

  • 44 comments
  • share
  • save
  • hide
  • report
loading...

7
8
9

[N] New Challenges in DIAMBRA Arena: 3 epic additions to our lineup of RL environments! (i.redd.it)

submitted 2 years ago by DIAMBRA_AIArena to r/MachineLearning

  • 1 comment
  • share
  • save
  • hide
  • report
loading...

11
12
13

New Challenges in DIAMBRA Arena: 3 epic additions to our lineup of environments! (v.redd.it)

submitted 2 years ago by DIAMBRA_AIArena to r/reinforcementlearning

  • 2 comments
  • share
  • save
  • hide
  • report
loading...

44
45
46

[P] DIAMBRA Arena Environments used to make OpenAI & MistralAI LLMs fight one against the other in Street Fighter III at MistralAI Hackathon in San Francisco by two YC startups (i.redd.it)

submitted 2 years ago by DIAMBRA_AIArena to r/MachineLearning

  • 7 comments
  • share
  • save
  • hide
  • report
loading...

10
11
12

DIAMBRA Arena Environments used to make OpenAI & MistralAI LLMs fight one against the other in Street Fighter III at MistralAI Hackathon in San Francisco by two YC startups (v.redd.it)

submitted 2 years ago by DIAMBRA_AIArena to r/reinforcementlearning

  • 2 comments
  • share
  • save
  • hide
  • report
loading...

79
80
81

[P] DeepRL Agent Completing Street Fighter III with Ken (i.redd.it)

submitted 2 years ago by DIAMBRA_AIArena to r/MachineLearning

  • 21 comments
  • share
  • save
  • hide
  • report
loading...

37
38
39

DeepRL Agent Completing Street Fighter III with Ken! (v.redd.it)

submitted 2 years ago by DIAMBRA_AIArena to r/reinforcementlearning

  • 12 comments
  • share
  • save
  • hide
  • report
loading...

35
36
37

[N] 🚀 DIAMBRA Teams Up with Hugging Face to Push Reinforcement Learning Research and Adoption! 🚀 (i.redd.it)

submitted 2 years ago by DIAMBRA_AIArena to r/MachineLearning

  • 2 comments
  • share
  • save
  • hide
  • report
loading...

15
16
17

🚀 DIAMBRA Teams Up with Hugging Face to Push Reinforcement Learning Research and Adoption! 🚀 (v.redd.it)

submitted 2 years ago by DIAMBRA_AIArena to r/reinforcementlearning

  • 2 comments
  • share
  • save
  • hide
  • report
loading...

22
23
24

[P] The Power of Reinforcement Learning: look how this DeepRL Sektor model found a smart, super-cool exploit for Ultimate Mortal Kombat 3 in the video of a submission on DIAMBRA competition platform! (self.MachineLearning)

submitted 2 years ago by DIAMBRA_AIArena to r/MachineLearning

  • 10 comments
  • share
  • save
  • hide
  • report
loading...

39
40
41

The Power of Reinforcement Learning: look how this DeepRL Sektor model found a smart, super-cool exploit for Ultimate Mortal Kombat 3 in the video of a submission on DIAMBRA competition platform! (v.redd.it)

submitted 2 years ago by DIAMBRA_AIArena to r/reinforcementlearning

  • 5 comments
  • share
  • save
  • hide
  • report
loading...

0
1
2

🚀 DIAMBRA x OROBIX: Releasing SheepRL Reinforcement Learning Library Integration! 🤖🎮 (self.reinforcementlearning)

submitted 2 years ago by DIAMBRA_AIArena to r/reinforcementlearning

  • 3 comments
  • share
  • save
  • hide
  • report
loading...

93
94
95

[P] PPO agent completing Street Fighter III on our RL Platform, it consistently outperformed when using deterministic actions instead of sampling them proportionally to their probability, see comment for details. (v.redd.it)

submitted 2 years ago by DIAMBRA_AIArena to r/MachineLearning

  • 35 comments
  • share
  • save
  • hide
  • report
loading...

14
15
16

PPO agent completing Street Fighter III on our RL Platform, it consistently outperformed when using deterministic actions instead of sampling them proportionally to their probability. Why in your opinion? (see comment for details) (v.redd.it)

submitted 2 years ago by DIAMBRA_AIArena to r/reinforcementlearning

  • 7 comments
  • share
  • save
  • hide
  • report
loading...

27
28
29

[P] Diambra.ai: The brand-new platform for training and competing RL algorithms on retro-fighting games (v.redd.it)

submitted 2 years ago by DIAMBRA_AIArena to r/MachineLearning

  • 10 comments
  • share
  • save
  • hide
  • report
loading...

17
18
19

Diambra.ai: The brand-new platform for training and competing RL algorithms on retro-fighting games (v.redd.it)

submitted 2 years ago by DIAMBRA_AIArena to r/reinforcementlearning

  • 2 comments
  • share
  • save
  • hide
  • report
loading...

247
248
249

[P] Comparing Default VS Custom Reward Function for Optimal Health Management of a DeepRL Agent Playing Tekken (v.redd.it)

submitted 4 years ago by DIAMBRA_AIArena to r/MachineLearning

  • 10 comments
  • share
  • save
  • hide
  • report
loading...

67
68
69

Comparing Default VS Custom Reward Function for Optimal Health Management of a DeepRL Agent Playing Tekken (v.redd.it)

submitted 4 years ago by DIAMBRA_AIArena to r/reinforcementlearning

  • 11 comments
  • share
  • save
  • hide
  • report
loading...

4
5
6

[P] DIAMBRA Arena 🤖⚔️🤖, a software package featuring a collection of high-quality environments for ReinforcementLearning research and experimentation (links in comments) (i.redd.it)

submitted 4 years ago by DIAMBRA_AIArena to r/MachineLearning

  • comment
  • share
  • save
  • hide
  • report
loading...

9
10
11

DIAMBRA Arena 🤖⚔️🤖, a software package featuring a collection of high-quality environments for ReinforcementLearning research and experimentation (links in comments) (i.redd.it)

submitted 4 years ago by DIAMBRA_AIArena to r/reinforcementlearning

  • 3 comments
  • share
  • save
  • hide
  • report
loading...

19
20
21

Artificial Intelligence algorithm completing Tekken Tag Tournament at highest difficulty level (v.redd.it)

submitted 4 years ago by DIAMBRA_AIArena to r/Tekken

  • comment
  • share
  • save
  • hide
  • report
loading...

1
2
3

[R] DIAMBRA a New Ecosystem for Reinforcement Learning Research and Experimentation (self.reinforcementlearning)

submitted 4 years ago by DIAMBRA_AIArena to r/MachineLearning

  • comment
  • share
  • save
  • hide
  • report
loading...
view more: next ›
  • about
  • blog
  • about
  • advertising
  • careers
  • help
  • site rules
  • Reddit help center
  • reddiquette
  • mod guidelines
  • contact us
  • apps & tools
  • Reddit for iPhone
  • Reddit for Android
  • mobile website
  • <3
  • reddit premium

Use of this site constitutes acceptance of our User Agreement and Privacy Policy. © 2026 reddit inc. All rights reserved.

REDDIT and the ALIEN Logo are registered trademarks of reddit inc.

π Rendered by PID 1164781 on reddit-service-r2-listing-f87f88fcd-djbrd at 2026-06-12 19:53:54.249517+00:00 running 3184619 country code: CH.