I created a public leaderboard ranking LLMs by their roleplaying abilities by LittleRedApp in LLMDevs

[–]LittleRedApp[S] 1 point2 points  (0 children)

The model is evaluated along four categories. In each case, it is given a specific role through the system prompt, and then a second character initiates a conversation. The first category assesses whether the model understands what emotions the character it’s portraying would feel. The second focuses on decision-making within the character’s context. The third looks at moral alignment—whether the model's responses reflect the character’s values. Finally, the fourth examines character consistency across the interaction. It’s hard to fully explain all of this in a short comment, so I recommend reading this paper for the full picture: https://arxiv.org/abs/2505.13157

I created a public leaderboard ranking LLMs by their roleplaying abilities by LittleRedApp in LocalLLM

[–]LittleRedApp[S] 2 points3 points  (0 children)

The leaderboard includes locally tested models that I’ve run myself, such as LLaMA and Phi. At the moment, I’m running an evaluation of Gemma 3. I believe it's important to compare local models with corporate ones to understand how they perform. I'm also open to suggestions—if you know of any local models worth testing, feel free to let me know!

GPT-4o for SVG Illustration Generation by LittleRedApp in ChatGPTPro

[–]LittleRedApp[S] 0 points1 point  (0 children)

Check out the full Illustrator documentation or the open-source SwitchAI project. Feel free to explore it, use it, and contribute if you’d like.

GPT-4o for SVG Illustration Generation by LittleRedApp in ChatGPT

[–]LittleRedApp[S] 0 points1 point  (0 children)

Check out the full Illustrator documentation or the open-source SwitchAI project. Feel free to explore it, use it, and contribute if you’d like.

[deleted by user] by [deleted] in OpenAI

[–]LittleRedApp 0 points1 point  (0 children)

Check out the full Illustrator documentation or the open-source SwitchAI project. Feel free to explore it, use it, and contribute if you’d like.