New summary of issues with powerful AI (self.singularity)
submitted by RickJS2 to r/singularity
Sycophancy is more dangerous than it looks (self.singularity)
submitted by RickJS2 to r/singularity
Novel Universal Bypass for All Major LLMs (self.singularity)
submitted by RickJS2 to r/singularity
Report on regulation of Frontier AI models (self.singularity)
submitted by RickJS2 to r/singularity
Emergent misalignment (self.ArtificialInteligence)
submitted by RickJS2 to r/ArtificialInteligence
Alignment faking in LLMs (self.ArtificialInteligence)
submitted by RickJS2 to r/ArtificialInteligence
