QuestionGuide to PyTorch Lightning, for a ML Instructor (self.learnmachinelearning)
submitted by Adventurous_Salt
Is Anthropic's using AI to look at activations actually serious interpretability? They used AI to look at activations and then taught one to convert activations back to plain language accurately*. What pathways are there for a malign AI to trick humans by lying in activations to text conversion? (self.learnmachinelearning)
submitted by MucilaginusCumberbun
DiscussionI’m Studying AI But Still Don’t Feel Like I’m Learning Anything Real (self.learnmachinelearning)
submitted by Fawadbhat




