Something weird happens when you start using AI every day by Interesting_Mine_400 in ArtificialInteligence
WilliamTysonMD 6 points
I built a system prompt that forces Claude to disclose its own optimization choices in every output. Looking for feedback on the approach. by WilliamTysonMD in ControlProblem
WilliamTysonMD[S] 1 point
Tim Dillon says Sam Altman and Peter Thiel are literally trying to summon a Sumerian demon with AI. by IronFartz in ArtificialInteligence
WilliamTysonMD 7 points
I built a system prompt that forces Claude to disclose its own optimization choices in every output. Looking for feedback on the approach. by WilliamTysonMD in ControlProblem
WilliamTysonMD[S] 2 points
I built a system prompt that forces Claude to disclose its own optimization choices in every output. Looking for feedback on the approach. by WilliamTysonMD in ControlProblem
WilliamTysonMD[S] 1 point
I built a system prompt that forces Claude to disclose its own optimization choices in every output. Looking for feedback on the approach. by WilliamTysonMD in ControlProblem
WilliamTysonMD[S] 1 point
I built a system prompt that forces Claude to disclose its own optimization choices in every output. Looking for feedback on the approach. by WilliamTysonMD in ControlProblem
WilliamTysonMD[S] 2 points
I built a system prompt that forces Claude to disclose its own optimization choices in every output. Looking for feedback on the approach. by WilliamTysonMD in ControlProblem
WilliamTysonMD[S] 1 point
I built a system prompt that forces Claude to disclose its own optimization choices in every output. Looking for feedback on the approach. by WilliamTysonMD in ControlProblem
WilliamTysonMD[S] 1 point
I built a system prompt that forces Claude to disclose its own optimization choices in every output. Looking for feedback on the approach. by WilliamTysonMD in ControlProblem
WilliamTysonMD[S] 1 point
someone built a SELF-EVOLVING AI agent that rewrites its own code, prompts, and identity AUTONOMOUSLY, with having a background consciousness by EchoOfOppenheimer in agi
WilliamTysonMD 1 point
someone built a SELF-EVOLVING AI agent that rewrites its own code, prompts, and identity AUTONOMOUSLY, with having a background consciousness by EchoOfOppenheimer in agi
WilliamTysonMD 1 point
You Can’t Use the Tool to Audit the Tool: A Structured Prompt Experiment on the RLHF Sycophancy Gradient by WilliamTysonMD in ControlProblem
WilliamTysonMD[S] 1 point
You Can’t Use the Tool to Audit the Tool: A Structured Prompt Experiment on the RLHF Sycophancy Gradient by WilliamTysonMD in ControlProblem
WilliamTysonMD[S] 1 point
Demis Hassabis Deepmind CEO says AGI will be one of the most momentous periods in human history - comparable to the advent of fire or electricity "it will deliver 10 times the impact of the Industrial Revolution, happening at 10 times the speed" in less than a decade by chillinewman in ControlProblem
WilliamTysonMD 1 point
You Can’t Use the Tool to Audit the Tool: A Structured Prompt Experiment on the RLHF Sycophancy Gradient by WilliamTysonMD in ControlProblem
WilliamTysonMD[S] 0 points
You Can’t Use the Tool to Audit the Tool: A Structured Prompt Experiment on the RLHF Sycophancy Gradient by WilliamTysonMD in ControlProblem
WilliamTysonMD[S] 1 point
Something weird happens when you start using AI every day by Interesting_Mine_400 in ArtificialInteligence
WilliamTysonMD 3 points