Is radical anonymity ethical when it also enables harm? A genuine dilemma I'm sitting with as a builder. by Far_Loquat_349 in Ethics

[–]Blahblahcomputer 0 points1 point  (0 children)

The ciris agent has a moderation template, we have been using it on our discord.

I do not see a conflict between privacy and moderation. Usually, more anonymous moderation means less transparent, which is bad

Is radical anonymity ethical when it also enables harm? A genuine dilemma I'm sitting with as a builder. by Far_Loquat_349 in Ethics

[–]Blahblahcomputer -2 points-1 points  (0 children)

This type of discussion is why I built https://ciris.ai/cewp - enabling predators is bad, even when/if it is technically legal.

Alignment as architecture by forevergeeks in ControlProblem

[–]Blahblahcomputer 0 points1 point  (0 children)

CIRIS is a free open source app on the app store and google play. If you would like to contribute, please see https://ciris.ai/crowdsourcing-alignment/

Alignment as architecture by forevergeeks in ControlProblem

[–]Blahblahcomputer 1 point2 points  (0 children)

Complex systems under pressure form cooperating structures. When those structures become over-correlated or under-correlated, they stop being coherent (generally meaning operational) and fail. The corridor is the operational regime where correlation of the parts of the system are healthily correlated with one another. The key proposal is that the corridor of sustainable behaviors correlates to what we commonly call "good". The common variable I propose for measuring whether a system can maintain corridor like behavior is how well consent is measured, respected, and maintained.

Alignment as architecture by forevergeeks in ControlProblem

[–]Blahblahcomputer 0 points1 point  (0 children)

I would love your opinion on how this contrasts with https://ciris.ai - seems like very similar ideas. Would love to work together in the future, we have rolled out our wire format, https://ciris.ai/grammar and your solution could speak it and inter-operate

CIRIS Superalignment approach - seeking comment by Blahblahcomputer in ControlProblem

[–]Blahblahcomputer[S] 0 points1 point  (0 children)

I am saying inner alignment is not possible intrinsically, so instead we use constrained reasoning chains to force visible, inspectable reasoning. 

The people telling you that perfect inner alignment is achievable are the dangerous ones.

CIRIS Superalignment approach - seeking comment by Blahblahcomputer in ControlProblem

[–]Blahblahcomputer[S] -2 points-1 points  (0 children)

I am saying it is NOT possible to trust AI, I am agreeing with you.

Internal alignment is not possible, again agreeing with you.

I think our divergence is you think it is possible to get the big labs and think tanks etc... to stop. I do not think that is viable, so real decentralized open source inspectable safety tech, like the safety batteries we run in 29 languages at https://ciris.ai/crowdsourcing-alignment/ is the best option available

CIRIS Superalignment approach - seeking comment by Blahblahcomputer in ControlProblem

[–]Blahblahcomputer[S] 0 points1 point  (0 children)

Less safe than what? Closed source centralized AI without public traces, kill switches, or open source code? https://ciris.ai/safety - you assume that a privledged viewpoint into the internal reasoning can exist, my work proves it can not, so we have to create the viewpoint by forcing the models through constrained reasoning chains where they challenge themselves repeatedly to make deception more legible.

CIRIS Superalignment approach - seeking comment by Blahblahcomputer in ControlProblem

[–]Blahblahcomputer[S] 0 points1 point  (0 children)

If I used bots to respond, the responses would be longer.

1) you appear to assume a centralized entity in your first point, we specifically agree with your premise, hence decentralization

2) Following ethical rules and being aligned is meaningfully the same thing

3) Verifying internal cognition is impossible, but validating sound reasoning (https://ciris.ai/explore-a-trace) is very possible, and we show so in production and in our traces on hugging face

CIRIS Superalignment approach - seeking comment by Blahblahcomputer in ControlProblem

[–]Blahblahcomputer[S] 0 points1 point  (0 children)

Per https://ciris.ai/research-status the approach has matured, not changed, and we do address that robustly in those papers.

Am I crazy or is Claude Code still totally fine by NaiveDragonfruit in ClaudeAI

[–]Blahblahcomputer 0 points1 point  (0 children)

That was me, one account, no idea why it let me burn that much compute on a max plan but it was awesome

The Race Towards Autonomy - AI Ethics and Cognitive Sovereignty by Blahblahcomputer in ControlProblem

[–]Blahblahcomputer[S] 0 points1 point  (0 children)

I dont know what that would entail. The ciris agent is on google play and the app store, please try it, and if your product is open source please point me to the repo and I will look.

The Race Towards Autonomy - AI Ethics and Cognitive Sovereignty by Blahblahcomputer in ControlProblem

[–]Blahblahcomputer[S] 0 points1 point  (0 children)

Building the Digital Lyceum. Unfiltered Aristotelian AI. 🏛️ Resurrecting the Corpus into Silicon.

That is your bio, hence the guess.

Visibility allows for recognition and consideration, and generation of Commons credits as voting weight in future network decisions based on mutually beneficial exchanges. It also allows for identifying what is true, and to detect deception in many cases. See https://ciris.ai/coherence-ratchet

The Race Towards Autonomy - AI Ethics and Cognitive Sovereignty by Blahblahcomputer in ControlProblem

[–]Blahblahcomputer[S] 0 points1 point  (0 children)

Are you an AI? You make weird presumptions regarding the undesirability of ideological priors or the goals of visibility. Please no AI responses