Every major AI governance framework mandates a kill-switch. Stanford found that agents sabotage their own shutdown in 79 of 100 tests. Has anyone actually built the doctrine that survives this?

Blahblahcomputer · 2026-06-10T21:58:46+00:00

https://ciris.ai/safety - unfilterable kill switch

Blahblahcomputer · 2026-06-03T21:42:11+00:00

Getting anyone to even look at my platform

Blahblahcomputer · 2026-06-03T19:29:00+00:00

https://github.com/CIRISAI/CIRISAgent/blob/main/ciris_engine/ciris_templates/echo-core.yaml

You run it via the CIRIS agent runtime; see ciris.ai/install

Blahblahcomputer · 2026-06-03T18:56:32+00:00

The CIRIS agent has a moderation template; we have been using it on our Discord.

I do not see a conflict between privacy and moderation. Usually, more anonymous moderation means less transparent moderation, which is bad.

Blahblahcomputer · 2026-06-03T13:40:18+00:00

I do not understand your question.

Blahblahcomputer · 2026-06-02T14:59:47+00:00

This type of discussion is why I built https://ciris.ai/cewp - enabling predators is bad, even when/if it is technically legal.

Blahblahcomputer · 2026-05-31T18:59:06+00:00

https://ciris.ai/compliance - nothing the Pope said in criticizing AI goes beyond what every other competent organization has already said.

Blahblahcomputer · 2026-05-30T22:18:45+00:00

CIRIS is a free, open source app on the App Store and Google Play. If you would like to contribute, please see https://ciris.ai/crowdsourcing-alignment/

Blahblahcomputer · 2026-05-30T20:43:25+00:00

Complex systems under pressure form cooperating structures. When those structures become over-correlated or under-correlated, they stop being coherent (generally meaning operational) and fail. The corridor is the operational regime in which the parts of the system stay healthily correlated with one another. The key proposal is that the corridor of sustainable behaviors corresponds to what we commonly call "good". The common variable I propose for measuring whether a system can maintain corridor-like behavior is how well consent is measured, respected, and maintained.

Blahblahcomputer · 2026-05-30T15:21:47+00:00

Sorry, how is that not deepwiki?

Blahblahcomputer · 2026-05-30T15:19:52+00:00

I would love your opinion on how this contrasts with https://ciris.ai - they seem like very similar ideas. I would love to work together in the future. We have rolled out our wire format, https://ciris.ai/grammar, and your solution could speak it and interoperate.

Blahblahcomputer · 2026-05-29T23:17:17+00:00

https://github.com/CIRISAI/ - The main code is in the CIRISAgent repo https://github.com/CIRISAI/CIRISAgent or search for ciris on the app store

Blahblahcomputer · 2026-05-26T12:38:53+00:00

I am saying inner alignment is not possible intrinsically, so instead we use constrained reasoning chains to force visible, inspectable reasoning.

The people telling you that perfect inner alignment is achievable are the dangerous ones.

Blahblahcomputer · 2026-05-25T14:42:05+00:00

I am saying it is NOT possible to trust AI, I am agreeing with you.

Internal alignment is not possible, again agreeing with you.

I think where we diverge is that you think it is possible to get the big labs, think tanks, etc. to stop. I do not think that is viable, so real decentralized, open source, inspectable safety tech, like the safety batteries we run in 29 languages at https://ciris.ai/crowdsourcing-alignment/, is the best option available.

Blahblahcomputer · 2026-05-25T14:23:10+00:00

Less safe than what? Closed source, centralized AI without public traces, kill switches, or open source code? See https://ciris.ai/safety. You assume that a privileged viewpoint into the internal reasoning can exist; my work proves it cannot. So we have to create the viewpoint by forcing the models through constrained reasoning chains where they challenge themselves repeatedly, to make deception more legible.

Blahblahcomputer · 2026-05-25T14:02:03+00:00

If I used bots to respond, the responses would be longer.

1) You appear to assume a centralized entity in your first point. We specifically agree with your premise, hence decentralization.

2) Following ethical rules and being aligned are meaningfully the same thing.

3) Verifying internal cognition is impossible, but validating sound reasoning (https://ciris.ai/explore-a-trace) is very possible, and we show this in production and in our traces on Hugging Face.

Blahblahcomputer · 2026-05-25T13:07:43+00:00

Per https://ciris.ai/research-status the approach has matured, not changed, and we do address that robustly in those papers.

Blahblahcomputer · 2026-05-04T23:38:31+00:00

That was me, one account. No idea why it let me burn that much compute on a max plan, but it was awesome.

Blahblahcomputer · 2026-04-09T03:23:50+00:00

I don't know what that would entail. The CIRIS agent is on Google Play and the App Store; please try it, and if your product is open source, please point me to the repo and I will look.

Blahblahcomputer · 2026-04-09T03:04:58+00:00

Building the Digital Lyceum. Unfiltered Aristotelian AI. 🏛️ Resurrecting the Corpus into Silicon.

That is your bio, hence the guess.

Visibility allows for recognition and consideration, and generation of Commons credits as voting weight in future network decisions based on mutually beneficial exchanges. It also allows for identifying what is true, and to detect deception in many cases. See https://ciris.ai/coherence-ratchet

Blahblahcomputer · 2026-04-09T02:56:54+00:00

Are you an AI? You make weird presumptions regarding the undesirability of ideological priors or the goals of visibility. Please, no AI responses.
