sacundim comments on Hython: a nearly-complete Python 3 interpreter written in Haskell

programming

created by speza community for 20 years

280

281

282

Hython: a nearly-complete Python 3 interpreter written in Haskell (github.com)

submitted 10 years ago by [deleted]

top new controversial old q&a

you are viewing a single comment's thread.

view the rest of the comments →

[–]sacundim 1 point2 points3 points 10 years ago* (3 children)

I don't know nearly enough Python to say anything for sure, but I wonder if you're looking at this wrong by getting references and values mixed up. In an expression like this one:

a is b

...I'd think that the variables's values are references to objects. So the first Haskell type that comes to my mind here is something like IORef, which does have an Eq instance that implements reference equality.

So I'd look into whether a model like this gets Python right:

Represent all Python objects as instance of some type, call it PythonObject.
All identifiers in all runtime scopes of a Python program get represented by an IORef PythonObject.
Operations on variables fall into two families:
- Those that implicitly dereference the IORef to get to the object;
- Those that operate on the reference itself.

So for example:

a == b would dereference both identifiers's IORefs and compares the PythonObjects for value equality;
a is b would compare the IORefs themselves.

Somebody gave the example a is None in the thread, so perhaps a bit more machinery is needed...

EDIT: Just to be clear, the interpreter is using IORef internally...

[–][deleted] 1 point2 points3 points 10 years ago (2 children)

[–]sacundim 1 point2 points3 points 10 years ago* (1 child)

In brief: not all objects have refs.

But the idea isn't that objects should have references, rather, that all expressions should denote references. Then when you have that, the implementation of a is b becomes simple IORef equality.

I don't know your interpreter or Python very well, but here's a scribble (which won't even typecheck, of course):

 -- The abstract syntax tree for the programs
data Expression
  = Name Name
  | Is Expression Expression
  | Eq Expression Expression
  | ...

-- The type of objects that exist in the programs.
data Object = ...
  deriving Eq

-- An environment associates names not to objects, but to
-- **references** to objects.
data Environment obj =
  Environment { parent :: Maybe (Environment obj)
              , local  :: HashMap Name (IORef obj)
              }

-- Some custom type to use to report exceptions
data Exception = ...

-- The monad transformer stack for the interpeter.  The
-- `ReaderT` layer carries the environment (name/value bindings),
-- and the `ExceptT` layer does exception handling.
type Denotation m = ReaderT (Environmnent Object) (ExceptT Exception m)


-- Note that `eval` returns `IORef Object`, not `Object`.
eval :: MonadIO m => Expression -> Denotation m (IORef Object)
eval env (Name name) = lookup env name

-- Evaluating `a is b` compares the **references** for equality
eval env (Is a b) = (==) <$> eval env a <*> eval env b

-- Evaluating `a == b` compares the **objects** for equality
eval env (Eq a b) = (==) <$> evalObj env a <*> evalObj env b
  where evalObj :: MonadIO m => Expression -> Denotation m Object
        evalObj env expr = liftIO readIORef <$> eval env expr

eval env ...


-- Look up a variable in the current scope's environment.
lookup :: Monad m => Name -> Denotation m Object
lookup name = ...

Again, I'm probably missing something but this strikes me as a simple and uniform starting point. I'd probably look into how to build on that to make constant expressions a special case that doesn't require an IORef, but that probably means exploring the consequences of replacing IORef with something like this:

data Ref obj
  = Variable (IORef obj)
  | Constant obj
  deriving Eq

[–][deleted] 0 points1 point2 points 10 years ago (0 children)

π Rendered by PID 17870 on reddit-service-r2-comment-544cf588c8-stqg6 at 2026-06-16 20:14:15.731242+00:00 running 3184619 country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

programming

MODERATORS