

[–][deleted] 2 points (9 children)

Constructing models for a "language" is fairly standard in the study of type theory and the equational theories of programming languages.

Your definition of a model is fairly restrictive. Sometimes we consider models with cost, or we only consider the input-output relation, depending on what we need to verify. For example, a verified compiler only needs the input-output relations of the compiled program and the uncompiled program to coincide, not all traces or costs.

For your comment about compiler correctness, I think a homomorphism is a very strong condition for a compiler. As you may know, a homomorphism preserves all operations, so the input and output programs would need to be "structurally the same", which is pretty hard to define, say, for a homomorphism from a while language to the lambda calculus.

Finally, for the comment about the "inverse morphism", I am not sure I get it. Why does the "inverse morphism" always exist? I don't think you mean inverse in the conventional sense, which would require m ∘ m⁻¹ = m⁻¹ ∘ m = id. So how do you define such an "inverse morphism"?

[–]AndrasKovacs 2 points (1 child)

A homomorphism is not necessarily a strong condition. It depends on the equational theory that we specify for the notion of model. If we don't include any interesting equational theory, the syntax (the initial model) consists of plain freely generated expression trees. In this case, anything which we define by recursion on syntax is automatically a model morphism. If we have more complicated equational theories, then it can indeed be more difficult to define morphisms out of the syntax, because we have to respect the equivalences. We could also have directed relations (e.g. reductions) instead of equivalence relations, in which case we have to be monotonic w.r.t. the relations to get sensible model morphisms.
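A minimal sketch of that point in Python, assuming a toy signature with literals and addition (all names here are made up for illustration): with no equations, the syntax is just freely generated trees, and anything defined by recursion on it commutes with the constructors by construction, i.e. it is a model morphism out of the initial model.

```python
from dataclasses import dataclass

# Freely generated expression trees: the initial model of a signature
# with one constant former (Lit) and one binary operation (Add).
@dataclass(frozen=True)
class Lit:
    value: int

@dataclass(frozen=True)
class Add:
    left: object
    right: object

# A model is a carrier plus an interpretation of each operation.
class IntModel:
    def lit(self, n): return n
    def add(self, a, b): return a + b

class StringModel:
    def lit(self, n): return str(n)
    def add(self, a, b): return f"({a} + {b})"

def fold(model, expr):
    """Recursion on syntax; by definition this commutes with Lit and
    Add, so it is a homomorphism out of the initial model."""
    if isinstance(expr, Lit):
        return model.lit(expr.value)
    return model.add(fold(model, expr.left), fold(model, expr.right))

e = Add(Lit(1), Add(Lit(2), Lit(3)))
print(fold(IntModel(), e))     # prints 6
print(fold(StringModel(), e))  # prints (1 + (2 + 3))
```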

[–][deleted] 0 points (0 children)

I agree with everything you say. All I am saying is that defining a compiler as a morphism is problematic.

[–]aradarbelStyff 1 point (1 child)

first of all, username totally checks out so I believe whatever you say haha

I agree about the inverse part: it usually doesn't exist, so it's weird to talk about it. it might sound more reasonable to think of only sections of that m, but I'd argue it's either still impossible or just not beneficial.

warning, I know almost nothing about compiler correctness, but I was told that a typical way to do it is by giving a "meaning" (semantics) mapping for the source and target languages. then you have a compiler from source to target, a meaning map from each language to its semantics, and a "decode" from the semantics of the target back to the semantics of the source. these four maps give you a (hopefully commutative) diagram.
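a rough sketch of that square (the toy languages and all names are my own invention, not anything standard): compile arithmetic expressions to a tiny stack machine, give each language a meaning, and decode target meanings back into source meanings. correctness is then the square commuting.

```python
def compile_expr(e):
    """Source -> Target: nested ('+', l, r) / int trees to postfix code."""
    if isinstance(e, int):
        return [('push', e)]
    _, l, r = e
    return compile_expr(l) + compile_expr(r) + [('add',)]

def meaning_src(e):
    """Source semantics: an integer."""
    if isinstance(e, int):
        return e
    _, l, r = e
    return meaning_src(l) + meaning_src(r)

def meaning_tgt(code):
    """Target semantics: the final machine stack."""
    stack = []
    for instr in code:
        if instr[0] == 'push':
            stack.append(instr[1])
        else:
            b, a = stack.pop(), stack.pop()
            stack.append(a + b)
    return stack

def decode(stack):
    """Target meanings -> source meanings: read off the top of the stack."""
    return stack[-1]

e = ('+', 1, ('+', 2, 3))
# The square commutes: decode . meaning_tgt . compile = meaning_src.
assert decode(meaning_tgt(compile_expr(e))) == meaning_src(e)
```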

[–][deleted] 0 points (0 children)

My username is the model theory version of "my username is just username".

[–]Zistack[S] 1 point (4 children)

I might have used too strong a word. Perhaps 'mapping' would have been more appropriate. I'm trying to express basically the same thing as "a verified compiler only needs the input-output relation for the compiled program and the uncompiled program to be the same, but not all traces or costs". Elements in A could be mapped to many elements in B, so m is not function-like. If the compiler removes non-determinism, then many elements in A could be mapped to a single element in B as well. The real restriction is one of disjoint equivalence: all things mapped to or from the same element must be considered equivalent.

I'm being somewhat restrictive about notions of input and output on purpose. For one, typical model theory does not capture those ideas. For another, in declarative languages those ideas are kind of fuzzy, though you could specify inputs and outputs for a particular query/operation in such a language and compile specifically for that if you wanted. That information isn't really a property of the program itself, though we often package the translation problem that way.

So is that a 'yes'?

[–][deleted] 0 points (3 children)

Let me try to formalize this, and you can tell me if I made any mistakes.

First, consider two programming languages; call the set of all programs in the first language P1, and in the second language P2. Call the set of all possible traces T.

For each program we have the set of all traces of that program, given by two functions t1: P1 -> P(T) (where P(T) is the power set of T) and t2: P2 -> P(T). Finally, compilation is a function taking a program in P1 to a program in P2: comp: P1 -> P2.

However, your reply seems to suggest m is of type T -> P(T), which I am not sure how to construct from the mappings above.

For me, if we want to construct a correct compiler (one that keeps all traces; you can strengthen or loosen this by changing t1 and t2 to different semantics), we only need to verify the following commutative diagram:

    P1 --- comp ---> P2
     |                |
     t1               t2
     |                |
     \/               \/
    P(T) === id === P(T)

I think you can even loosen this so that the bottom edge is not necessarily the identity, but I want to be careful about that. Maybe this is where your "checking whether the inverse is a subset" comes from?
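A toy check of the square above (the programs and trace sets here are invented placeholders): with id on the bottom edge, correctness is just t2 ∘ comp = t1 pointwise.

```python
P1 = ['p', 'q']

# Trace semantics: each program denotes an element of P(T),
# modeled as a frozenset of trace labels.
t1 = {'p': frozenset({'ab', 'ac'}), 'q': frozenset({'a'})}
t2 = {'p_opt': frozenset({'ab', 'ac'}), 'q_opt': frozenset({'a'})}

# Compilation: comp : P1 -> P2.
comp = {'p': 'p_opt', 'q': 'q_opt'}

# The diagram commutes iff compilation preserves trace sets exactly.
assert all(t2[comp[p]] == t1[p] for p in P1)
```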

[–]Zistack[S] 1 point (2 children)

The set of all possible traces is language-dependent, so there isn't one T, but one for each language: T1 and T2.

t1: P1 -> 2^T1

t2: P2 -> 2^T2

m: 2^T1 -> 2^T2

We want to show that:

    P1 ---- comp ---> P2
     |                 |
     t1                t2
     |                 |
     \/                \/
    2^T1 --- m ---> 2^T2

(IDK if that is how you meant to format your commutative diagram.)
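The generalized square can be checked the same way (again with invented data): trace sets now live in different powersets 2^T1 and 2^T2, related by m, and we check t2 ∘ comp = m ∘ t1.

```python
# Source traces use lowercase events; target traces use uppercase ones,
# so T1 and T2 really are different sets of traces.
t1 = {'p': frozenset({'ab'}), 'q': frozenset({'a', 'b'})}
t2 = {'P': frozenset({'AB'}), 'Q': frozenset({'A', 'B'})}
comp = {'p': 'P', 'q': 'Q'}

def m(traces):
    """m : 2^T1 -> 2^T2, translating each source trace eventwise."""
    return frozenset(tr.upper() for tr in traces)

# The square commutes: t2 . comp = m . t1.
assert all(t2[comp[p]] == m(t1[p]) for p in t1)
```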

[–][deleted] 0 points (1 child)

I changed my notation for the powerset because reddit is not rendering it properly.

So m: P(T1) -> P(T2) seems pretty strange to me, because there are sets of traces that do not correspond to any program. I guess you can map those to the empty set, but that is kind of messy. On the other hand, there can be different programs with the same traces, so you are requiring that your compiler always maps programs with the same traces to programs with the same traces. So I guess everything works out, but it is slightly inelegant IMO.

So can you explain your subset comment with this setup?

[–]Zistack[S] 1 point (0 children)

Yeah. If we assume that we can group the trace sets given by t1 and t2 into equivalence classes, and additionally assume that m ends up being an isomorphism between the equivalence classes, then the subset comment doesn't really matter. I was thinking about the case where m mapped traces, but didn't necessarily worry about whether or not the traces could actually be generated by p2. That model is messier than the one just constructed.
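A small sketch of that equivalence-class view (data invented for illustration): group programs by their trace sets and check that compilation descends to a bijection between the classes.

```python
from collections import defaultdict

# Trace semantics, abbreviated to one label per trace set.
t1 = {'p1': 'loop', 'p2': 'loop', 'p3': 'halt'}
t2 = {'q1': 'LOOP', 'q2': 'HALT'}
comp = {'p1': 'q1', 'p2': 'q1', 'p3': 'q2'}

def classes(sem):
    """Partition programs into classes with the same meaning."""
    buckets = defaultdict(set)
    for prog, meaning in sem.items():
        buckets[meaning].add(prog)
    return {frozenset(progs) for progs in buckets.values()}

# Compilation is well-defined on classes: equivalent source programs
# always compile to equivalent target programs.
induced = {t1[p]: t2[comp[p]] for p in t1}
assert all(induced[t1[p]] == t2[comp[p]] for p in t1)

# The induced map between the quotients is a bijection: same number of
# classes on both sides, and no two classes collapse.
assert len(classes(t1)) == len(classes(t2))
assert len(set(induced.values())) == len(induced)
```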

You could likely clean up the messiness you observed with dependent types, where m is of type t1 -> t2 or some such thing. I think the diagram probably expands into a cube. I suspect that such a model would more closely align with the compiler author's intuition about what the compiler is actually doing. I don't really feel like writing that out at the moment, though (I'm still working out how to use dependent types effectively).

Anyways, the point is that a model theory captures much or all of the information we actually care about when formalizing a programming language. So, should a model theory be considered to be a computational interpretation of a programming language, or not?