Coalitional agency thoughts

Making public some stuff I wrote in an email because it has some interesting ideas.

General directions I have in mind.

Collectives of agents can be modeled as agents when they trade

Take two agents A and B which action sets XA and XB and utility functions UA(xA,xB), UB(xA,xB). In general if they are individually maximizing their utility functions, then maybe their chosen actions (xA*,xB*) will be some Nash-equilibrium of the game — but it may not be possible to interpret this as the action of a “super-agent”.

There are two ways to interpret the words “the action of a super-agent”:

More precisely by “trade” we mean: perfect information, zero transaction costs, and (I guess the last one is only relevant in economics) competitive market structure. I guess it also depends on perfect property rights/perfect assurance, which is implicitly assumed under zero transaction costs.

Some questions I can explore:

property rights stuff

We discussed this a bit over our call, but: “trade aligns incentives” requires two basic assumptions: (a) that there is no information asymmetry (b) there is perfect assurance, equivalently property rights/contract enforcement.

(In general I prefer this formulation to “outer and inner alignment” — the former is outer alignment; the relationship of the latter to inner alignment is more subtle but the idea is that there would be no “inner alignment problem” if your reward mechanism reliably persists forever; but once the AI gets smart enough it can just completely upend your reward mechanism.)

For instance the second fundamental theorem of welfare economics states a correspondence between “initial endowments” (which is a special case of “property rights allocations”) and “aggregate utility functions”.

I don’t know if right now I have a more precise idea than “property rights are probably worth studying; e.g. we should at least think about how to economically model settings where agents can violate rights/the initial rights allocations aren’t stable”.

I wrote some stuff about this as an undergrad: https://arxiv.org/pdf/2107.09651 — I don’t know if it’s very useful, but I did discover that there is interesting mathematical structure there, which I generally take as a sign of usefulness.

And as always, another question is how “property rights” translates into a thermodynamic context.

simply continuing work on my existing market-based RL project

i.e. on the attached paper. The key questions to work on are: - how can we make market-based agents more efficient — either by introducing “parent programs” or figuring out how to backpropagate into the space of agents - how can we get “modularity of state” in an arbitrary RL setting? - proving that market-based agents maximize e.g. infinite-horizon average reward in the long run

Re: your point about economics neglecting dynamics and computational complexity

Some essays and surveys on this are linked on this stack exchange post and this survey by Alfred Norman, but yeah — a proper general theory of this is lacking. Two thoughts I have:

This also applies the thermodynamics; everything is in terms of equilibrium states, not the dynamics of getting there. In both cases dynamics is linked to “learning”. But I do wonder: maybe in thermodynamics there is some physics describing the exact time-evolution, maybe a variational principle that can be transferred over to economics?

Maybe the “conditional on algorithmic information” intuition underlying bounded rationality also applies to time?