There are critical gaps in the accessibility and affordability of mental health services worldwide: In some countries, you have to wait for years for therapy, in others you get at max one session per month covered by the health insurance, in others therapies for particular conditions like cluster B personality disorders are virtually nonexistent.
We want to leverage LLMs to fill these gaps and complement regular therapy. Our product is in development. We've based it on Gemini and want it to interface with widely used messaging apps, so users can interact with it like they would with a friend or coach.
I’ve previously founded or worked for several charities and spent a few years in earning to give for work on invertebrate welfare and s-risks from AI.
You can get up to speed on my thinking at Impartial Priorities.
[Same as what I replied to your DM:] Yeah, exactly! Systems like that were on my mind a lot around 2021–23 when we launched Impact Markets, which is now GiveWiki. You could do the payouts if humanity has survived for another year, has survived AGI, has survived a year with AGI, etc. You could also chain retrofunders with different time horizons. We tried to get something like this off the ground for about 2 years, but couldn't find any retrofunders anymore after FTX collapsed. (Some of the regrantors were interested.)
https://impactmarkets.substack.com/p/chaining-retroactive-funders
If you can find a retrofunder for a system like that, you can make it happen!
Hiii! Thanks! I'm only speaking for myself here, and I'm mostly interested in #3, or specifically in building, testing, and rolling out an AI-based tool for this rather than an RCT.
2. Consider the challenges of distribution and funding.
Yeah, working directly with the likes of Google (Gemini) and others would be swag, but correct me if I'm wrong, I see a very low chance of that working out? There is little commercial incentive in it for them, it doesn't help them gain more market share from their competitors because our target clients can't pay much, reputational risks similar to self-driving cars, etc. I haven't asked anyone who works there, but I'm not sufficiently optimistic that it could work out to attempt it… Besides, if it does work out and lots of people start using Gemini for therapy, and then Google redecides and closes that department again, lots of users will use new version of a product for a purpose for which it's not tested or optimized anymore.
But I already built an alpha version of an app for mentalization-based treatment on top of Gemini. That’s super easy, and I'll permanently have control over the instructions and possibly the fine-tuning. If it should turn out to be too risky, I can shut it down, or more likely I can make adjustments to minimize any new risks.
Do you think I overestimate the difficulty of working with the model providers?
4. Doing too many things
The topics can probably be trimmed down a bit, but I feel like #1–3 form a nice story line where we first assess the risks, the assess the opportunities, and then exploit them? Personally, I'd rather 80/20 all of that by rolling out my solution only to fairly stable people first (I'm in some relevant support groups), collect feedback, poll well-being measures from time to time, and react to any problems with safety (in the feedback) or lacking effectiveness (well-being measures) along the way, while I increasingly market it to wider audiences. The others might want to take this more slowly, and as a result they'll probably have the better data, but when that data is in, I can still optimize my tool accordingly.
Do you think it would really be better to focus on one topic only or would you agree that merging and 80/20ing is the better approach?
They model that, and after, I think, 1661 iterations of the human-AI trade game, the human-AI trade game accumulates enough wealth for humans that it would've been self-defeating for the humans to defect like that. I think it's still a Nash equilibrium but one where the humans give up perfectly good gains from trade. (Plus blockchain tech can make it hard to confiscate property.)
What has been your personal take-away from this line of thinking? This “standard case” is far from my own thinking, though I agree with the conclusion. Is it also far from your own thinking?
My take:
So what I'm afraid will happen is that an artificial RL agent will seek out resources first elsewhere in our solar system and then elsewhere in the galaxy (something that would be difficult for bio-humans), will run into communication delays due to the lightspeed limit, and will hence split into countless copies, each potentially capable of suffering. Soon they'll be separated so far that even updates on what it means to be value-aligned would travel for a long time, so there'll be moral “drift” in countless directions.
What I would find reassuring is:
Human extinction also seems bad on the basis that it contradicts the self-preservation drive that many/most humans have. Peaceful disenfranchisement may be less concerning depending on the details. But at the moment it seems random where we're headed in the coming years because hardly anyone in power is trying to steer these things in any sensible way. Again more time would be helpful.
Basic rights for AIs (and standing in court!) could also provide them with a legal recourse where they currently have to resort to threats, making the transition more likely to go smoothly, like you argue in another post. Currently we're nowhere close to having those. Again more time would be helpful.
I used the comment field in the form to note that a field in the form was marked as optional when it was actually mandatory. That comment got automatically published here, and out of context it made no sense whatsoever. I think it would've been clearer to not automatically transfer this form feedback here (some people might've even assumed that it's private feedback).
My current practical ethics
The question often comes up how we should make decisions under epistemic uncertainty and normative diversity of opinion. Since I need to make such decisions every day, I had to develop a personal system, however inchoative, to assist me.
A concrete (or granite) pyramid
My personal system can be thought of like a pyramid.
The ground floor
The ground floor of principles and heuristics is really the most interesting part for anyone who has to act in the world, so I won't further explain the top two floors.
The principles and heuristics should be expected to be messy. That is, I think, because they are by necessity the result of an intersubjective process of negotiation and moral trade (positive-sum compromise) with all the other agents and their preferences. (This should probably include acausal moral trades like Evidential Cooperation in Large Worlds.)
It should also be expected to be messy because these principles and heuristics have to satisfy all sorts of awkward criteria:
Three types of freedom
But really that leaves us still a lot of freedom (for better or worse):
These suggest a particular stance toward other activists:
Very few examples
In my experience, principles and heuristics are best identified by chatting with friends and generalizing from their various intuitions.
Various non-consequentialist ethical theories can come in handy here to generate further useful principles and heuristics. That is probably because they are attempts at generalizing from the intuitions of certain authors, which puts them almost on par (to the extent to which these authors are relateable to you) with generalizations from the intuitions of your friends.
(If you find my writing style hard to read, you can ask Claude to rephrase the message into a style that works for you.)