I've been developing RoastMyPost (currently in beta) and wrestling with how to systematically analyze documents. The space of possible document checks is vast, easily thousands of potential analyses. Building on familiar concepts like "spell check" and "fact check," I've made a taxonomy
We've upgraded SquiggleAI to use Claude Sonnet 4.5, Claude Haiku 4.5, and Grok Code Fast 1. This is a significant upgrade over the previous Claude Sonnet 3.7 and Claude Haiku 3.5. All three are available now. Initial testing shows meaningful improvements in code generation
Dear Squiggle Community, At QURI, we're focused on tools that advance forecasting and epistemics to improve decision-making. As you know, we care deeply about evaluation, and we're holding a survey on Squiggle to better understand how and why people use our work. Honestly, developing this tooling
Epistemic status: speculative fiction. It's difficult to imagine how human epistemics and AI will play out. On one hand, AI could provide much better information and general intellect. On the other hand, AI could help people with incorrect beliefs preserve those false beliefs indefinitely. Will advanced AIs attempting
Epistemic Status: Early idea. A common challenge in nonprofit/project evaluation is the tension between social norms and honest assessment. We've seen reluctance among effective altruists to publicly rate certain projects for fear of upsetting someone. One potential tool could be something like an
Squiggle AI & Sonnet 3.7. We've updated Squiggle AI to use the new Anthropic Sonnet 3.7 model. In our limited experimentation so far, this model seems capable of making significantly longer Squiggle models (roughly 200 lines to 500 lines), but that
Update: I recently posted this to the EA Forum, LessWrong, and my Facebook page, each of which has some comments. Epistemic Status: A collection of thoughts I've had over the last few years, lightly edited using Claude. I think we're at the point in this discussion
We're launching a short competition to make Fermi models, in order to encourage more experimentation with AI and Fermi modeling workflows. Squiggle AI is a recommended option, but is not at all required. The ideal submission might be as simple as a particularly clever prompt paired with the
Thanks to Slava Matyuhin for comments. Summary: 1. AIs can be used to resolve forecasting questions on platforms like Manifold and Metaculus. 2. AI question resolution, in theory, can be far more predictable, accessible, and inexpensive than human resolution. 3. Current AI tools (combinations of LLM calls and software) are
Introducing Squiggle AI (EA Forum, Ozzie Gooen): We’re releasing Squiggle AI, a tool that generates probabilistic models using the Squiggle language. This can provide early cost-effectiveness models… We have previously written about Squiggle AI here, but waited until it was more tested and we had a better
We’ve been busy with a variety of things over the November-December season, though few make clean public blog posts or releases.
* Software improvements and maintenance
  * Squiggle: Published Squiggle v0.10
  * SquiggleHub: Replaced GraphQL with React Server Components. This means that pages load faster and developer velocity should be higher.
If you happen to be around Washington DC, I'll be doing a free ~1.5hr Squiggle workshop this Thursday evening. It will cover the basics, and I'll of course be around afterwards for extended discussion. Make sure to bring a laptop. See more information and RSVP