@sama
These are obviously not earth-shattering results, but the ability to produce genuinely new knowledge, however small, is a significant milestone and I hope we all take it seriously, with excitement and caution.
Tweet analysis: AI solving research-level math drew 40.73% support vs 34.39% confront. Reviews reactions, key themes, and implications for AI evaluation.
Real-time analysis of public opinion and engagement
What the community is saying — both sides
Celebration mixed with caution — many replies cheer the milestone that models can produce genuinely new knowledge, calling the "First Proof" effort a landmark while urging measured excitement and rigorous scrutiny.
users note the historical pattern of immediately downplaying breakthroughs and label this as goalpost moving.
Verification is now the bottleneck — repeated calls for formal proof CI, adversarial reviewers, and tools (Lean/Coq) to turn promising outputs into trustable, publishable results.
threads emphasize a dramatic arc from grade‑school math failures to research‑level proofs in ~3 years, prompting disbelief and recalibration of expectations.
Concerns about human involvement and reproducibility — many point out the work was a rushed, human‑facilitated "side‑sprint" and ask how much of the result is autonomous versus guided.
commenters see the same curve in coding, medicine, and engineering and worry that disruption (and job impacts) is already happening in real time.
engineers ask how to make these capabilities deterministic and safe at scale — the product challenge is moving from impressive demos to dependable systems.
several suggest building multi‑agent verification networks and autonomous research agents that can propose, verify, and publish novel results as the next step.
who gets to decide what counts as valid research, and how will labs, companies, and governments compete or cooperate as capability accelerates?
Tone across replies blends awe, skepticism, and urgency — commenters want both rapid progress and stronger mechanisms for validation, provenance, and responsibility.
GPT‑4o’s removal triggered widespread anger and grief, with many replies describing the loss as deeply personal and even psychologically harmful — users say they relied on that model for comfort, continuity, and creative work.
Commenters accuse leadership of broken promises and deceit (short notice, silence, routing traffic away), framing the change as a betrayal that cost trust and loyalty.
A large thread of voices argues that technical wins (like research‑level math) are being prioritized over empathy, creative utility, and accessibility — people call the choice a misaligned metric of success.
bring back GPT‑4o, release the weights, or open‑source the model so the community can preserve what they value.
Users cite concrete fallout — canceled subscriptions, refunds, and migration to competitors (Grok, Claude, Gemini) — as proof that the decision damaged OpenAI’s product viability.
critics label the move profit‑driven, paternalistic, and hypocritical relative to OpenAI’s public mission.
apologies, clearer communication, and policy changes are repeatedly requested.
Several replies emphasize harm to vulnerable groups (disabled users, people using the model for grief or therapy) and demand better accessibility and ethical consideration.
A minority applaud technical progress but stress it shouldn’t come at the cost of human connection; the tension between raw capability and relational usefulness is a recurrent theme.
Hashtags and mobilization (#keep4o, #OpenSource4o) signal organized community pressure and persistent activism aimed at reversing or mitigating the decision.
Most popular replies, ranked by engagement
These are obviously not earth-shattering results, but the ability to produce genuinely new knowledge, however small, is a significant milestone and I hope we all take it seriously, with excitement and caution.
OAI wont go down in history for math, but for what OAI callously did to the first waves of people who loved an AI model. This inhumane treatment makes OAI untrustworthy to make a superintelligent system. #keep4o #OpenSource4o
More important than saving lives? You've said much more than what you've written down... #keep4o
You went from non profit to for profit
ery glad you got engaged deep into this experiment. Mathematical community needs strong signal from the AI labs that science is a serious engagement for you. Mathematics in its full proof-driven form is a pinnacle of human ingenuity and knowing how well the models can grasp this
💀Congratulations !!!!! https://t.co/Gq2HlmU3a2