Great question! It's probabilistic, so there's not really a "right vs wrong" on any single question - it comes down to who better estimated the likelihood.
One big difference shows up when there's no useful context - we ran the same eval WITHOUT including any up-to-date context with the questions. In this case, GPT-5 stays overconfident and its Brier Skill Score (BSS) drops to -11.3% (vs -4.3% ours) - a negative BSS means doing worse than just guessing the base rate.
So one advantage of the RL training is just learning to know what you don't know, and identifying when there's real signal.
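
For a concrete sense of what those numbers mean, here's a minimal sketch of a Brier Skill Score computation against the base-rate baseline (the formula is standard; the outcomes and probabilities below are made up purely for illustration):

```python
import numpy as np

def brier_skill_score(probs, outcomes):
    """BSS relative to always forecasting the base rate.

    probs: predicted probabilities of 'yes'; outcomes: 0/1 resolutions.
    BSS = 1 - BS_model / BS_baseline. Negative BSS means the forecaster
    did worse than just guessing the base rate on every question.
    """
    probs = np.asarray(probs, dtype=float)
    outcomes = np.asarray(outcomes, dtype=float)
    bs_model = np.mean((probs - outcomes) ** 2)     # model's Brier score
    base_rate = outcomes.mean()                     # constant baseline forecast
    bs_base = np.mean((base_rate - outcomes) ** 2)  # baseline's Brier score
    return 1.0 - bs_model / bs_base

# Hypothetical data: an overconfident forecaster lands below zero.
outcomes = [1, 0, 0, 1, 0, 0, 0, 1]
overconfident = [0.95, 0.9, 0.05, 0.99, 0.8, 0.1, 0.9, 0.95]
print(f"BSS: {brier_skill_score(overconfident, outcomes):+.1%}")  # about -21.5%
```

The overconfident forecaster is heavily penalized on the questions it gets wrong at 0.8-0.9 probability, which is what drags the score below the base-rate baseline.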
interesting...what were some examples of things trump did that your model got right and gpt-5 got wrong?