• Notes
  • Articles
  • Followers 30
  • Following 50
  • Remote follow

Nevkontakte
@me@m.nevkontakte.com

Lawful neutral. Definitely not a cat in a hat. Opinions are of Cthulhu. what/why.

Long-form

nevkontakte.com

Chirp

twitter.com/nevkontakte

Code

github.com/nevkontakte

🔑
Nevkontakte's avatar
Nevkontakte
@me@m.nevkontakte.com

For a long time I was curious if Claude Code works so well because of Claude (the model) or Code (the CLI tool / agent). This weekend, I tried to find out. Turns out that both matter, but more than anything post-training fine tuning of the model makes a big difference. If the model has been tuned for planning and tool usage in a certain way, it would provide much more reliable results.

Details as in https://nevkontakte.com/2025/swap-ai-brains.html

#ai #llm #agents

What happens if we swap AI brains? | Ne v kontakte nevkontakte.com
  • permalink
  • interact from your instance
  • 29 days ago
  • 1 like
  • 1 reply
Likes
@Jay42@mastodon.social
Jay's avatar
Jay
@Jay42@mastodon.social

in reply to this object

@me Love the experiment.

Video clip is my mood when I saw Gemini2.5 was the secondary. "That model is nasty in general."

Leaderboards aren't everything but Gemini's position on https://gorilla.cs.berkeley.edu/leaderboard.html are a cry for help. "You okay over there, google?"

Berkeley Function Calling Leaderboard (BFCL) V4 gorilla.cs.berkeley.edu
GIF
  • permalink
  • 29 days ago
Powered by microblog.pub 2.0.0+dev and the ActivityPub protocol. Admin.