13 points | by upmostly 13 hours ago
2 comments
Which model was used for the benchmark results shown on your GitHub README.md?
Hey Leynos, we used Claude Sonnet 4.5, and the benchmark was the Martian code review bench: https://codereview.withmartian.com/?mode=offline