289 Commits

Author SHA1 Message Date
Rowan Walshe
2f82ba0d2b Merge branch 'topic/Add-human-eval-problems' into 'main'
Start adding HumanEval problems

See merge request eng/ai/ada-eval!22
2026-02-09 11:32:15 +00:00
Rowan Walshe
8f9cc4897a Start adding HumanEval problems 2026-02-09 11:32:15 +00:00
Rowan Walshe
cca2da835c Merge branch 'topic/Add-ability-to-unpack-generated-and-evaluated-datasets' into 'main'
Make it possible to unpack generated and evaluated datasets

See merge request eng/ai/ada-eval!20
2026-01-29 13:54:45 +00:00
Rowan Walshe
9293f60bb2 Apply 1 suggestion(s) to 1 file(s)
Co-authored-by: Sebastian M'Caw <mcaw@adacore.com>
2026-01-29 13:47:27 +00:00
Rowan Walshe
52a450c624 Address review comments 2026-01-28 17:28:46 +00:00
Rowan Walshe
80f1e39d1a Apply 9 suggestion(s) to 2 file(s)
Co-authored-by: Sebastian M'Caw <mcaw@adacore.com>
2026-01-28 16:52:18 +00:00
Rowan Walshe
6338a6b92d Merge branch 'topic/Remove-proof_steps-metric' into 'main'
Remove proof steps metric

See merge request eng/ai/ada-eval!21
2026-01-28 11:50:52 +00:00
Rowan Walshe
fd3e42bf1c Remove the proof_steps metric 2026-01-28 10:32:37 +00:00
Rowan Walshe
891654ba98 Remove data/evalauted and data/generated 2026-01-28 10:32:12 +00:00
Rowan Walshe
88bd0c3f18 Make it possible to unpack generated and evaluated datasets 2026-01-27 17:53:00 +00:00
Rowan Walshe
abb670fd40 Merge branch 'topic/Update-CI' into 'main'
Add tox.ini and update CI to test using the oldest and newest supported Python
2026-01-27 11:58:05 +00:00
Rowan Walshe
46b1d70e4c Add tox.ini and update CI to test using the oldest and newest supported Python 2026-01-27 11:58:04 +00:00
Rowan Walshe
4e470a3fcb Merge branch 'topic/Fix-issues-from-non-utf-8-files-in-generated-solution' into 'main'
Migrate dataset file contents to base64 encoding

See merge request eng/ai/ada-eval!15
2026-01-26 17:17:13 +00:00
Rowan Walshe
80a1c293d1 Migrate dataset file contents to base64 encoding 2026-01-26 17:17:13 +00:00
Rowan Walshe
580cf7b68e Merge branch 'topic/Add-license' into 'main'
Add LICENSE

See merge request eng/ai/ada-eval!14
2025-11-21 16:47:28 +00:00
Sebastian M'Caw
cc52701b27 Merge branch 'mr/recommend-alire-setup' into 'main'
Recommend only Alire setup

See merge request eng/ai/ada-eval!13
2025-11-21 16:21:39 +00:00
Rowan Walshe
a1b18426b4 Add LICENSE 2025-11-21 16:18:21 +00:00
Seb M'Caw
8ea540daf9 Update contents 2025-11-21 16:15:26 +00:00
Sebastian M'Caw
adae40b523 Apply 1 suggestion(s) to 1 file(s)
Co-authored-by: Rowan Walshe <walshe@adacore.com>
2025-11-21 16:14:45 +00:00
Seb M'Caw
0299f594dc Recommend using Alire for toolchain installation 2025-11-21 10:18:11 +00:00
Seb M'Caw
0a1dd8c9d9 Remove references to GitLab package registry 2025-11-21 10:05:39 +00:00
Sebastian M'Caw
cd2e8b3909 Merge branch 'mr/fix-mypy-ci' into 'main'
Treat mypy errors as CI failure

See merge request eng/ai/ada-eval!12
2025-11-20 16:54:24 +00:00
Seb M'Caw
2a4a1b7ab0 Treat mypy errors as CI failure
Co-authored-by: Rowan Walshe <walshe@adacore.com>
2025-11-20 16:37:31 +00:00
Sebastian M'Caw
361a3c808f Merge branch 'mr/report-cmd' into 'main'
Add `report` command

See merge request eng/ai/ada-eval!10
2025-11-20 16:34:48 +00:00
Sebastian M'Caw
2fbfb81b34 Merge branch 'mr/use-public-pypi' into 'main'
Use public PyPI

See merge request eng/ai/ada-eval!11
2025-11-20 16:03:10 +00:00