37 Commits

Author SHA1 Message Date
Rowan Walshe
fd3e42bf1c Remove the proof_steps metric 2026-01-28 10:32:37 +00:00
Rowan Walshe
80a1c293d1 Migrate dataset file contents to base64 encoding 2026-01-26 17:17:13 +00:00
Seb M'Caw
19cf9928f5 Report proof step count relative to canonical 2025-11-19 13:11:50 +00:00
Seb M'Caw
902a67c419 Add tests 2025-11-17 18:44:24 +00:00
Seb M'Caw
9421c7a4ed Add explain sample to test 2025-10-27 18:18:09 +00:00
Seb M'Caw
69be909dca Check that initial sources are not already solved 2025-10-24 16:46:16 +01:00
Seb M'Caw
d2e92fd3d3 Check accuracy of canonical results 2025-10-24 16:44:47 +01:00
Seb M'Caw
2900718503 Check for failing canonical evaluation results 2025-10-24 16:44:47 +01:00
Seb M'Caw
84a7e7289d Check compacted and expanded datasets match 2025-10-24 16:44:47 +01:00
Seb M'Caw
03f0735207 Remove test_prove_ci() 2025-10-23 13:27:45 +01:00
Seb M'Caw
cea04a022e Self-review 2025-10-16 17:13:06 +01:00
Seb M'Caw
a1516f44ef Remove old EvaluationStats format 2025-10-16 14:43:00 +01:00
Seb M'Caw
dd108a234c Remove redundant test sample 2025-10-16 11:19:16 +01:00
Seb M'Caw
a2467c978c Simplify delegated_fails sample 2025-10-16 10:14:20 +01:00
Seb M'Caw
78f05f3d38 Detect when SPARK_Mode is set to Off 2025-10-15 12:13:33 +01:00
Seb M'Caw
43041c4113 Generalise src_pattern matching 2025-10-14 18:03:40 +01:00
Seb M'Caw
37486bb0c6 Fix alignment 2025-10-13 17:21:52 +01:00
Seb M'Caw
ff7cd0d0cd Add "required_checks" to loader tests 2025-10-03 16:34:41 +01:00
Seb M'Caw
f05bb560c1 Permit filtering "required_checks" by entity 2025-10-03 15:32:48 +01:00
Seb M'Caw
4b763b2113 Add "entities" to expected_spark_files 2025-10-03 14:29:56 +01:00
Seb M'Caw
e329908105 Fix test_prove_ci() 2025-10-03 11:47:55 +01:00
Seb M'Caw
d056776522 Add delegated_fails sample 2025-10-03 10:35:26 +01:00
Seb M'Caw
814b26249b Recategorise warnings as incorrect proof 2025-09-26 12:07:45 +01:00
Seb M'Caw
b339923ee7 Fix test_prove_ci() 2025-09-25 18:06:46 +01:00
Seb M'Caw
b1e67beae4 Update test_prove() 2025-09-25 18:04:52 +01:00