Commit Graph

  • 8501aeb5c9 Add "required_checks" where appropriate Seb M'Caw 2025-09-26 17:36:25 +01:00
  • 814b26249b Recategorise warnings as incorrect proof Seb M'Caw 2025-09-26 12:07:45 +01:00
  • b339923ee7 Fix test_prove_ci() Seb M'Caw 2025-09-25 17:36:47 +01:00
  • b1e67beae4 Update test_prove() Seb M'Caw 2025-09-25 16:43:23 +01:00
  • 5a88bb6bee Add required_checks field to SparkSample Seb M'Caw 2025-09-24 18:18:06 +01:00
  • 4a9609b4d0 Remove redundant Loop_Invariant from char_count_2 Seb M'Caw 2025-09-23 10:25:42 +01:00
  • 20131b39df Read GNATprove results from .spark file Seb M'Caw 2025-09-23 10:24:43 +01:00
  • bdd43daa5e Merge branch 'topic/Add-simple-unit-tests' into 'main' Rowan Walshe 2025-09-25 08:33:47 +01:00
  • 75fcb8902b Add simple unit tests Rowan Walshe 2025-09-25 08:33:47 +01:00
  • 7251d409e4 Merge branch 'mr/mcaw/implement-eval' into 'main' Sebastian M'Caw 2025-09-09 10:30:40 +01:00
  • bee14685ab Add .python-version file Seb M'Caw 2025-09-08 16:12:04 +01:00
  • d3c5080a9c Increase minimum Python version to 3.12 Seb M'Caw 2025-09-08 16:10:12 +01:00
  • 5c9684f837 Add pre-run to mock shell_script tool Seb M'Caw 2025-09-08 13:06:24 +01:00
  • 0786cba6e5 Also fix parameter order warning in base files of swap_depends Seb M'Caw 2025-09-08 12:33:32 +01:00
  • b3028e3b8f Update prompt for absolute_value to mention overflow warning Seb M'Caw 2025-09-08 12:26:08 +01:00
  • c8d5838a0d Self-review Seb M'Caw 2025-09-03 17:09:08 +01:00
  • 9275c65f8f Run make generate-dummy and make evaluate Seb M'Caw 2025-09-02 14:28:12 +01:00
  • 273fb0fb1a Use default --jobs for Makefile targets Seb M'Caw 2025-09-02 14:18:29 +01:00
  • f052729eaf Update README Seb M'Caw 2025-09-02 14:17:32 +01:00
  • 5238bc63d0 Add prove test which can be run in CI Seb M'Caw 2025-09-02 13:22:48 +01:00
  • 29dca5fac5 Fix running canonical evaluation on individual datasets Seb M'Caw 2025-09-02 11:53:27 +01:00
  • dc588f9d9f Rename build stats attributes Seb M'Caw 2025-09-02 09:33:06 +01:00
  • bac5ab71cf Rename to_evaluated_sample -> as_evaluated_sample Seb M'Caw 2025-09-01 18:26:21 +01:00
  • b186f7e0d1 Specify project file explicitly Seb M'Caw 2025-09-01 18:08:17 +01:00
  • 4e4da134cf Run test_build() in CI Seb M'Caw 2025-09-01 17:31:36 +01:00
  • 7d158b0399 Use enum for EvaluationStats discriminant Seb M'Caw 2025-09-01 17:07:34 +01:00
  • 77ac783af0 Fix test_eval_path_checks() Seb M'Caw 2025-09-01 10:09:22 +01:00
  • 855e01d02f Add miscellaneous further tests Seb M'Caw 2025-08-29 16:10:56 +01:00
  • 24e37be267 Pack eval test datasets Seb M'Caw 2025-08-29 15:32:41 +01:00
  • 76ceb05eaa Add prove eval test Seb M'Caw 2025-08-29 15:24:38 +01:00
  • 0401dc9e3d Add build eval test Seb M'Caw 2025-08-29 14:12:29 +01:00
  • bd2596ddd4 Add subprocess debug logs Seb M'Caw 2025-08-29 13:52:50 +01:00
  • 3834eea7ff Add --verbose switch Seb M'Caw 2025-08-29 13:51:54 +01:00
  • 940eeaeeeb Use mocks for all generic eval tests Seb M'Caw 2025-08-29 10:48:58 +01:00
  • 330d7b3eac Define SampleStage enum Seb M'Caw 2025-08-29 10:41:10 +01:00
  • c4c0072e92 Add tests for evaluate_directory() Seb M'Caw 2025-08-28 17:09:48 +01:00
  • 665ef0e22c Add warning for empty evals list Seb M'Caw 2025-08-28 12:11:34 +01:00
  • 0d6c61f04f Propagate WrongEvalOutputTypeError Seb M'Caw 2025-08-28 12:03:23 +01:00
  • 39f1a1aa99 Add test_generic_eval() Seb M'Caw 2025-08-27 17:01:01 +01:00
  • 82151b55a2 Add test for loading evaluated data Seb M'Caw 2025-08-27 10:57:43 +01:00
  • 3191657214 Add explicit check for mixed sample types in load_packed_dataset() Seb M'Caw 2025-08-27 10:37:44 +01:00
  • e18fa7633d Fix discrepancy between canonical and generated explain solutions Seb M'Caw 2025-08-27 09:31:38 +01:00
  • 81d98a5440 Add support for loading evaluated datasets Seb M'Caw 2025-08-26 16:42:01 +01:00
  • f0b5da214a Cleaner fix for swap_depends compiler warning Seb M'Caw 2025-08-26 10:59:45 +01:00
  • 016d111c9d Remove -f gprbuild switch Seb M'Caw 2025-08-26 10:53:39 +01:00
  • e6f71a156e Reorder evals Seb M'Caw 2025-08-22 17:21:41 +01:00
  • 3f4e934b44 Fix serialisation of EvaluationStats.eval_name Seb M'Caw 2025-08-22 17:12:23 +01:00
  • 003e02d256 Shorten eval names Seb M'Caw 2025-08-22 16:15:00 +01:00
  • b3a3e1bb89 Initial implementation of gprbuild eval Seb M'Caw 2025-08-22 13:29:46 +01:00
  • 26dab5bb52 Fix after merge Seb M'Caw 2025-08-22 09:38:40 +01:00
  • c5cfcb7871 Merge branch 'main' into mr/mcaw/implement-eval Seb M'Caw 2025-08-22 09:36:18 +01:00
  • e39ec33b35 Merge branch 'mr/mcaw/implement-eval-preparatory-refactoring' into 'main' Sebastian M'Caw 2025-08-22 09:34:33 +01:00
  • 30c13d8b9a Implement review suggestions Seb M'Caw 2025-08-21 17:17:57 +01:00
  • b8cec1b07f Apply 3 suggestion(s) to 3 file(s) Sebastian M'Caw 2025-08-21 16:31:06 +01:00
  • 72c97968c0 Improve merging of canonical_evaluation_results Seb M'Caw 2025-08-21 12:34:10 +01:00
  • c41daf6f06 Merge branch 'mr/mcaw/implement-eval-preparatory-refactoring' into mr/mcaw/implement-eval Seb M'Caw 2025-08-20 17:11:26 +01:00
  • f0acd445fa Self-review Seb M'Caw 2025-08-20 17:06:13 +01:00
  • 5e37e3c1fe Run make generate-dummy Seb M'Caw 2025-08-20 16:49:01 +01:00
  • ea188d635f Refactor in preparation for adding evaluation framework Seb M'Caw 2025-08-20 16:36:39 +01:00
  • 68df455659 Self-review Seb M'Caw 2025-08-20 15:21:12 +01:00
  • fc01a146af Move DirectoryContents to own module Seb M'Caw 2025-08-20 11:09:50 +01:00
  • 817606e01f Rerun dummy generation/evaluation Seb M'Caw 2025-08-20 10:46:13 +01:00
  • cb9b596e2e Remove redundant global tool/eval name constants Seb M'Caw 2025-08-20 10:39:23 +01:00
  • 7a0151eb2e Clean up imports Seb M'Caw 2025-08-20 10:34:11 +01:00
  • 6170aa233c Remove forward reference quotes Seb M'Caw 2025-08-20 09:47:07 +01:00
  • 687be119b1 Remove pointless check for duplicate dataset names Seb M'Caw 2025-08-19 17:22:50 +01:00
  • c28aee5362 Fix test Seb M'Caw 2025-08-19 17:15:15 +01:00
  • 99ac15ed7f Fix outdated tool_config_file reference in README Seb M'Caw 2025-08-19 17:09:37 +01:00
  • 42d807031b Fix Makefile Seb M'Caw 2025-08-19 17:05:46 +01:00
  • 085f7983f8 Detect packed/unpacked automatically in load_dir() Seb M'Caw 2025-08-19 16:49:26 +01:00
  • 245523f5e4 Add tools test Seb M'Caw 2025-08-19 12:43:31 +01:00
  • 752af27d32 Clean up sample_operation.py Seb M'Caw 2025-08-18 16:53:50 +01:00
  • ef5a3a09ec Reduce tool/eval boilerplate Seb M'Caw 2025-08-18 16:52:43 +01:00
  • cb38aa466a Rename call_{generate,evaluate}_completions() Seb M'Caw 2025-08-15 10:50:14 +01:00
  • 68929ce52a Remove redundant get_explain_solution() Seb M'Caw 2025-08-15 10:40:03 +01:00
  • 4a45062312 Fix test_no_args() Seb M'Caw 2025-08-14 16:42:53 +01:00
  • b4ca95b964 Improve coverage reporting Seb M'Caw 2025-08-14 16:38:35 +01:00
  • 0bed3cf7ab Add tests for data handling Seb M'Caw 2025-08-14 16:30:50 +01:00
  • daf59a44a3 Add CLI tests Seb M'Caw 2025-08-14 15:19:56 +01:00
  • fff6318cfe Improve exception handling Seb M'Caw 2025-08-14 15:04:31 +01:00
  • 2dd0045b6d Add files to DirectoryContents Seb M'Caw 2025-08-12 17:08:31 +01:00
  • 8dc7be23e0 Fix get_contents_git_aware when not in a Git worktree Seb M'Caw 2025-08-12 15:57:39 +01:00
  • 7faf7f7e61 Add sample name to exceptions in load_unpacked_dataset() Seb M'Caw 2025-08-12 15:55:57 +01:00
  • 8de8b39e10 Add unpacked option to load_dir() Seb M'Caw 2025-08-12 15:53:07 +01:00
  • 699e464f63 Rename & relocate save_to_dir Seb M'Caw 2025-08-12 15:47:38 +01:00
  • 636de7c659 Fix <lambda> in output of invalid Enum value CLI error Seb M'Caw 2025-08-12 15:36:49 +01:00
  • d6c8fbc997 Detect missing subprograms explicitly Seb M'Caw 2025-08-07 12:39:14 +01:00
  • c6bd319a6a Restrict analysis with --limit-name Seb M'Caw 2025-08-07 11:42:31 +01:00
  • 4be79ccdb2 Fix absolute_value sample Seb M'Caw 2025-08-06 17:11:56 +01:00
  • eb553d1042 Merge branch 'main' into mr/mcaw/implement-eval Seb M'Caw 2025-08-06 16:27:24 +01:00
  • 710e8f8c75 Remove unused gnatprove.json tool config Seb M'Caw 2025-08-06 16:02:57 +01:00
  • 04eee6ee13 Define save_to_dir() Seb M'Caw 2025-08-06 16:02:36 +01:00
  • 34a2b49c50 Reduce duplication in load_unpacked_sample()s Seb M'Caw 2025-08-06 15:17:47 +01:00
  • e7d2631607 Fix packing/unpacking with canonical_evaluation_results Seb M'Caw 2025-08-06 14:29:32 +01:00
  • 7849926649 Record only timeouts in EvaluationStats Seb M'Caw 2025-08-06 11:36:29 +01:00
  • 1f1a0c0ead Rename type maps Seb M'Caw 2025-08-06 11:23:36 +01:00
  • a9c88d4086 Use repr() for exception in EvaluationStatsFailed Seb M'Caw 2025-08-06 11:15:43 +01:00
  • 74830b3ce2 Define check_on_path() Seb M'Caw 2025-08-06 11:11:08 +01:00
  • 1cd5a9bfa5 Rename compiler -> gprbuild Seb M'Caw 2025-08-06 11:01:40 +01:00
  • 751d0ed7ac Remove redundant warning from SampleOperation.apply_to_datasets() Seb M'Caw 2025-08-06 10:49:23 +01:00