Week 3

Previous Goals:

  • Resolve how to handle Flexeme’s inconsistent clustering results for end-to-end tests
  • Write a draft of .yml file to install tools on MacOS for github workflows
  • Get the regression test running in continuous integration on Github Actions
  • Refactoring: Identify parts of the pipeline where unix diff can be used instead of CSV and implement if possible
  • Implement tangled lines/hunks: add new functionality

This Week’s Progress:

  • I got to implement 4 out of the 5 goals set out at the beginning of the week.
  • The end to end tests were made deterministic implemented successfully for all different test runs, as Mike suggested a way to override all randomness using export PYTHONHASHSEED=0. The PR is currently approved and waited to be merged.
  • I finished writing the ci.yml file, but implementing CI is blocked because of several issues. On Github Actions, installation of the tools and dependencies takes~15 minutes and is expensive, causing us to exceed our Github runtime limit. To solve this, I have worked on my own fork of the repository instead and tried to implement caching for CI. On Docker, I tried to create a Docker image from a base image that runs on Mac guest operating systems. However, I am stuck on trying to debug failures of building the Docker image. To solve this, Mike intends to get the end-to-end tests working on Linux, but we have issues with Flexeme dependencies.
  • Regarding new functionality:
    • I have modified ground_truth.py to reduce duplicated code by writing a for loop to calculate untangling score for each tool.
    • I have modified the grountruth.sh Bash script by using Unix patch commands to create the 3 diff artifacts (version_control.diff, bug_fix.diff, and non_bug_fix.diff). This eliminates the need for the inversion of the minimized bug-inducing patch, and actually stores the non bug fixing diff. We now have the 3 unified diff files for manual analysis.
      • A higher goal is to eliminate the need for repairing the diff lines by identity over linewise content. To achieve this, I want to implement patch to obtain the non_bug_fix diff in a different way by patching original source code with the bug_fix.diff only. This is halted due to issues regarding (1) tangled lines, and (2) tangled hunks. I have written a script to clean the unified diff files before patching, but I want to consult Thomas and Mike on whether this current approach is error-prone.

Next Goals:

  • Respond to Thomas’ reviews about PRs and hopefully merge all open PRs
  • Consult Mike and Thomas on Unix diff implementation and try to get it further down in the pipeline
  • Convert Flexeme and SmartCommit results to diff formats
  • Resolve CI issues (with Github Actions caching, Docker)

Miscellaneous

For teatime event in PLSE this week, I tried out Scandinavian pastries for the first time!

teatime 1

Written on June 23, 2023