Accelerate a pnpm migration with Issues, Epics, and Batch Changes
Large-scale migration efforts are challenging for even the most sophisticated organizations: how do you identify, track, update, and validate all of the changes that need to be made? Seemingly straightforward projects can become logistical nightmares. In this case, we’ll use the example of migrating to pnpm.
For an organization with hundreds or thousands of repositories that use yarn or npm, the task of migrating to pnpm can take a lot of precious time and resources! Commands need to be run to install pnpm, substitute pnpm commands, re-import packages, commit the changes, and open merge requests. On top of this, you may need to manage rate limits at the code host which might slow you down when downloading the code, pushing commits, and opening merge requests.
Such a large migration project, which touches hundreds or thousands of repositories, could benefit from some automation and project management tools. The steps to perform the migration are similar for every repository, which makes the migration a perfect candidate for Sourcegraph’s Batch Changes. In this post, we’ll show how you can make API calls to integrate Sourcegraph’s Batch Changes with GitLab Issues and Epics.
The combination of Batch Changes with project-management tools like Issues and Epics provide an effective way to not only jump-start such a massive migration project, but create a space for teams to collaborate when handling the intricacies and nuances that inevitably arise once the work has begun.
pnpm
pnpm is a fast, disk-space-efficient package manager that has quickly become a favored alternative to yarn and npm. At the time of this writing, there are dozens of blog posts by individuals and organizations that have migrated from npm or yarn to begin using pnpm.
Sourcegraph Batch Changes
Batch changes are used to make large-scale changes across many repositories, and even across multiple code hosts. This can be extremely useful when, for example, you want to kick-start a huge initiative to make a change to hundreds, or even thousands, of repositories.
A huge benefit of batch changes is that they run in a modular container architecture. This lets you run multiple containers, stepwise, over the repository files in a completely isolated virtual environment.
Batch changes will: * Run your migration script(s) via auto-scaling executors in the cloud * Preview the changesets * Automate the creation of merge requests * Allow the use of template language for titles, descriptions, and commit messages * Provide a clear interface which displays the status of each changeset * Allow bulk edits
GitLab Issues & Epics
GitLab Issues are used to collaborate on ideas, solve problems, and plan work. When issues share a theme across projects and milestones, you can manage them by using Epics.
GitLab Epics serve as the parent of one or more issues, and/or of one or more child epics.
In our example today, we’ll use a parent epic to track the pnpm migration at the organization level, and we’ll create one issue per merge request to perform each repository-level pnpm migration.
Sourcegraph Batch Changes, used in conjunction with GitLab Issues and Epics, allows an organization to programmatically and efficiently execute a large — or a not so large — migration project.
Implementation Details
At a high level, this working example has a few key files: * batch-spec.yaml * createIssue.js * gitlab-script.Dockerfile * node-pnpm.Dockerfile
batch-spec.yaml
The batch spec file is the most important part of this process, because it ties everything together. This is where the steps of the change are defined, in the following format:
```yaml
steps: * run: # The command to run when the container starts
container: # The container to use
env: # Environment variables and secrets that are needed by the container
outputs: # Any outputs to use in the changeset template or in future run commands
Additional steps can be defined in sequence:
- run:
container:
```
createIssue.js
This is the script that calls the GitLab API to create an Issue as part of a changeset, and assigns the created Issue to the parent Epic.
Dockerfiles
The gitlab-script.Dockerfile and node-pnpm.Dockerfile are example Dockerfiles to demonstrate building containers to do specific tasks - in this case, one to copy the createIssue.js script, and another to install pnpm so that pnpm commands can be run in the container. You don’t need these to run the batch spec file as-is, but you will need to write similar Dockerfiles if you want to update the GitLab script, add additional packages to help with your migration needs, use a different version of pnpm, etc.
Putting it all together
The example batch-spec.yaml has 3 steps: 1. Use Ruby scripting to update package.json files in the repository 2. Run “pnpm import†and delete lockfiles 3. Run the createIssue.js script (which requires environment variables) and output the issue link to be used in the merge request description
Batch changes can be run locally (for faster iteration during development/testing) or on a Sourcegraph Enterprise instance with cloud executors. Once the changesets are applied, Sourcegraph uses the configured user access tokens to open the merge requests.
Since we linked the issue in the merge request description of each changeset, GitLab links all the merge requests to their associated issues, and all the issues to the parent Epic.