Skip to content
GitLab
Projects Groups Snippets
  • /
  • Help
    • Help
    • Support
    • Community forum
    • Submit feedback
    • Contribute to GitLab
  • Sign in / Register
  • M metaseq
  • Project information
    • Project information
    • Activity
    • Labels
    • Members
  • Repository
    • Repository
    • Files
    • Commits
    • Branches
    • Tags
    • Contributors
    • Graph
    • Compare
  • Issues 95
    • Issues 95
    • List
    • Boards
    • Service Desk
    • Milestones
  • Merge requests 41
    • Merge requests 41
  • CI/CD
    • CI/CD
    • Pipelines
    • Jobs
    • Schedules
  • Deployments
    • Deployments
    • Environments
    • Releases
  • Packages and registries
    • Packages and registries
    • Package Registry
    • Infrastructure Registry
  • Monitor
    • Monitor
    • Incidents
  • Analytics
    • Analytics
    • Value stream
    • CI/CD
    • Repository
  • Wiki
    • Wiki
  • Snippets
    • Snippets
  • Activity
  • Graph
  • Create a new issue
  • Jobs
  • Commits
  • Issue Boards
Collapse sidebar
  • Administrator
  • metaseq
  • Merge requests
  • !202

[jsonl] Build cache on main worker only.

  • Review changes

  • Download
  • Email patches
  • Plain diff
Merged Administrator requested to merge mainworkerparty into main Jul 05, 2022
  • Overview 2
  • Commits 2
  • Pipelines 0
  • Changes 1

Created by: stephenroller

Patch Description As a mild convenience, we want the indices of datasets to be built automatically when they're first used. However, if we build them on every worker, we can easily flood NFS and the process is slow. However, if we build them on the primary worker, then it only takes a couple minutes to handle several TB.

This PR causes the indexes to be built on the main worker, while the rest take a break.

Testing steps Used in internal ablation

Assignee
Assign to
Reviewers
Request review from
Time tracking
Source branch: mainworkerparty