Explore GitLab
Discover projects, groups and snippets. Share your projects with others.
-
Collection of data engineering DAGs to be executed by the WMF Airflow instances.
-
A Python library for working with the HTML dumps.
-
A repository of GitLab CI-related security templates. It is designed as an à la carte selection of security tools that users can add to their own repositories as needed.
-
A fork of MediaWiki's BoilerPlate template to show example security CI template setup.
-
Quickly spins up a local Dockerized DDD / Datasette instance, from scratch.
-
Repository to store Phabricator remarkup security templates for manual security reviews (https://www.mediawiki.org/wiki/Security/SOP/Application_Security_Reviews)
-
Data³ is a toolkit and general framework for visualizing just about any data. More specifically, it currently provides dashboards to explore Phabricator workflows and MediaWiki production deployments.
-
Code for research on curious and critical readers
-
Repository for configuration of Trusted GitLab Runner.
-
Data jobs owned by the Global Data and Insights team.
These should use conda-dist from workflow_utils to generate conda env artifacts, which are then deployed to the Analytics Data Lake for scheduling and running by Airflow.
-
Scripts and tools for testing the SimilarEditors extension
-
Data pipelines operationalised by the Generated Data Platform team.
-
Research on copyedits as a Structured Task for newcomers
-
A simple tool that merges various externally hosted Semgrep rules and policies for packaging and consumption by the Semgrep CLI. It is needed largely as an alternative to the semgrep.dev//r rules/policies repository.