Superset
Apache Superset is a platform for data exploration, collaboration, and visualization.
Instances
Wikimedia Cloud Services
A public instance of Superset is currently being tested. It has access to Wiki Replicas and to ToolsDB (public schemas only). Source code can be found here: https://github.com/toolforge/superset-deploy. Patches are welcome!
Data Engineering
The data engineering team runs a restricted access instance. For access, see Analytics/Data access#LDAP access.
How do I ...
Get access to Superset?
Please read Superset Access.
See a list of recent queries from all users?
Look at the Query History under SQL Lab.
Run a SQL query?
Use SQL Lab, it can interactively build your query. See upstream documentation.
Database configuration has a map of which schema is in which database.
Collaborate with others?
Create a dashboard, and add additional owners. Publish the same dashboard and it will also be publicly visible. To publish, click the "Draft" text adjacent to the star icon.
Create a visualization?
A chart can be created from any query. See the upstream documentation.
Use the API?
See https://superset.wmcloud.org/swagger/v1 and the upstream documentation.
Add a feature
For new upstream features, submit them to the Apache superset project: https://github.com/apache/superset. For specific configuration related to WMF deployments, see https://github.com/toolforge/superset-deploy.