This article provides an overview of how to run data exchange projects on Senate.
It takes you through the steps for creating a new project, getting your collaborators on board, through to performing your analysis and taking insights off the platform.
Setting up a project involves 8 steps:
- Create a project: Set up a governed space to manage your data exchange. Provide some basic information about your project and the business questions you are seeking to answer.
2. Add people to your project: Add project members; such as data analysts, project managers, data custodians (owners) or anyone else you want to have in your project. Note: If you have requested access to a data listing from the Catalog, the owner of the data package will automatically be added to the project to discuss your data request.
3. Set up communications: The project’s Conversations tab allows you to communicate with your project members and discuss the permitted use of data requested. Initiate a conversation and create a Data License as a basis for further discussion with Data Custodians.
4. Add packages to your project: In order to create a data license, you must first add a data package. Learn how to create a data package. A data package references all the files associated with a data share, not just data tables and views. If you’re working with your own data, then you add the packages, otherwise, the data custodian adds their package to your project. Users from other organizations will not have access to view the contents of your package.
If the data owner does not have a data package ready, they can create an empty package as a placeholder so you can proceed to create a draft license.
5. Create a data license: Provide data owners with information about your project and the permitted use of data you are requesting. This information will help them assess the commercial terms for your data license, pricing and any risks associated with your project.
- Be specific as possible about the data you are requesting, as data owners may need to apply filters on the dataset requested. Also, be specific about what output you intend to extract.
- Return to the Conversations tab as needed to update project members when the permitted use is ready for review. Data custodians may request for the permitted use to be revised to include additional information.
Proceed to submit the draft data license for approval when all have agreed on the permitted use and the final version of data packages have been added.
Once the data license has been approved by participating organizations, access to data can be granted for permitted uses.
6. Request a Data Science Workspace: If you will be analyzing datasets via Senate, request a Discovery Workspace or DW to access approved dataset(s) and run your analysis.
7. Do your analysis: When a workspace is approved and is in running status, connect to the workspace to complete your analysis.
8. Get your results: If permitted by your data license, run your analysis and contact Data Republic to extract your output. Depending on the level of governance selected for your project, Data Republic will perform a release check to ensure that the output aligns with the permitted use in the data license.
9. Closing your project workspace: If you have a running workspace and have extracted your output from Senate, request to close your workspace if it is no longer required.