Once a Discovery Workspace (DW) is approved and is in running status, connect to the workspace to complete your analysis.
In this article you will learn how to
- Connect to a DW
- Access files and tables in a workspace
- Collaborate between workspaces
- Change your workspace set up
- Understand rules for data-use and extraction
- you have created a project
- you have added people to your project
- your data license has been approved by all parties
- your Discovery Workspace (DW) has been approved and is running
- you have loaded data to your workspace
Connect to the DW
Navigate to your project’s Workspaces tab and click the workspace name.
Take note of the workspace reference number and click Connect to this workspace.
The next screen lists of the virtual machines (workspaces) that you have access to.
Click on the workspace reference number to connect.
Note: If the workspace reference number is not listed you may not have been added to the workspace; click the 'Change workspace set up' button in the Workspace Summary screen in Senate to edit the list of users who can access the workspace, and submit your request for approval.
The DW will launch as a new tab in your browser. No special firewall rules should be required, the access protocol is via HTTPS.
Accessing files and tables
Once data packages have been added to the DW:
- Files can be accessed here C:\Users\Public\Public Documents\ ; a new folder will be created if there are any files
- Tables can be accessed by connecting to the redshift database using SQL Workbench, Python or R. Your Redshift database connection profile will automatically be configured. To view your Redshift connection profile, open SQL Workbench > Files > Connect window. You will only be able to access the database from within your secure Workspace(s).
Collaboration between Workspaces
Set up one or more Workspaces in a project to allow team members to collaborate in real time. (Note: Check with your project lead before requesting additional Workspaces as additional costs may be incurred).
Within a Project, users between Workspaces can:
1. Share files between Workspaces by adding them to the Project Drive (total drive storage is 100Mb).
- The Project Drive can be accessed via a shortcut on the desktop or File Explorer. Any user in a Workspace can read, modify or delete any files added to the Project Drive.
- If there is only one Workspace in your Project, the Project Drive will function like any other local drive in your Workspace as there will be no other Workspaces to share files with.
- I don't have a Project Drive, how do I request one? You will need to submit a request to add the Project Drive. Save what you are working on in the Workspace, then submit a request to add the Project Drive: to do this, click 'Change Workspace setup', do not make any changes and check the boxes to agree to the terms before clicking Submit. Your request will automatically be processed and approved (the status of your request will be displayed on the top right of the Workspace details screen). Your Project Drive should be available on the same business day. Connect to the Workspace, log out of Windows and log back in - the Project Drive should be visible. You will need to complete this step for any other Workspaces in your project that do not have the Shared Drive.
2. Query and write to the same Redshift database from any Workspace within your Project, using languages such as SQL, python or R. Workspaces created in the same Project will automatically connect to the same Redshift database. Open SQL Workbench to view your Redshift database connection details.
To learn more about Workspaces, refer to the Workspace FAQ
Change your workspace setup
Request a change in your workspace set up if you need to:
- Add users
- Change the workspace size / spec
- Install other software
Follow the prompts to edit the workspace set up and and submit your request for approval.
Rules for data-use and extraction
Prior to running your analysis, it is important to read the rules for data-use and extraction in the license (found in the license tab of your project) to ensure:
- You understand conditions of permitted use of data, including how the data should be prepared for extraction (if permitted) and if there are any rules you need to apply to your queries (for e.g. the data custodian may ask you to apply rules of aggregation to mitigate any risk of re-identification)
- Comments should be included in your queries to demonstrate that you have applied the requirements of the permitted use in your data license. This will help Data Republic to quickly approve your data extract request.