A Workspace is a quarantined analytical environment, created within a governed project space after a data license has been approved by all project participants.
Depending on the level of governance selected for your project, data owners may request for sensitive data to be accessed only via a secure Workspace and for all outputs to be manually checked prior to approval for extraction. This ensures that any output created in a workspace complies with the permitted use and extraction rules agreed to in the data license.
A workspace is a Windows or Linux virtual machines (VM) where approved datasets are loaded for analysis. Each governed project space may consist of one or more VMs.
Senate offers a variety of workspace configurations so that analysts can choose the configuration most suited to the analytics they are conducting
What workspace configurations are available on Senate?
Senate offers both Windows and Linux based virtual machines in three Workspace sizes each with varying costs based on storage and processing specifications.
Each Workspace is allocated by default with:
- 100GB of local storage space
- 550GB of redshift database storage
- 2GB of Project Drive storage space (available for new projects only, workspaces in existing projects are allocated 100MB)
- Add-on storage is available at an additional cost
GPU* workspaces can be added to either Windows or Linux based workspaces. Enhancing flexibility and processing capacity suitable for AI and ML activities.
*Please note: GPU incurs additional costs, please speak to your account manager before requesting a GPU workspace.
What is installed in a Workspace?
Each Workspace is created to meet your specific needs, with the installation of all of the below software.
Please note: Software licenses are required if applicable, for e.g. Excel, Tableau, etc.
Anaconda3 (Python 3.7) 2019.10; includes Jupyter Notebook and QT Console. Anaconda in the standard Workspace currently comes with any packages ticked on this page: https://docs.anaconda.com/anaconda/packages/old-pkg-lists/2019.03/py3.7_win-64/.
Analysts are able to install library dependencies without needing Data Republic's help, giving you more control over your Workspace.
- Python packages from PyPi and Anaconda can be installed by users to Workspaces
- R packages from from CRAN within RStudio (Linux) can be installed by users to Workspaces.
Please note: packages that have external binary dependencies, or other components not directly installable from PyPi, PythonHosted, CRAN, or Anaconda will require a support request to be installed.
The Linux workspace runs Python 3.6 by default. In addition, ipykernal for Jupyter is installed to run operations from the Python 3.7 shell.
To access Python 3.7 run 'source activate python37' from the Workspace terminal command line. The base installs are made to Python 3.6.
If you require installs to the Python 3.7 shell please contact email@example.com.
The Windows workspaces runs Python 3.7 by default. To downgrade your environment please contact firstname.lastname@example.org.
Can I request to install other software, versions or packages?
- Yes, however, software applications or versions not listed above will need to be approved by Data Republic and tested prior to installation. Your request will be reviewed and if approved, it will be processed with 1-2 weeks. We cannot guarantee ongoing support for any applications not listed above.
- If you require any python packages that are not part of the standard Workspace (i.e. the package you require is not listed and ticked on this page), or not directly installable from PyPi, PythonHosted, CRAN, or Anaconda then please contact Data Republic directly to discuss your requirements. Any packages to be installed must be compatible with the Windows Workspace environment and should not require an internet connection to run.
How much does a Workspace cost?
Pricing for the Workspace is determined based on workspace size and associated costs. Your Workspace will be invoiced monthly. Please contact your Data Republic account manager for more information.
How do users access the Workspace?
Users can access the workspace in their web browser by logging into Senate and navigating to the workspace tab in their project.
The request to add users to a workspace is part of the workspace set up form. Once a workspace request is approved by Data Republic, a new version of the workspace request form will need to be submitted if new users need access to the workspace.
Can multiple users access the Workspace at the same time?
Multiple users can be given access to the same Workspace (just like multiple people can log into a single computer), however, only one user can connect to the Workspace at any time. If you require multiple users in your project to access approved datasets in the Workspace at the same time, you’ll need to request for multiple Workspaces to be created.
Can two or more Workspaces be set up for a project? How will this work?
- This collaborative arrangement allows users working in different Workspaces to connect to the same database at the same time. Any changes made to a table will be visible to users in all Workspaces.
- If there are files in an approved data package, all Workspaces will get a copy of these files stored locally; any locally stored files in a Workspace cannot not shared with other Workspaces. This means that any changes made to a file in one Workspace will not be reflected in other Workspaces.
- Any output request is subject to Data Republic approval and must align with the permitted use agreed to in the data license. If approved, output can be extracted from any Workspace once users have completed their analysis.