![]() $(lsb_release -cs) stable" | sudo tee /etc/apt//docker.list > /dev/null If you created your VM using ubuntu as the boot disk, you should follow theĬurl -fsSL | sudo gpg -dearmor -o /etc/apt/keyrings/docker.gpg To learn more about your workspace Cloud Environment storage, see Detachable Persistent Disks.įor additional bash capabilities, see Using the terminal and interactive shell in Terra.Ī deeper dive: Terra's Cloud Environment To understand what's under the hood and why RStudio and notebooks have these characteristics, see this article about key notebook components or this article about key notebook operations.Follow the process using the console Install docker Be sure to check the size of the files in the bucket after copying! If you want to copy individual files, you can replace `*` with the file name to copy. To make sure the files are in the bucket, you can run the following:īe careful when copying all filesUsing `*` can mean copying a lot of large files, which can be expensive. To copy all files generated in the notebook into the bucket, use the command: Note: Workspace storage is a Google bucket, so basic bash commands in the RStudio terminal need to be preceded by "gsutil." Save files to "bucket" with bash commands This helps avoid hardcoding these variables into the code to move the data. Using environment variables lets RStudio grab the workspace Google bucket directly. WORKSPACE_BUCKET is an environment variable that is predefined when using the terminal in Terra. To save data to an external Google bucket, run the command bucket="$gs://". To use the workspace bucket for storage, run the command bucket="$WORKSPACE_BUCKET". Setting a variable makes it so you can copy/paste the commands from the documentation. Set the variable "bucket" for the destination storage You can access a bash terminal from the Terminal tab in the main RStudio pane.Ģ. Note: This can be workspace storage or an external Google bucket. To move generated data to permanent cloud storage, follow the directions below. How to copy RStudio data to workspace storage To protect data (Jupyter and RStudio have a shared persistent disk), it's important to limit changes to those maintaining your PD data integrity (i.e., increasing disk size and keeping the same disk type). If you delete your PD or reconfigure your Cloud Environment in some ways (decreasing your virtual machine (VM) memory or persistent disk), you can lose all or some generated data unless you explicitly save your output to workspace or external storage (i.e., Google bucket).ĭon't lose data when running both Jupyter and RStudio!Note: You have to re-create the Cloud Environments when swapping between RStudio and Jupyter in the same workspace. To safeguard data when re-creating or deleting the Persistent Disk (PD) If you want to save valuable data, or archive data, especially if you want to copy it to less expensive Nearline or C oldlinestorage, first, you need to copy it to an external bucket. This is true even if you are working in a shared workspace, since each user has their own unique workspace Cloud Environment and Persistent Disk. ![]() Note: You need to copy data to workspace storage if you want colleagues to have access. ![]() To share data with collaborators (including in a shared workspace) To use as input for a workflowįiles generated by RStudio are not automatically saved in workspace storage (Google bucket) and are not accessible outside your personal virtual RStudio Cloud Environment. Why copy RStudio-generated data to workspace or external storage?īelow are the primary reasons you may want to copy data generated in RStudio (stored in the RStudio persistent disk by default) to workspace storage (or external Google bucket). This article describes how to move data from the RStudio persistent disk storage to a Google bucket (including your workspace storage) when working in RStudio in Terra. Sometimes, though, you will want to copy data files to more permanent cloud storage: e.g., when archiving data, or to allow collaborators or workflows access. Your RStudio Cloud Environment includes a detachable persistent disk (PD) that maintains generated data even when you re-create the virtual machine (VM).
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |