Best Method for large data sets in r

0

Apologies if this is a naive question or not the correct place to asked, new to AWS and not finding answers online.

I am working with sequencing data in r. Based on some research, using AWS EC2 rstudio seemed like a good approach for me since I do not have access to servers at my institution. I uploaded my data files from an external drive to S3. My challenge is how to work with the multiple files in r. From what I understand, you cannot use S3 as a working directory, just as a location to store the files. I also believe r has a memory limitation of 1 GB. I have multiple files that total approximately 14 GB. So reading the files into r is not a solution. Any suggestions around this, other solutions, or tutorials available?

asked a year ago23 views
No Answers

You are not logged in. Log in to post an answer.

A good answer clearly answers the question and provides constructive feedback and encourages professional growth in the question asker.

Guidelines for Answering Questions