Questions tagged with High Performance Compute
Hello,
I want to ask if it's possible to connect or unify the resources of two or more EC2 instances. The background is that we use heavy machine learning on-prem right now and are running into hardware...
2
answers
0
votes
262
views
asked 2 years ago
We need to deploy an EC2 instance with the following EBS configuration:
random 4k write: 200,000 IOPS
sequential read: 2000Mb/s
sequential write: 2000Mb/s
This is a database server requirement.
1. I have some...
1
answers
0
votes
798
views
asked 2 years ago
I've been working on some code that would benefit from some level of awareness about the platform on which it's running. When it runs on bare metal, several options are available (lshw, hwloc and so...
1
answers
0
votes
1275
views
asked 2 years ago
Hi folks,
I'm trying to set the Spark executor instances and memory, set the driver memory, and switch off dynamic allocation. What is the correct way to do it?
1
answers
0
votes
1113
views
asked 2 years ago
Does using SPOT_CAPACITY_OPTIMIZED launch Spot Instances into an Auto Scaling group in AWS Batch?
I am trying to run multiple jobs in a compute environment using AWS Batch. From my understanding, when there are multiple jobs in a job queue and the allocation strategy is BEST_FIT, AWS Batch will...
0
answers
0
votes
139
views
asked 2 years ago
Hi, I am getting an "instance reachability check failed" error. I took an image and launched an instance from it. I had a VPC and a security group, and I also associated an Elastic IP. In the system logs, it was...
1
answers
0
votes
367
views
asked 2 years ago
I followed the guide https://www.hpcworkshops.com/07-efa/01-create-efa-cluster.html to create an HPC cluster, and am running the MPI hello world application (git clone...
2
answers
1
votes
697
views
asked 2 years ago
Hello,
I am working with AWS ECS capacity providers to scale out instances for jobs we run. Those jobs have a large variation in the amount of memory that is needed per ECS task. Those memory needs...
1
answers
0
votes
819
views
asked 2 years ago
Dear all,
I cannot start my Ubuntu 20.04 g4dn.8xlarge EC2 instance because its status is failed.
The AMI I used is ami-088da9557aae42f39
Its message is "Instance reachability check failed"
When I check system...
1
answers
0
votes
308
views
asked 2 years ago
I have rebooted the EC2 instance and have also stopped it, but I am still facing issues. Please help me out. What should I do?
2
answers
0
votes
725
views
asked 2 years ago
I can no longer RDP onto the server. It takes a while on the configuring remote session step and then it quits.
1
answers
0
votes
554
views
asked 2 years ago
I have a modest-sized dataset, and I am running Jupyter Notebook in SageMaker (instance type ml.c5.xlarge with 200G instance size). I receive the error message "GC overhead limit exceeded". Everything...
1
answers
0
votes
711
views
asked 2 years ago