As of December 2023, the cheapest instance with an Nvidia GPU is g4dn.xlarge, so let's try that out. In that instance, lspci contains:
00:1e.0 3D controller: NVIDIA Corporation TU104GL [Tesla T4] (rev a1)
so we see that it runs an Nvidia T4 GPU.
Be careful not to confuse it with g4ad.xlarge, which has an AMD GPU instead. TODO meaning of "ad"? "a" presumably means AMD, but what is the "d"?
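To double check the GPU vendor from inside a running instance, the lspci output can be filtered down to graphics devices. This is a minimal sketch, assuming the three PCI class names below cover the GPUs of interest. gpu_lines is a hypothetical helper name, not part of any package:

```shell
# Keep only the lspci lines whose device class looks like a graphics device.
# The list of class names is an assumption, extend it if needed.
gpu_lines() {
  grep -E 'VGA compatible controller|3D controller|Display controller'
}
# Usage on a real machine: lspci | gpu_lines
```

On the Nvidia instance this should keep the "3D controller: NVIDIA Corporation" line, while on an AMD instance the vendor in the kept line would be different.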
Some documentation on which GPU is in each instance can be seen at: docs.aws.amazon.com/dlami/latest/devguide/gpu.html (archive) with a list of which GPUs they have at that random point in time. Can the GPU ever change for a given instance name? Likely not. Also as of December 2023 the list is already outdated, e.g. P5 is not shown, though it is mentioned at: aws.amazon.com/ec2/instance-types/p5/
When selecting the instance to launch, the GPU apparently does not show anywhere on the instance information page, it is so bad!
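One way to work around this might be to ask the EC2 API which GPU an instance type has before launching it. This is a minimal sketch, assuming the AWS CLI is installed and configured with credentials. gpu_of_instance_type is a hypothetical wrapper name:

```shell
# Print manufacturer, name and count of the GPUs of an EC2 instance type.
# Assumes a configured AWS CLI; the wrapper name is made up for this example.
gpu_of_instance_type() {
  aws ec2 describe-instance-types \
    --instance-types "$1" \
    --query 'InstanceTypes[].GpuInfo.Gpus[].[Manufacturer,Name,Count]' \
    --output text
}
# Usage: gpu_of_instance_type g4dn.xlarge
```

Running it for both g4dn.xlarge and g4ad.xlarge should make the Nvidia vs AMD difference visible without launching anything.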
Also note that this instance has 4 vCPUs, so on a new account you must first make a customer support request to Amazon to increase your limit from the default of 0 to 4, see also: stackoverflow.com/questions/68347900/you-have-requested-more-vcpu-capacity-than-your-current-vcpu-limit-of-0, otherwise instance launch will fail with:
You have requested more vCPU capacity than your current vCPU limit of 0 allows for the instance bucket that the specified instance type belongs to. Please visit aws.amazon.com/contact-us/ec2-request to request an adjustment to this limit.
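To see the current limit before trying to launch, the Service Quotas API can presumably be queried from the command line. This is a minimal sketch, assuming a configured AWS CLI and assuming that the relevant quota name contains "G and VT", which is not confirmed by anything on this page. g_instance_vcpu_quota is a hypothetical wrapper name:

```shell
# List the EC2 quotas whose name mentions the G instance family, together
# with their quota code and current value in vCPUs.
# Assumption: the quota name contains the string "G and VT".
g_instance_vcpu_quota() {
  aws service-quotas list-service-quotas \
    --service-code ec2 \
    --query "Quotas[?contains(QuotaName, 'G and VT')].[QuotaName,QuotaCode,Value]" \
    --output text
}
# Usage: g_instance_vcpu_quota
```

If nothing is printed, the name filter is likely wrong for your account or region, and dropping the --query option shows the full list.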
When starting up the instance, also select:
- image: Ubuntu 22.04
- storage size: 30 GB (maximum free tier allowance)
Once you finally manage to SSH into the instance, first we have to install drivers and reboot:
sudo apt update
sudo apt install nvidia-driver-510 nvidia-utils-510 nvidia-cuda-toolkit
sudo reboot
and now running:
nvidia-smi
shows something like:
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 525.147.05 Driver Version: 525.147.05 CUDA Version: 12.0 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|===============================+======================+======================|
| 0 Tesla T4 Off | 00000000:00:1E.0 Off | 0 |
| N/A 25C P8 12W / 70W | 2MiB / 15360MiB | 0% Default |
| | | N/A |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=============================================================================|
| No running processes found |
+-----------------------------------------------------------------------------+
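For scripts, the same information can be requested in a compact form rather than read off the table. This is a minimal sketch using the query mode of nvidia-smi. gpu_summary is a hypothetical helper name:

```shell
# Print GPU name, driver version and total memory as one CSV line per GPU.
# The helper name is made up for this example.
gpu_summary() {
  nvidia-smi --query-gpu=name,driver_version,memory.total --format=csv,noheader
}
# Usage: gpu_summary
```

This is also a quick way to confirm which driver version actually got loaded after the reboot.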
If we start from the raw Ubuntu 22.04, first we have to install drivers:
- docs.aws.amazon.com/AWSEC2/latest/UserGuide/install-nvidia-driver.html official docs
- stackoverflow.com/questions/63689325/how-to-activate-the-use-of-a-gpu-on-aws-ec2-instance
- askubuntu.com/questions/1109662/how-do-i-install-cuda-on-an-ec2-ubuntu-18-04-instance
- askubuntu.com/questions/1397934/how-to-install-nvidia-cuda-driver-on-aws-ec2-instance
From there basically everything should just work as normal. E.g. we were able to run a CUDA hello world just fine with:
nvcc inc.cu
./a.out
One issue with this setup, besides the time it takes to set up, is that you might also have to pay some network charges as it downloads a bunch of stuff into the instance. We should try out some of the pre-built images. But it is also good to know this pristine setup just in case.
We then managed to run Ollama just fine with:
curl https://ollama.ai/install.sh | sh
/bin/time ollama run llama2 'What is quantum field theory?'
which gave:
0.07user 0.05system 0:16.91elapsed 0%CPU (0avgtext+0avgdata 16896maxresident)k
0inputs+0outputs (0major+1960minor)pagefaults 0swaps
so way faster than on my local desktop CPU, hurray.
After setup from: askubuntu.com/a/1309774/52975 we were able to run:
head -n1000 pap.txt | ARGOS_DEVICE_TYPE=cuda time argos-translate --from-lang en --to-lang fr > pap-fr.txt
which gave:
77.95user 2.87system 0:39.93elapsed 202%CPU (0avgtext+0avgdata 4345988maxresident)k
0inputs+88outputs (0major+910748minor)pagefaults 0swaps
so only marginally better than on P14s. It would be fun to see how much faster we could make things on a more powerful GPU.
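The %CPU figures in the Ollama and Argos timings are consistent with computing (user + system) / elapsed. This is a minimal sketch of that arithmetic, where cpu_percent is a hypothetical helper and truncating to an integer is an assumption about how time rounds:

```shell
# cpu_percent USER_SECONDS SYSTEM_SECONDS ELAPSED
# ELAPSED uses the [h:]m:s.ss format printed by time, e.g. 0:39.93.
cpu_percent() {
  awk -v u="$1" -v s="$2" -v e="$3" 'BEGIN {
    # Convert the colon-separated elapsed time to seconds.
    n = split(e, p, ":")
    secs = 0
    for (i = 1; i <= n; i++) secs = secs * 60 + p[i]
    # CPU usage as an integer percentage, truncated.
    printf "%d\n", int((u + s) * 100 / secs)
  }'
}
```

cpu_percent 77.95 2.87 0:39.93 prints 202 and cpu_percent 0.07 0.05 0:16.91 prints 0. The 202% suggests that about two CPU cores were busy on average during the translation even with the CUDA device type selected. The 0% for Ollama is presumably because ollama run is only a client talking to a background server process, so time does not see the work done there.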
How AMD went from nearly Bankrupt to Booming by Brandon Yen (2021)
- youtu.be/Rtb4mjIACTY?t=118 Bulldozer series CPUs were a disaster
- youtu.be/Rtb4mjIACTY?t=324 got sued for marketing claims on number of cores vs number of hyperthreads
- youtu.be/Rtb4mjIACTY?t=556 Ryzen first gen was rushed and a bit buggy, but it had potential. Gen 2 fixed those.
- youtu.be/Rtb4mjIACTY?t=757 Ryzen Gen 3 surpassed the single thread performance of Intel. Previously Gen 2 had won multicore.
2025 round one during week of April 21st, not toning down online profiles:
- Turned down two days later before anything.
As evidenced by my Stack Exchange contributions, I love trying out new software to see if it works and how well. I love benchmarking it. And I love documenting what I observed in great detail to help others to choose the best software for them. I also love meeting various new people and understanding what they are up to and how I can help.
- Alice&Bob:
I'm looking to do a meaningful job in a deeptech field, and quantum computing seems like it could become huge. I've learnt a few basics, and would like to go further with job experience in the area.
Salary: 90k.
Rejected after first interview with the hiring manager.
- jobs.lever.co/alice-bob/b4632e27-cf56-4570-84bb-d56a169d1c43 Senior Software Engineer - Cloud. I could do this. But do I want to?
- Pasqal: careers.pasqal.com/jobs/5817098-software-development-engineer-integration Software Development Engineer Integration
Application auto-reply also points to:
which is cool.
I'm looking to do a meaningful job in a deeptech field, and quantum computing seems like it could become huge. I've learnt a few basics, and would like to go further with job experience in the area.
Rejected 1 week later without interview even though my CV seemed like a perfect match for this job. Sent an email to the contributors of Pulser:
I applied for this job careers.pasqal.com/jobs/5817098-software-development-engineer-integration but got rejected by HR lady without interview.
Pinging you guys here just in case because sometimes my profiles scare the HR people and then when I ping the programmers they like me. I grabbed your emails from GitHub.
- H Company: Fastest initial job application steps ever! Name, email, CV, over.
- jobs.ashbyhq.com/hcompany/e6793ce6-918b-48a6-bf56-205c477cc1c0 Member of technical staff (Evaluations)
- jobs.ashbyhq.com/hcompany/89d867e7-2bd3-4918-aebc-cabbac526b6f Senior Back-end Engineer
- Mistral AI:
- jobs.lever.co/mistral/db67d7a2-bcec-4151-9b3a-8212ddabf419 Senior Software Engineer, Data Engineering - Paris. Declined May 7th without interview, at least they said they have lots of applicants and some have more closely related qualifications.
- Poolside AI:
- poolside.ai/careers/member-of-engineering-evaluations--ba11fe78-f6f6-4165-b76b-020a46ad8fee Member of Engineering (Evaluations)
Of all the applications of deep learning, code generation is one of those that interest me the most as they seem one of the most pertinent in order to one day achieve AGI (the others being theorem proving and robotics), and I'd like to try and get some work experience in the area, which is why I'm applying to your company.
- poolside.ai/careers/member-of-engineering-data-platform--13d32f62-d530-4372-b458-0687d99eea04 Member of Engineering (Data Platform)
This job seemed like a possibility as I've done some personal "data intensive" projects in the past (not distributed unfortunately, ~500 GB so it fit on my local disk), and I kind of enjoyed it and would be interested in trying out a more "data heavy" job like this for a change.
Rejected May 6th without interview.
- Google:
- www.google.com/about/careers/applications/jobs/results/91141790538572486-senior-software-engineer-google-pixel-graphics Senior Software Engineer, Google Pixel Graphics
- www.google.com/about/careers/applications/jobs/results/96937277808091846-senior-software-engineer-pixel-graphics-gpu-software Senior Software Engineer, Pixel Graphics, GPU Software
- InstaDeep:
- www.instadeep.com/job-offer/fea583b3-d333-447e-8811-8ce58367c003/ Software Engineer (Simulation) asking 85k
I really like the idea of using simulations to speed up development of ideas or AI. I've done this in my past jobs in the semiconductor industry, and a bit on side time tinkering with simple simulation games that might be useful to train AI. I'm curious to what you will be simulating! The general areas that your company operates in, biotech and PCB design are also in my interest.
- AMD:
- careers.amd.com/careers-home/jobs/57882 Senior Software Development Engineer on ROCm. Rejected a few days later without interview.
- Common Crawl: commoncrawl.org/jobs Software Engineer/Data Scientist, Python, Spark
A network interface controller that does more than just the base OSI model protocols, notably in a programmable way.
- www.nextplatform.com/2022/05/11/intel-unrolls-dpu-roadmap-with-a-two-year-cadence/
- www.trentonsystems.com/blog/what-is-a-smartnic
- blogs.nvidia.com/blog/2021/10/29/what-is-a-smartnic/ "Some are using FPGAs which promise flexibility"
- www.servethehome.com/intel-ipu-exotic-answer-to-industry-dpu/ "Intel IPU is an Exotic Answer to the Industry DPU"
- 2022 www.datacenterdynamics.com/en/news/amd-to-buy-smartnic-firm-pensando-for-19-billion/ "AMD to buy SmartNIC firm Pensando for $1.9 billion"
- www.theregister.com/2022/06/14/alibaba_dpu_cloud/ mentions that Alibaba Cloud created their own.
Hyperscalers Lead The Way To The Future With SmartNICs by The Next Platform (2019)
- youtu.be/kwroXmFJJf0?t=599 financial industry is one of the users, notably high-frequency trading
Associated article: www.nextplatform.com/2019/10/31/hypercalers-lead-the-way-to-the-future-with-smartnics/ mentions that:
Google is widely believed to be working on its own design.