michael_thm
Arch-Supremacy Member
- Joined
- Jan 1, 2000
- Messages
- 21,689
- Reaction score
- 2,470
Haha. wait I strike toto 1st prize then I can go it your way ha.If you need self-hosted LLMs, sticking to Nvidia cards provide better software support on Linux. So it will be a better choice imo, even for 2x GPUs setup. Windows operating system generally has better support on AMD for AI. Choosing Linux over Windows for such use case are generally preferred. Though there will be some learning curve if you are not sufficiently "Linux savvy". Another setup option could be the newer AMD AI pro mini-PCs if you want to run really large models which requires lots of Vram, but are not going to come cheap too.
Personally, I am running my own self-hosted LLMs using the 3x RTX4000 SFF Ada Edition with 20GB VRAM which runs on 3 different hosts (each nodes on 5950X with 128GB ECC memory), loaded with 3 different LLMs models on Linux Mint 22.2 Zara (based on Ubuntu 24.04) virtual machines. I am also running a hyper-converged system with storage on 3x TrueNAS storage servers with a mixture of enterprise HDD and SSDs, inter-connected on a 2x 10GbE network backbone infrastructure (with MLAGG redundancy setup) which I use to cluster the hosts for bigger models when needed, though the latency will still be high (200G fiber network with RDMA preferred) but suitable for once in a while testing purposes.
Setting up your very own local LLMs or AI servers are really fun and opens up lots of possibilities. You can integrate with your smart home assistant over the Home Assistant platform, runs self monitoring 3D-printing farms, AI video/picture generator or create useful AI automations using n8n, just naming a few. Have fun!
On the topic of OS... I just happened to be thinking about deleting the ubuntu installation on SSD #2 to delegate it to loading models. Linux applications can live on WSL 2 I reckon? linux is annoying(for me)... even use chrome also it will demand password. Been using ubuntu on and off for many years but I won't dare to say I am savvy, just capable of following instructions nia. Windows is easier. Like today, I tried to learn how to use github to keep repositories up to date and I figured it out in 30mins, on linux I think it would take me a few hours with all the commandline entries? But still keeping the ubuntu for the rare use cases where windows just won't cut it... at least until I have time to try out WSL 2. In fact I bought my 6x8TB HDDs to try out ZFS. Overkill for my use case.
