AMD has allowed you to dedicate more of your system memory as VRAM for the GPU for a while now, and Intel is following suit.
AMD has had a feature on its APUs for a while now that's attractive not just to gamers, but also local AI users; Variable Graphics Memory. Now, Intel is following suit, by adding a similar feature to its Core Ultra chips.
It was revealed by Intel's Bob Duffy (via VideoCardz), with the new Shared GPU Memory Override feature tagging along with the latest version of the Arc drivers.
In simplest terms, just as on AMD's recent APUs, you will now be able to decide how much of your total system memory is reserved for the GPU. This can help with gaming, but it's especially useful if you're using local LLMs on your machine.
Ollama doesn't currently support integrated GPUs, but something like LM Studio does, and allows you to load up even fairly chunky models such as gpt-oss:20b onto the GPU instead of the CPU.
Such models will work without manually selecting larger amounts of memory for the GPU, but there are benefits to doing it. Intel's Core Ultra chips aren't yet using true Unified Memory, such as you find on an Apple Mac or on AMD's latest Strix Halo chips. It sounds the same, but it isn't. This feature would be redundant on Unified Memory.
In my own (albeit brief) testing on an AMD Ryzen AI 9 HX 370 which doesn't utilize Unified Memory, setting a large amount for the GPU to use has performance benefits.
Recommended Comments
There are no comments to display.
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.