Skip to main content

Windows 11 will soon harness your GPU for generative AI

Following the introduction of Copilot, its latest smart assistant for Windows 11, Microsoft is yet again advancing the integration of generative AI with Windows. At the ongoing Ignite 2023 developer conference in Seattle, the company announced a partnership with Nvidia on TensorRT-LLM that promises to elevate user experiences on Windows desktops and laptops with RTX GPUs.

The new release is set to introduce support for new large language models, making demanding AI workloads more accessible. Particularly noteworthy is its compatibility with OpenAI’s Chat API, which enables local execution (rather than the cloud) on PCs and workstations with RTX GPUs starting at 8GB of VRAM.

Nvidia’s TensorRT-LLM library was released just last month and is said to help improve the performance of large language models (LLMs) using the Tensor Cores on RTX graphics cards. It provides developers with a Python API to define LLMs and build TensorRT engines faster without deep knowledge of C++ or CUDA.

Get your weekly teardown of the tech behind PC gaming
Check your inbox!

With the release of TensorRT-LLM v0.6.0, navigating the complexities of custom generative AI projects will be simplified thanks to the introduction of AI Workbench. This is a unified toolkit facilitating the quick creation, testing, and customization of pretrained generative AI models and LLMs. The platform is also expected to enable developers to streamline collaboration and deployment, ensuring efficient and scalable model development.

A graph showing TensorRT-LLM inference performance on Windows 11.
Nvidia

Recognizing the importance of supporting AI developers, Nvidia and Microsoft are also releasing DirectML enhancements. These optimizations accelerate foundational AI models like Llama 2 and Stable Diffusion, providing developers with increased options for cross-vendor deployment and setting new standards for performance.

The new TensorRT-LLM library update also promises a substantial improvement in inference performance, with speeds up to five times faster. This update also expands support for additional popular LLMs, including Mistral 7B and Nemotron-3 8B, and extends the capabilities of fast and accurate local LLMs to a broader range of portable Windows devices.

The integration of TensorRT-LLM for Windows with OpenAI’s Chat API through a new wrapper will allow hundreds of AI-powered projects and applications to run locally on RTX-equipped PCs. This will potentially eliminate the need to rely on cloud services and ensure the security of private and proprietary data on Windows 11 PCs.

The future of AI on Windows 11 PCs still has a long way to go. With AI models becoming increasingly available and developers continuing to innovate, harnessing the power of Nvidia’s RTX GPUs could be a game-changer. However, it is too early to say whether this will be the final piece of the puzzle that Microsoft desperately needs to fully unlock the capabilities of AI on Windows PCs.

Editors' Recommendations

Kunal Khullar
Kunal is a Computing writer contributing content around PC hardware, laptops, monitors, and more for Digital Trends. Having…
What is Recall? Window’s controversial new AI feature, explained
Microsoft introducing the Recall feature in Windows 11.

When Microsoft went to launch its new Copilot+ PCs, it needed an AI feature that could showcase the power of the new NPU and AI models. That feature is Recall.

On one hand, it's a privacy nightmare wrapped in a glorified search bar. On the other, it could represent the biggest change to the way we use PCs in years.
What is Recall?

Read more
Microsoft is adding a controversial app to Windows 11
Microsoft Surface Laptop 2 sitting on a table.

A new Windows 11 build is rolling out in Microsoft's Beta channel, and it includes an app that's been caught up in some controversy. Build 22635.3646 includes the PC Manager app for devices in China by default. This app is already available through the Microsoft Store, but the update suggests the app might be part of Windows 11 more broadly soon.

PC Manager falls in the category of "system optimizers" along the lines of the  Razer Cortex Game Booster. It cleans out temporary files, frees memory that's not being used, and digs deep into your hard drive to clean out unused files. According to Microsoft, it can even "reduce ads and app pop-up interruptions." An system optimizer from Microsoft sounds great as an official release in Windows 11.

Read more
Nvidia ARM laptops may be in the works, and that could change everything
Intel and Nvidia badges on the Asus ROG Zephyrus G16.

Imagine a laptop with an iteration of Nvidia’s ARM-based CPU combined with a powerful RTX graphics card, all enhanced by AI. Years ago, that would have sounded outlandish, but now it seems like it could actually happen.

In a recent interview with Bloomberg, Nvidia CEO Jensen Huang and Dell CEO Michael Dell more or less confirmed that Team Green will enter the AI-PC hype next year.

Read more