News
NVIDIA + OpenAI: Open-Weight GPT Models Now Run Locally on RTX AI PCs
A new chapter in local AI computing is here. OpenAI has released two powerful open-weight models — gpt-oss-20b and gpt-oss-120b — specifically optimized to run on NVIDIA RTX and RTX Professional GPUs. These models are designed for reasoning, chain-of-thought, and complex natural language tasks, and are now bringing LLM-level intelligence to everyday RTX-powered desktops.
💡 What’s New?
gpt-oss-20b and gpt-oss-120b are trained with a context length of 131K tokens, enabling significantly deeper prompts, whole-document analysis, and long multi-step reasoning within a single context.
On a GeForce RTX 5090, gpt-oss-20b can reach inference speeds of up to 256 tokens/sec — making locally deployed models not only viable but blazing fast.
Optimized for MXFP4, a 4-bit microscaling floating-point format, these models maintain high accuracy while cutting compute and memory usage — ideal for PCs and edge deployments.
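As a rough illustration of why MXFP4 matters for memory, the weight footprint can be estimated from bits per parameter. MXFP4 stores 4-bit elements with a shared 8-bit scale per block of 32 values, i.e. roughly 4.25 bits per parameter; the numbers below are back-of-envelope estimates based on the nominal parameter counts, not official model sizes.

```python
# Back-of-envelope memory estimate for model weights at different precisions.
# MXFP4 packs 4-bit elements plus one shared 8-bit scale per 32-element
# block, so roughly 4 + 8/32 = 4.25 bits per parameter.
def weight_memory_gb(num_params: float, bits_per_param: float) -> float:
    """Approximate weight storage in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9

PARAMS_20B = 20e9    # nominal parameter count of gpt-oss-20b
PARAMS_120B = 120e9  # nominal parameter count of gpt-oss-120b

print(f"gpt-oss-20b  FP16 : {weight_memory_gb(PARAMS_20B, 16):.1f} GB")
print(f"gpt-oss-20b  MXFP4: {weight_memory_gb(PARAMS_20B, 4.25):.1f} GB")
print(f"gpt-oss-120b MXFP4: {weight_memory_gb(PARAMS_120B, 4.25):.2f} GB")
```

The roughly 4x reduction versus FP16 is what lets a 20B-parameter model fit comfortably in consumer GPU memory.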
🛠️ Developer Ecosystem & Tools:
Developers can deploy and fine-tune these models on their local machines using tools such as:
Ollama – CLI tool for managing and running local models
llama.cpp – Efficient C++ implementation for inference
Microsoft’s AI Foundry Local – End-to-end environment for building local AI workflows
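As a minimal sketch of what local deployment looks like, the snippet below builds a request for Ollama's local HTTP API (by default served at `http://localhost:11434/api/chat`). The model tag `gpt-oss:20b` and the prompt are illustrative assumptions, and actually sending the request requires a running Ollama server with the model pulled, so only the payload construction is shown here.

```python
import json

# Sketch: build a chat request for Ollama's local HTTP API.
# Assumptions: Ollama listens on its default port 11434 and the model
# has been pulled under the tag "gpt-oss:20b".
OLLAMA_URL = "http://localhost:11434/api/chat"

def build_chat_request(model: str, prompt: str) -> dict:
    """Return the JSON payload Ollama's /api/chat endpoint expects."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "stream": False,  # request a single complete response
    }

payload = build_chat_request("gpt-oss:20b", "Summarize MXFP4 in one sentence.")
print(json.dumps(payload, indent=2))
# In practice you would POST this payload to OLLAMA_URL, e.g. with
# requests.post(OLLAMA_URL, json=payload) and read response.json()["message"].
```

The same payload shape works for any other model tag Ollama has pulled, which is what makes swapping between local models a one-line change.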
All of these are now optimized for the RTX AI PC ecosystem, making advanced AI development more accessible than ever.
🔍 Why This Matters:
Empowers developers, researchers, and startups to build and test LLM applications locally, without relying on cloud inference.
Accelerates on-device generative AI, including intelligent search, RAG, summarization, and coding copilots — all running on consumer hardware.
Offers privacy, cost-efficiency, and speed — critical for enterprise prototyping, offline workflows, and distributed development.
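To make the on-device RAG use case concrete, here is a minimal retrieval sketch: naive keyword-overlap scoring picks the most relevant passage and folds it into a grounded prompt for a locally hosted model. Real pipelines would use embeddings and a vector store; the documents and prompt template below are invented for illustration.

```python
# Minimal local-RAG sketch: score passages by keyword overlap with the
# query, then build a grounded prompt for a locally hosted model.
# The documents and prompt template are illustrative placeholders.
def score(query: str, passage: str) -> int:
    """Count query words that also appear in the passage (naive retrieval)."""
    return len(set(query.lower().split()) & set(passage.lower().split()))

def retrieve(query: str, passages: list[str]) -> str:
    """Return the passage with the highest overlap score."""
    return max(passages, key=lambda p: score(query, p))

def build_prompt(query: str, context: str) -> str:
    """Fold the retrieved context into a simple grounded prompt."""
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"

docs = [
    "RTX GPUs accelerate local inference with dedicated tensor cores.",
    "Ollama manages and runs open-weight models from the command line.",
]
best = retrieve("how do rtx gpus speed up local inference", docs)
print(build_prompt("How do RTX GPUs speed up local inference?", best))
```

Swapping the keyword scorer for an embedding model and the `max` call for a vector-store query turns this toy into the standard RAG pattern, with everything still running on one machine.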
With this release, RTX AI PCs are evolving from high-performance gaming and content creation machines into AI innovation platforms, bridging the gap between open models and real-time user applications.
💬 As AI becomes more open, more local, and more efficient — the next wave of LLM-based apps may begin not in a cloud data center, but right on your desk.