NTT Introduces Revolutionary AI Inference Chip for Real-Time 4K Video Processing at the Edge

In a significant advance for edge AI, NTT Corporation has introduced an AI inference chip capable of processing 4K video in real time at 30 frames per second while consuming less than 20 watts of power. NTT describes this large-scale integration (LSI) chip as the world’s first to achieve this class of AI video inference under such a tight power budget, marking a breakthrough for edge computing applications.

Bringing AI Power to the Edge: NTT’s Next-Gen Chip Unveiled

Debuted at NTT’s Upgrade 2025 summit in San Francisco, the chip is designed specifically for deployment in edge devices such as drones, smart cameras, and sensors. Unlike traditional AI systems that rely on the cloud for inference, this chip delivers potent AI capabilities directly at the edge, significantly reducing latency and eliminating the need to transmit ultra-high-definition video to centralized cloud servers for analysis.

The Significance of Edge Computing: Redefining Data Processing

In the realm of edge computing, data is processed locally on or near the device itself. This approach slashes latency, conserves bandwidth, and enables real-time insights even in settings with limited or intermittent internet connectivity. Moreover, it fortifies privacy and data security by minimizing the transmission of sensitive data over public networks, a paradigm shift from traditional cloud computing methods.

NTT’s revolutionary AI chip fully embraces this edge-centric ethos by facilitating real-time 4K video analysis directly within the device, independent of cloud infrastructure.
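
To make the bandwidth savings concrete, here is a back-of-the-envelope comparison in Python; the bitrate and result-size figures are illustrative assumptions, not NTT specifications.

```python
# Back-of-the-envelope comparison: streaming 4K video to the cloud vs.
# sending only inference results from the edge. All figures are
# illustrative assumptions, not NTT specifications.

VIDEO_BITRATE_MBPS = 25          # assumed H.265-compressed 4K@30fps stream
RESULT_BYTES_PER_FRAME = 2_000   # assumed detections serialized as JSON
FPS = 30

video_mb_per_hour = VIDEO_BITRATE_MBPS / 8 * 3600            # megabytes/hour
results_mb_per_hour = RESULT_BYTES_PER_FRAME * FPS * 3600 / 1e6

print(f"cloud upload: {video_mb_per_hour:,.0f} MB/h")   # ~11,250 MB/h
print(f"edge results: {results_mb_per_hour:,.0f} MB/h") # ~216 MB/h
```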

Unlocking New Frontiers: Real-Time AI Applications Redefined

Equipped with this chip, a drone can detect people or objects at distances of up to 150 meters, well beyond the ranges achievable when limited resolution or processing speed forces detection to run on downscaled video. This breakthrough opens doors to applications including infrastructure inspection, disaster response, agricultural monitoring, and enhanced security and surveillance.
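
Simple geometry suggests why 4K resolution extends detection range. The sketch below estimates how many pixels a person occupies at 150 meters; the 60° field of view and 0.5 m target width are hypothetical values, not NTT camera specifications.

```python
import math

# Estimate how many horizontal pixels a target occupies at a given range.
# The 60-degree field of view and 0.5 m target width are illustrative
# assumptions, not NTT camera specifications.

def pixels_on_target(sensor_px, fov_deg, target_m, range_m):
    ground_width = 2 * range_m * math.tan(math.radians(fov_deg / 2))
    return sensor_px * target_m / ground_width

for width in (1920, 3840):  # 1080p vs 4K horizontal resolution
    px = pixels_on_target(width, 60.0, 0.5, 150.0)
    print(f"{width}px sensor: ~{px:.1f} px across a person at 150 m")
```

Doubling horizontal resolution doubles the pixels on target, which is often the difference between a detectable object and noise at long range.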

All of this is achieved with a chip that consumes less than 20 watts, a fraction of the hundreds of watts typically drawn by GPU-powered AI servers, whose power requirements make them unsuitable for mobile or battery-operated systems.
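
A rough calculation shows what that power budget means for battery-powered platforms; the 90 Wh battery and 300 W GPU draw below are illustrative assumptions.

```python
# Rough battery-runtime comparison for the inference workload alone
# (propulsion power is ignored). The 90 Wh battery and 300 W server-class
# GPU draw are illustrative assumptions, not measured figures.

BATTERY_WH = 90.0

for label, watts in [("edge LSI (<20 W)", 20.0), ("server-class GPU", 300.0)]:
    print(f"{label}: ~{BATTERY_WH / watts:.1f} h of inference per charge")
```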

Breaking Down the Chip’s Inner Workings: NTT’s AI Inference Engine

Central to the LSI’s performance is NTT’s custom-designed AI inference engine, which delivers rapid, accurate results while keeping power consumption low. Notable innovations include interframe correlation, which exploits similarity between consecutive frames to reduce computation; dynamic bit-precision control, which adjusts numerical precision to the needs of each computation; and native execution of the YOLOv3 object-detection model, together enabling robust AI performance in settings that were previously too power-constrained for such workloads.
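
NTT has not published how these mechanisms work in hardware, but the idea behind interframe correlation can be illustrated in software: skip re-running the detector on regions that barely changed since the previous frame. A minimal sketch, with arbitrary tile size and threshold:

```python
import numpy as np

# Toy software analogy of interframe correlation: only tiles whose pixels
# changed beyond a threshold are re-run through the detector; results from
# the previous frame are reused elsewhere. Tile size and threshold are
# arbitrary assumptions; NTT's hardware mechanism is not publicly documented.

def tiles_to_reinfer(prev, curr, tile=64, threshold=8.0):
    diff = np.abs(curr.astype(np.int16) - prev.astype(np.int16)).mean(axis=-1)
    changed = []
    for y in range(0, diff.shape[0], tile):
        for x in range(0, diff.shape[1], tile):
            if diff[y:y + tile, x:x + tile].mean() > threshold:
                changed.append((y, x))
    return changed

prev = np.zeros((2160, 3840, 3), dtype=np.uint8)   # 4K frame
curr = prev.copy()
curr[0:64, 0:64] = 255                             # one tile changes
print(tiles_to_reinfer(prev, curr))                # [(0, 0)]
```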

Commercialization and Beyond: NTT’s Vision for Integration

NTT plans to commercialize the chip within fiscal year 2025 through NTT Innovative Devices Corporation. Researchers are also exploring its integration into the Innovative Optical and Wireless Network (IOWN), NTT’s next-generation infrastructure initiative. Paired with the ultra-low-latency communication of IOWN’s All-Photonics Network, the chip’s local processing power could significantly expand what edge devices can accomplish.

Additionally, NTT is collaborating with NTT DATA, Inc. to combine the chip’s capabilities with Attribute-Based Encryption (ABE) technology, enabling secure, fine-grained access control over sensitive data. Together, these technologies will support AI applications that demand both speed and security, such as in healthcare, smart cities, and autonomous systems.
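
Real ABE enforces access policies cryptographically: a ciphertext embeds a policy, and decryption succeeds only for keys whose attributes satisfy it. The toy sketch below mimics only the policy-matching logic, performs no encryption, and uses hypothetical attribute names.

```python
# Toy illustration of attribute-based access control, mimicking the
# policy-matching idea behind ABE. Real ABE enforces the policy
# cryptographically at decryption time; this sketch performs no encryption,
# and all attribute names are hypothetical.

def policy_satisfied(user_attrs: set[str], policy: list[set[str]]) -> bool:
    """Policy in disjunctive normal form: a list of clauses, each a set of
    attributes that must all be present (an OR of ANDs)."""
    return any(clause <= user_attrs for clause in policy)

policy = [{"role:doctor", "dept:cardiology"}, {"role:auditor"}]
print(policy_satisfied({"role:doctor", "dept:cardiology"}, policy))  # True
print(policy_satisfied({"role:doctor", "dept:oncology"}, policy))    # False
```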

Empowering a Smarter Tomorrow: NTT’s Legacy of Innovation

This AI inference chip epitomizes NTT’s commitment to building a sustainable, intelligent society through deep technological innovation. With its global reach, NTT is positioning the chip as the opening of a new era for AI at the edge, where intelligence meets immediacy and transformative advances become possible across many sectors.

  1. What is NTT’s breakthrough AI inference chip?
    NTT has unveiled a breakthrough AI inference chip designed for real-time 4K video processing at the edge. This chip is able to quickly and efficiently analyze and interpret data from high-resolution video streams.

  2. What makes this AI inference chip different from others on the market?
    NTT’s AI inference chip stands out from others on the market due to its ability to process high-resolution video data in real-time at the edge. This means that it can analyze information quickly and provide valuable insights without needing to send data to a centralized server.

  3. How can this AI inference chip be used in practical applications?
    This AI inference chip has a wide range of practical applications, including security monitoring, industrial automation, and smart city infrastructure. It can help analyze video data in real-time to improve safety, efficiency, and decision-making in various industries.

  4. What are the benefits of using NTT’s AI inference chip for real-time 4K video processing?
    Using NTT’s AI inference chip for real-time 4K video processing offers several benefits, including faster data analysis, reduced latency, improved security monitoring, and enhanced efficiency in handling large amounts of video data.

  5. Is NTT’s AI inference chip available for commercial use?
    NTT plans to commercialize the chip within fiscal year 2025 through NTT Innovative Devices Corporation, after which it is expected to become available for commercial use across various industries.

The Emergence of Neural Processing Units: Improving On-Device Generative AI for Speed and Sustainability

Experience the Revolution of Generative AI in Computing

The world of generative AI is not only reshaping our computing experiences but also revolutionizing the core of computing itself. Discover how neural processing units (NPUs) are stepping up to the challenge of running generative AI on devices with limited computational resources.

Overcoming Challenges in On-device Generative AI Infrastructure

Generative AI tasks such as image synthesis, text generation, and music composition demand significant computational resources. Cloud platforms have traditionally met these demands, but running the same workloads on the device, where compute, memory, and battery are limited, poses a different set of challenges. Discover how NPUs are emerging as the answer.

The Rise of Neural Processing Units (NPUs)

Explore the cutting-edge technology of NPUs that is transforming how generative AI runs on devices. Loosely inspired by the parallel structure of the human brain, NPUs dedicate silicon to the multiply-accumulate operations that dominate neural-network workloads, offering efficient and sustainable handling of AI tasks.

Adapting to Diverse Computational Needs of Generative AI

Learn how NPUs, integrated into System-on-Chip (SoC) designs alongside CPUs and GPUs, cater to the diverse computational requirements of generative AI. In a heterogeneous computing architecture, each task can be routed to the processor best suited to it, as sketched below.
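
One way to picture this is a dispatcher that routes each pipeline stage to the processor best suited for it. The routing table below is a hypothetical illustration, not an actual SoC scheduler API.

```python
# Hypothetical illustration of heterogeneous task routing on an SoC.
# The assignments reflect typical strengths (CPU: branchy control flow,
# GPU: wide parallel math, NPU: quantized neural-network inference);
# no real scheduler API is being used.

ROUTING = {
    "tokenize_prompt":    "CPU",  # sequential, branch-heavy logic
    "render_preview":     "GPU",  # wide floating-point parallelism
    "transformer_layers": "NPU",  # quantized matrix multiplies
    "postprocess_output": "CPU",
}

def dispatch(task: str) -> str:
    processor = ROUTING.get(task, "CPU")  # default to CPU as the fallback
    return f"{task} -> {processor}"

for task in ROUTING:
    print(dispatch(task))
```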

Real World Examples of NPUs

  • Leading tech giants are already shipping NPUs in their devices: Qualcomm’s Hexagon NPU, Apple’s Neural Engine, Samsung’s Exynos NPUs, and Huawei’s Kirin NPUs all accelerate on-device AI to enhance performance and user experience.

Unlock the Potential of NPUs for Enhanced On-device AI Capabilities

Experience the transformative power of NPUs in enhancing on-device AI capabilities, making applications more responsive and energy-efficient. As NPUs continue to evolve, the future of computing is brighter than ever.

1. What is a Neural Processing Unit (NPU) and how does it enhance generative AI on devices?
A Neural Processing Unit (NPU) is a specialized hardware component designed to accelerate the processing of neural networks, particularly for tasks like generative AI. By offloading intensive computations to an NPU, devices can run AI algorithms more efficiently and with greater speed, resulting in enhanced on-device generative AI capabilities.

2. How does the rise of NPUs contribute to the speed and sustainability of generative AI?
NPUs enable devices to perform complex AI tasks locally, without relying on cloud servers for processing. This reduces latency and enhances the speed of generative AI applications, while also lowering energy consumption and promoting sustainability by reducing the need for constant data transfer to and from remote servers.

3. What are some examples of how NPUs are being used to enhance on-device generative AI?
NPUs are being integrated into a wide range of devices, including smartphones, smart cameras, and IoT devices, to enable real-time AI-driven features such as image recognition, natural language processing, and content generation. For example, NPUs can power features like enhanced photo editing tools, voice assistants, and personalized recommendations without needing to rely on cloud resources.

4. How do NPUs compare to traditional CPUs and GPUs in terms of generative AI performance?
While traditional CPUs and GPUs are capable of running AI algorithms, NPUs are specifically optimized for neural network processing, making them more efficient and faster for tasks like generative AI. NPUs are designed to handle parallel computations required by AI algorithms, ensuring improved performance and responsiveness compared to general-purpose processors.

5. How can developers leverage NPUs to optimize their generative AI applications for speed and sustainability?
Developers can take advantage of NPUs by optimizing their AI models for deployment on devices with NPU support. By leveraging NPU-friendly frameworks and tools, developers can ensure that their generative AI applications run efficiently and sustainably on a variety of devices, delivering a seamless user experience with minimal latency and energy consumption.
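
As one concrete pattern, a framework such as ONNX Runtime lets an application prefer an NPU-backed execution provider and fall back to the CPU when it is unavailable. The QNNExecutionProvider named below targets Qualcomm NPUs and assumes an ONNX Runtime build that includes it; model.onnx is a placeholder path.

```python
import numpy as np
import onnxruntime as ort

# Prefer an NPU-backed execution provider, falling back to the CPU.
# "QNNExecutionProvider" targets Qualcomm NPUs and assumes an ONNX Runtime
# build that includes it; "model.onnx" is a placeholder path.

session = ort.InferenceSession(
    "model.onnx",
    providers=["QNNExecutionProvider", "CPUExecutionProvider"],
)
print("running on:", session.get_providers()[0])

# Dummy input matching a hypothetical 1x3x224x224 image model.
x = np.random.rand(1, 3, 224, 224).astype(np.float32)
input_name = session.get_inputs()[0].name
outputs = session.run(None, {input_name: x})
```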