Back

Blog details

AIOZ AI Research: Lightweight Temporal Transformer Decomposition for Federated Autonomous Driving

AIOZ Network
4 min readSeptember 03, 2025
aioz-ai

We are delighted to announce that our AI research paper "Lightweight Temporal Transformer Decomposition for Federated Autonomous Driving" has been accepted and will be published in November 2025 as the top-ranking submission at the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2025.

This research introduces Lightweight Temporal Transformer Decomposition, a novel method that enables efficient temporal modeling in federated autonomous driving without compromising privacy or edge performance.

Let's dive into the key aspects of this groundbreaking research below:

The Challenges in Autonomous Driving and the Need for Federated Learning

Autonomous driving requires reliable navigation through ever-changing environments, necessitating systems that can anticipate hazards and adapt quickly.

Traditional vision models, which rely on single-frame inputs, struggle significantly with motion prediction. Centralized data collection also complicates matters by raising privacy concerns due to the sensitivity of vehicle data.

Federated learning (FL) offers a viable solution by facilitating decentralized training across vehicles while keeping data local. However, multiple challenges persist, such as:

=> Processing temporal sequences

=> Managing computational loads on edge devices

=> Ensuring convergence across heterogeneous data

Our proposed solution, Lightweight Temporal Transformer Decomposition (LTTD), addresses these challenges by integrating temporal data—image frame sequences and steering signals—into a federated learning framework.

Key Innovations: Lightweight Temporal Transformer Decomposition

LTTD is a novel approach tailored for federated autonomous driving, which leverages temporal data and a lightweight design.

  • Federated Learning Setup: Enables decentralized model training across multiple vehicles while ensuring data remains local to preserve privacy.
  • Temporal Data Advantage: Leverages past frames and steering sequences to improve robustness in dynamic scenarios.
  • Lightweight Design: Decomposes large attention maps into smaller, computationally efficient matrices.
  • Real-Time Feasibility: Optimized for use on resource-limited edge devices.

This approach achieves state-of-the-art performance by striking a balance between privacy, efficiency, and accuracy. Its recognition as an oral presentation at the prestigious IROS 2025 conference underscores its significance.

Methodology: How LTTD Works

Building a robust autonomous driving model in a federated learning environment requires efficient handling of temporal data without overwhelming limited edge resources.

Our method achieves this through a combination of innovative techniques tailored for federated autonomous driving.

Key Methodological Innovations:

  • Federated Network Setup: A network of autonomous vehicles collaboratively trains a global driving policy by aggregating local model weights from each vehicle silo. Each vehicle minimizes a regression loss using local data, which ensures privacy.
  • Unitary Attention Decoupling: Applies a unitary attention mechanism to represent input triplets (e.g., image channels from multiple frames) as joint representations. This reduces attention tensor size by minimizing redundant linear combinations.
  • Tensor Factorization: Breaks down large attention maps into smaller, low-rank matrices using tensor factorization, which reduces computational complexity while preserving critical temporal information.
  • Efficient Training: The lightweight model design enables training on resource-constrained devices, ensuring convergence in federated settings and supporting real-time predictions.

This approach allows the integration of temporal data (past frames and steering sequences) into federated learning, improving prediction accuracy for dynamic driving conditions.

The lightweight architecture ensures feasibility for edge devices, making it practical for real-world autonomous vehicles.

Experimental Validation and Proven Performance

Validating the effectiveness of our Lightweight Temporal Transformer Decomposition method is critical for its adoption in real-world autonomous driving systems.

We conducted extensive experiments to demonstrate its superiority in federated learning environments.

Experimental Highlights:

  • Dataset Evaluation: Tested across three diverse datasets, LTTD outperforms other recent federated autonomous driving methods in both steering prediction and hazard anticipation.
  • Real-Time Performance: The lightweight design ensures efficient computation, enabling real-time predictions on resource-constrained edge devices.
  • Robustness to Temporal Data: Incorporating sequences of image frames and steering inputs (e.g., 5-30 frames) enables our model to effectively capture motion patterns and dynamic scenarios. Performance slightly degrades with excessive frame inputs due to approximation limitations.
  • Real Robot Experiments: Visualizations (Fig. 1) from real-world robot deployments confirm the model's smooth integration into edge systems while maintaining high performance.

Our approach sets a new benchmark for federated autonomous driving, surpassing state-of-the-art methods in both accuracy and efficiency.

Future improvements may involve addressing limitations such as the reduced expressiveness of rank-1 tensor approximations through adaptive decomposition parameters or regularization techniques.

Fig. 1

Why It Matters: Broader Impact of AIOZ AI Research

The acceptance of Lightweight Temporal Transformer Decomposition at IROS 2025 highlights its technical merit and aligns with the broader goals of the AIOZ DePIN ecosystem, which includes over 300,000 devices.

By introducing temporal data into federated learning, LTTD significantly enhances performance for real-world autonomous driving tasks, while its lightweight architecture ensures compatibility with edge deployment.

Although challenges like non-IID data scaling persist, our established track record in AI research suggests continued advancements.

Conclusion

AIOZ Network’s latest AI research paper, "Lightweight Temporal Transformer Decomposition for Federated Autonomous Driving", marks a pivotal leap forward in AI for robotics.

By merging transformer efficiency with the privacy advantages of federated learning, LTTD brings us closer to safe, scalable, real-world autonomous driving.

Explore the full project through the GitHub repository and project page.

As IROS 2025 approaches, AIOZ DePIN solutions and collaborations signal a decentralized AI evolution.

Paper Link: https://arxiv.org/abs/2506.23523

Github Page: https://aioz-ai.github.io/IROS2025_LTFed_github-page/

Source Code: https://github.com/aioz-ai/IROS2025_LTFed/tree/master?tab=readme-ov-file

About the AIOZ Network

AIOZ Network is a DePIN for Web3 AI, Storage, and Streaming.

Powered by a global community of AIOZ DePINs, AIOZ rewards you for sharing your computational resources for storing, transcoding, and streaming digital media content and powering decentralized AI computation.

Find Us

AIOZ All Links | Website | X | Telegram

We only send updates when meaningful changes ship, and you can unsubscribe anytime

Related Content

blog thumbnail

Lightweight Text Generation with SmolLM-135M: Fast, Compact, and Capable

Text generation remains one of the most widely used AI capabilities. From drafting articles and composing captions to structuring short narratives and writing stories, creators and builders are constantly seeking models that can deliver high-quality text with minimal computational resources. SmolLM-135M introduces compact and efficient text generation that makes high-quality language synthesis more accessible and practical for real-world applications. About SmolLM-135M SmolLM-135M is a light

ai-models
2 min readMarch 13, 2026
blog thumbnail

Archer Image Generator: Authentic Archer-Style Artwork Made Simple

Now available on AIOZ AI—the collaborative marketplace powered by AIOZ DePIN—Archer Image Generator is a specialized text-to-image model designed to produce illustrations with sharp lines, flat shading, and the punchy, animated look fans associate with the TV show Archer. Trained on screenshots from the series alongside AI-generated images and user-contributed content, it captures the show’s unique look and feel by including “Archer style” tokens in your prompts. Whether you’re a fan of the ser

ai-models
2 min readJanuary 21, 2026
blog thumbnail

Bring Your Photos To Life With Cartoonize Image Diffusion

Now available on AIOZ AI V1—the collaborative marketplace powered by AIOZ DePIN—the Cartoonize Image Diffusion model transforms real photos into vibrant, stylized cartoons using simple, natural-language instructions. This customized diffusion model builds on Stable Diffusion 1.5 with instruction-tuning techniques from FLAN and the conditional editing approach of InstructPix2Pix, enabling direct & high-fidelity cartoonization without per-image fine-tuning. It excels at interpreting textual promp

ai-models
2 min readJanuary 13, 2026
blog thumbnail

XFeat: Accelerated Features for Lightweight Image Matching

Now available on AIOZ AI—the collaborative marketplace powered by AIOZ DePIN—the XFeat model delivers fast, lightweight, and accurate feature detection and matching for images captured from different viewpoints. Designed for efficiency, XFeat extracts discriminative keypoints and descriptors before performing rapid correspondence matching. This method makes it well-suited for resource-constrained environments where speed and reliability matter. Hosted on AIOZ AI using the PyTorch framework, XF

ai-models
2 min readDecember 31, 2025
blog thumbnail

Color Harmonization: Smarter Color Control For More Coherent Designs

Now available on AIOZ AI—the collaborative marketplace powered by AIOZ DePIN—the Color Harmonization model transforms images by adjusting and enhancing color balance according to harmony principles, creating visually captivating and aesthetically balanced compositions. This computational model applies selected harmony templates to align colors, improving coherence while preserving details and visual impact. Based on the work of Amir Hossein Kargaran and implemented in PyTorch, it excels in ima

ai-models
2 min readDecember 24, 2025
blog thumbnail

Git over SSH And Git LFS: A Major Update on AIOZ AI

AIOZ AI is rolling out a powerful new capability: full support for Git over SSH (Secure Shell) with Git LFS (Large File Storage). Developers and creators can now manage source code and large AI assets - model weights, datasets, media files - directly on the AIOZ AI platform with speed, security, and zero friction. This is version control built for modern AI workflows. Why This Update Matters AI projects are large, complex, and resource-heavy. Traditional Git isn’t built for multi-gigabyte fi

1 min readDecember 18, 2025