
We are delighted to announce that our AI research paper "Lightweight Temporal Transformer Decomposition for Federated Autonomous Driving" has been accepted and will be published in November 2025 as the top-ranking submission at the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2025.
This research introduces Lightweight Temporal Transformer Decomposition, a novel method that enables efficient temporal modeling in federated autonomous driving without compromising privacy or edge performance.
Let's dive into the key aspects of this groundbreaking research below:
Autonomous driving requires reliable navigation through ever-changing environments, necessitating systems that can anticipate hazards and adapt quickly.
Traditional vision models, which rely on single-frame inputs, struggle significantly with motion prediction. Centralized data collection also complicates matters by raising privacy concerns due to the sensitivity of vehicle data.
Federated learning (FL) offers a viable solution by facilitating decentralized training across vehicles while keeping data local. However, multiple challenges persist, such as:
=> Processing temporal sequences
=> Managing computational loads on edge devices
=> Ensuring convergence across heterogeneous data
Our proposed solution, Lightweight Temporal Transformer Decomposition (LTTD), addresses these challenges by integrating temporal data—image frame sequences and steering signals—into a federated learning framework.

LTTD is a novel approach tailored for federated autonomous driving, which leverages temporal data and a lightweight design.
This approach achieves state-of-the-art performance by striking a balance between privacy, efficiency, and accuracy. Its recognition as an oral presentation at the prestigious IROS 2025 conference underscores its significance.
Building a robust autonomous driving model in a federated learning environment requires efficient handling of temporal data without overwhelming limited edge resources.
Our method achieves this through a combination of innovative techniques tailored for federated autonomous driving.
Key Methodological Innovations:
This approach allows the integration of temporal data (past frames and steering sequences) into federated learning, improving prediction accuracy for dynamic driving conditions.
The lightweight architecture ensures feasibility for edge devices, making it practical for real-world autonomous vehicles.
Validating the effectiveness of our Lightweight Temporal Transformer Decomposition method is critical for its adoption in real-world autonomous driving systems.
We conducted extensive experiments to demonstrate its superiority in federated learning environments.
Experimental Highlights:
Our approach sets a new benchmark for federated autonomous driving, surpassing state-of-the-art methods in both accuracy and efficiency.
Future improvements may involve addressing limitations such as the reduced expressiveness of rank-1 tensor approximations through adaptive decomposition parameters or regularization techniques.

The acceptance of Lightweight Temporal Transformer Decomposition at IROS 2025 highlights its technical merit and aligns with the broader goals of the AIOZ DePIN ecosystem, which includes over 300,000 devices.
By introducing temporal data into federated learning, LTTD significantly enhances performance for real-world autonomous driving tasks, while its lightweight architecture ensures compatibility with edge deployment.
Although challenges like non-IID data scaling persist, our established track record in AI research suggests continued advancements.
AIOZ Network’s latest AI research paper, "Lightweight Temporal Transformer Decomposition for Federated Autonomous Driving", marks a pivotal leap forward in AI for robotics.
By merging transformer efficiency with the privacy advantages of federated learning, LTTD brings us closer to safe, scalable, real-world autonomous driving.
Explore the full project through the GitHub repository and project page.
As IROS 2025 approaches, AIOZ DePIN solutions and collaborations signal a decentralized AI evolution.
Paper Link: https://arxiv.org/abs/2506.23523
Github Page: https://aioz-ai.github.io/IROS2025_LTFed_github-page/
Source Code: https://github.com/aioz-ai/IROS2025_LTFed/tree/master?tab=readme-ov-file

AIOZ Network is a DePIN for Web3 AI, Storage, and Streaming.
Powered by a global community of AIOZ DePINs, AIOZ rewards you for sharing your computational resources for storing, transcoding, and streaming digital media content and powering decentralized AI computation.
AIOZ All Links | Website | X | Telegram

Text generation remains one of the most widely used AI capabilities. From drafting articles and composing captions to structuring short narratives and writing stories, creators and builders are constantly seeking models that can deliver high-quality text with minimal computational resources. SmolLM-135M introduces compact and efficient text generation that makes high-quality language synthesis more accessible and practical for real-world applications. About SmolLM-135M SmolLM-135M is a light

Now available on AIOZ AI—the collaborative marketplace powered by AIOZ DePIN—Archer Image Generator is a specialized text-to-image model designed to produce illustrations with sharp lines, flat shading, and the punchy, animated look fans associate with the TV show Archer. Trained on screenshots from the series alongside AI-generated images and user-contributed content, it captures the show’s unique look and feel by including “Archer style” tokens in your prompts. Whether you’re a fan of the ser

Now available on AIOZ AI V1—the collaborative marketplace powered by AIOZ DePIN—the Cartoonize Image Diffusion model transforms real photos into vibrant, stylized cartoons using simple, natural-language instructions. This customized diffusion model builds on Stable Diffusion 1.5 with instruction-tuning techniques from FLAN and the conditional editing approach of InstructPix2Pix, enabling direct & high-fidelity cartoonization without per-image fine-tuning. It excels at interpreting textual promp

Now available on AIOZ AI—the collaborative marketplace powered by AIOZ DePIN—the XFeat model delivers fast, lightweight, and accurate feature detection and matching for images captured from different viewpoints. Designed for efficiency, XFeat extracts discriminative keypoints and descriptors before performing rapid correspondence matching. This method makes it well-suited for resource-constrained environments where speed and reliability matter. Hosted on AIOZ AI using the PyTorch framework, XF

Now available on AIOZ AI—the collaborative marketplace powered by AIOZ DePIN—the Color Harmonization model transforms images by adjusting and enhancing color balance according to harmony principles, creating visually captivating and aesthetically balanced compositions. This computational model applies selected harmony templates to align colors, improving coherence while preserving details and visual impact. Based on the work of Amir Hossein Kargaran and implemented in PyTorch, it excels in ima

AIOZ AI is rolling out a powerful new capability: full support for Git over SSH (Secure Shell) with Git LFS (Large File Storage). Developers and creators can now manage source code and large AI assets - model weights, datasets, media files - directly on the AIOZ AI platform with speed, security, and zero friction. This is version control built for modern AI workflows. Why This Update Matters AI projects are large, complex, and resource-heavy. Traditional Git isn’t built for multi-gigabyte fi