How do I get access to PixVerse-R1?

Access is currently restricted to an invite-only beta program. You can sign up for the waitlist on the official website or monitor social media for code drops.

What is the maximum video resolution?

The model supports high-definition output up to 1080p. This resolution is maintained even during real-time interactive stream sessions.

How long can one session last?

Beta users can generate up to 5 minutes of continuous video in a single session. This allows for long-horizon narrative development within one digital world state.

Does it include audio generation?

Yes, the Omni model generates synchronized audio that matches the visual stream. It also supports character lip-syncing via the speech input feature.

Is there an API for developers?

Yes, AIsphere offers a RESTful API for enterprise partners. This allows for integration of real-time video generation into third-party applications.

How is the API priced?

Pricing uses a credit system where 1080p clips consume roughly 120 credits. Credit packs typically start at $10 for 1,000 credits.

What are the available generation modes?

R1 features POV, Ambient, and Dramatic modes. Each mode adjusts the camera style and intensity of the generated scene.

How does it compare to standard video generators?

Unlike traditional models that generate fixed 5-second clips, R1 provides a persistent world that responds live to new prompts.

PixVerse-R1

PixVerse-R1 is a next-gen real-time world model by AIsphere, offering interactive 1080p video generation with instant response and physics-aware continuity.

AI VideoWorld ModelReal-time AIMultimodalAIsphere

otherPixVerseJanuary 12, 2026

Context

360Ktokens

Max Output

360Ktokens

Modality:TextImageAudioVideo

Capabilities:Streaming

View API Documentation

About PixVerse-R1

Learn about PixVerse-R1's capabilities, features, and how it can help you achieve better results.

PixVerse-R1 functions as an interactive video generator, moving past static clip rendering into real-time world simulation. Launched by AIsphere in early 2026, it uses the Omni multimodal foundation model to process text, image, and video signals together. This model acts as a physics-aware simulator that maintains consistency across camera movements and object interactions. Unlike traditional frame interpolators, R1 predicts the next visual state based on user instructions and previous frames.

The system uses an Instantaneous Response Engine to minimize sampling to 1 to 4 steps. This efficiency allows for 1080p high-definition video with latency under 15 seconds. Users can participate in sessions lasting up to 5 minutes, changing scenes and character actions as the stream continues. The stateful nature of the digital environment ensures that space and time remain connected during long generations.

Continuity is managed through an autoregressive mechanism and memory-augmented attention. By unifying different input types natively, the model prevents the disjointed transitions common in multi-stage video pipelines. It is designed for creators who need immediate visual feedback and persistent narrative control.

Use Cases

Discover the different ways you can use PixVerse-R1 to achieve great results.

Live Stream Environments

Content creators can change weather or location effects in real-time based on live audience suggestions.

Collaborative Film Pre-viz

Directors can test camera angles and narrative beats during live brainstorming sessions to see results instantly.

Dynamic Game Worlds

Developers can generate persistent digital environments that respond to player commands without pre-rendered assets.

Virtual Production Backgrounds

Creating high-definition responsive backgrounds for LED volumes that react to lighting and camera shifts.

Immersive Brand Storytelling

Brands can build interactive visual experiences where customers guide the aesthetic flow of a product reveal.

Rapid Narrative Prototyping

Writers can visualize complex scenes as they write them, allowing for immediate iteration on pacing and visual logic.

Strengths

Limitations

Near-Instant Latency: The Instantaneous Response Engine enables 1080p generation with sub-15 second response times to user prompts.

Temporal Drift: Minor prediction errors can accumulate over long 5-minute windows, leading to occasional character distortion.

Persistent World State: Autoregressive modeling ensures physical continuity over 5-minute sessions rather than resetting between prompts.

Restricted Public Access: Availability is currently limited to an invite-only waitlist, restricting general commercial and developer use.

Native Multimodal Architecture: The Omni foundation model unifies text, image, and video tokens to prevent logical disconnects in generation.

Visual Jittering: High server utilization during the beta period can result in flickering or sudden disappearance of environmental objects.

Interaction Depth: Users can influence storylines and physics live, transforming passive video into a collaborative narrative tool.

Simplified Physics: Complex interactions are sometimes simplified to maintain the sampling speed required for real-time performance.

API Quick Start

aisphere/pixverse-r1

View Documentation

other SDK

import axios from 'axios';

async function generateRealTimeVideo() {
  const response = await axios.post('https://app-api.pixverse.ai/openapi/v2/video/t2v', {
    prompt: 'A rainy cyberpunk street at night with neon reflections',
    model: 'pixverse-r1',
    aspect_ratio: '16:9',
    mode: 'ambient',
    duration: 300 // 5-minute session in seconds
  }, {
    headers: {
      'API-KEY': 'YOUR_API_KEY',
      'ai-trace-id': Date.now().toString()
    }
  });

  console.log('Session Video ID:', response.data.Resp.video_id);
}

Install the SDK and start making API calls in minutes.

Community Feedback

See what the community thinks about PixVerse-R1

“The magic aquarium demo showed a goldfish responding instantly to prompts. It is not generating a clip, it is changing a live frame.”

— dotey

twitter

“It is not just a few seconds of video. It is a breathing world. You say 'rain' and the reflections and puddles compute immediately.”

— berryxia

twitter

“Most systems work in isolated bursts. PixVerse R1 carries forward true continuity and memory which Luma and Runway currently lack.”

— Singularity User

“The RESTful structure is a refreshing change for video models, making automation pipelines much easier to build than before.”

— DevGuru99

hackernews

“I used the API for a live art stream and the audience was losing their minds over how fast the scenery adapted to their chat prompts.”

— CinematicAI

“R1 is the first time I felt like I was actually directing an AI rather than just gambling with a random seed generator.”

— FrameChaser

twitter

Supercharge your workflow with AI Automation

Automatio combines the power of AI agents, web automation, and smart integrations to help you accomplish more in less time.

AI Agents

Web Automation

Smart Workflows

Get Started Free

Pro Tips

Expert tips to help you get the most out of PixVerse-R1 and achieve better results.

Use Ambient Mode for Stability

Select the Ambient setting to ensure the most consistent physical logic during long-duration sessions.

Wait for the Response Rhythm

Allow 10 to 12 seconds between instructions so the engine can transition the scene smoothly.

Reference Images with Fusion

Upload a starting image in Fusion mode to lock in specific character designs or environment layouts.

Keep Prompts Specific

Direct the model with clear actions rather than vague concepts to prevent character cloning or scene jumping.

Unique API Trace IDs

Ensure every API request has a unique trace ID to avoid receiving cached or duplicate generation results.

Testimonials

What Our Users Say

Join thousands of satisfied users who have transformed their workflow

Jonathan Kogan

Co-Founder/CEO, rpatools.io

Automatio is one of the most used for RPA Tools both internally and externally. It saves us countless hours of work and we realized this could do the same for other startups and so we choose Automatio for most of our automation needs.

Mohammed Ibrahim

CEO, qannas.pro

I have used many tools over the past 5 years, Automatio is the Jack of All trades.. !! it could be your scraping bot in the morning and then it becomes your VA by the noon and in the evening it does your automations.. its amazing!

Ben Bressington

CTO, AiChatSolutions

Automatio is fantastic and simple to use to extract data from any website. This allowed me to replace a developer and do tasks myself as they only take a few minutes to setup and forget about it. Automatio is a game changer!

Sarah Chen

Head of Growth, ScaleUp Labs

We've tried dozens of automation tools, but Automatio stands out for its flexibility and ease of use. Our team productivity increased by 40% within the first month of adoption.

David Park

Founder, DataDriven.io

The AI-powered features in Automatio are incredible. It understands context and adapts to changes in websites automatically. No more broken scrapers!

Emily Rodriguez

Marketing Director, GrowthMetrics

Automatio transformed our lead generation process. What used to take our team days now happens automatically in minutes. The ROI is incredible.

Jonathan Kogan

Co-Founder/CEO, rpatools.io

Mohammed Ibrahim

CEO, qannas.pro

Ben Bressington

CTO, AiChatSolutions

Sarah Chen

Head of Growth, ScaleUp Labs

We've tried dozens of automation tools, but Automatio stands out for its flexibility and ease of use. Our team productivity increased by 40% within the first month of adoption.

David Park

Founder, DataDriven.io

The AI-powered features in Automatio are incredible. It understands context and adapts to changes in websites automatically. No more broken scrapers!

Emily Rodriguez

Marketing Director, GrowthMetrics

Automatio transformed our lead generation process. What used to take our team days now happens automatically in minutes. The ROI is incredible.

Related AI Models

MiMo V2.5 Pro

Other

MiMo V2.5 Pro is Xiaomi's open-source 1.02T parameter MoE model featuring a 1M context window, native multimodality, and elite agentic coding performance.

1M context

$1.00/$3.00/1M

Frequently Asked Questions

Find answers to common questions about PixVerse-R1

PixVerse-R1

About PixVerse-R1

Use Cases

Live Stream Environments

Collaborative Film Pre-viz

Dynamic Game Worlds

Virtual Production Backgrounds

Immersive Brand Storytelling

Rapid Narrative Prototyping

Strengths

Limitations

API Quick Start

Community Feedback

Related Videos

Supercharge your workflow with AI Automation

Pro Tips

Use Ambient Mode for Stability

Wait for the Response Rhythm

Reference Images with Fusion

Keep Prompts Specific

Unique API Trace IDs

What Our Users Say

Related AI Models

MiMo V2.5 Pro

Frequently Asked Questions

How do I get access to PixVerse-R1?

What is the maximum video resolution?

How long can one session last?

Does it include audio generation?

Is there an API for developers?

How is the API priced?

What are the available generation modes?

How does it compare to standard video generators?