Video

Poe Platform Launches Dream 3.0 Image Model and Seedance 1.0 Lite Video Model

The Poe platform has officially launched the Image Generation Model Seedream 3.0 and the Video Generation Model Seedance 1.0 Lite, both developed by ByteDance (ByteDance), offering global users a more efficient and higher quality multi-modal content creation experience. This update marks another breakthrough for Poe in the field of image and video generation, providing creators with a seamless creation process from static images to dynamic videos.Seedream 3.0: A New Benchmark in Image Generation. As the latest image generation model under ByteDance's Volcano Engine, Seedream 3.0 has drawn industry attention due to its excellent image quality and semantic understanding capabilities.

6/17/2025 9:03:21 PM AI在线

ByteDance Seaweed APT2 is震撼 released! Real-time Interactive AI Video Generation Unlocks a New Era of 3D Virtual World

Recently, ByteDance launched Seaweed APT2, a revolutionary AI video generation model. Its breakthroughs in real-time video stream generation, interactive camera control, and virtual human generation have sparked heated discussions in the industry. This model is praised as "an important step towards the Holodeck" due to its efficient performance and innovative interactive features.Seaweed APT2: A New Benchmark for Real-Time Video GenerationSeaweed APT2 is an 800-million-parameter generative AI model developed by ByteDance's Seed team, specifically designed for real-time interactive video generation.

6/17/2025 2:02:26 AM AI在线

Video Ocean发布2K/4K HDR视频生成工具，性价比引爆全网

5月21日，潞晨科技旗下Video Ocean重磅推出全新AI视频生成工具，支持5-10秒内生成2K/4K HDR高质量大片，迅速登顶Product Hunt热门榜单，引发广泛关注。 Video Ocean提供海量模板，内置Laugh、Cakeify、Crush等炫酷特效，用户一键套用即可轻松创作电影级视频，即使新手也能快速上手，秒变“导演”。该工具支持文生视频、图生视频及角色生视频功能，满足多样化创作需求，从3D写实到赛博朋克风格均可实现。

5/22/2025 10:01:01 AM AI在线

Alibaba Qianwen Wan2.1-VACE Open Source Claims to Be the First Open-source Unified Video Editing Model

Wanxiang "Wan2.1-VACE" has been announced as open-source, marking a major technological revolution in the video editing field. The 1.3B version of Wan2.1-VACE supports 480P resolution, while the 14B version supports both 480P and 720P resolutions. The emergence of VACE brings users a one-stop video creation experience, allowing them to complete various tasks such as text-to-video generation, image reference generation, local editing, and video extension without frequently switching between different models or tools, greatly improving their creative efficiency and flexibility..

5/15/2025 10:01:53 AM AI在线

Released PixVerse V4.5 Video Model! Over 20 Movie Lenses + Multi-Image Fusion, Create a Hollywood Blockbuster in 5 Seconds!

PixVerse officially released its V4.5 video model, introducing over 20 new features in cinematic lens control, multi-image reference functionality, and enhanced handling of complex actions (). This update significantly improves the quality and creative freedom of video generation, solidifying PixVerse's leading position in the field of AI video creation. AIbase observed that the release of V4.5 quickly sparked discussions among creators worldwide, being hailed as a "milestone in movie-level AI video creation."Core Features: Professional Lens Control and Multi-Image Fusion.

5/15/2025 10:01:53 AM AI在线

Alibaba Open Sources All-in-one Video Foundation Model to Empower Video Generation and Editing

On the evening of May 14th, Alibaba officially launched Tongyi Wanxiang Wan2.1-VACE, which is currently the most comprehensive video generation and editing model in the industry. The highlight of this model lies in its multiple powerful capabilities, enabling it to simultaneously achieve text-to-video generation, image-based video generation, video retouching, local editing, background extension, duration extension, and other foundational generation and editing functions. This innovative product further lowers the threshold for video production, allowing more creators to easily get started..

5/15/2025 10:01:52 AM AI在线

一张显卡“看懂”一部电影：智源联合高校开源 Video-XL，打破长视频理解极限

长视频理解是多模态大模型的核心能力之一，也是迈向通用人工智能（AGI）的关键一步。然而，现有的多模态大模型在处理 10 分钟以上的超长视频时，仍然面临性能差和效率低的双重挑战。对此，智源研究院联合上海交通大学、中国人民大学、北京大学和北京邮电大学等多所高校，推出了小时级的超长视频理解大模型 Video-XL。

10/28/2024 4:29:25 PM 汪淼

Adobe 推出全新 AI 视频生成器 Firefly Video Model，完全使用授权内容进行训练

Adobe 公司今日发布了全新的人工智能驱动的文本转视频工具 Firefly Video Model。该工具能够根据文本提示生成全新的视频，与竞争对手不同，Adobe 声称 Firefly Video Model 完全使用授权内容进行训练，有望规避其他生成式 AI 工具所面临的伦理和版权问题。AI在线注意到，由于其使用授权内容进行训练，Adobe 称 Firefly Video Model 是“第一个公开可用的商业安全视频模型”。

10/15/2024 7:14:23 AM 远洋

Meta 推出革命性 AI 视频工具，让广告创意焕然一新

感谢科技媒体 The Verge 于 10 月 8 日发布博文，报道称 Meta 公司在 Advertising Week 活动中，推出了 Image Animation 和 Video Expansion 两款 AI 工具。Image Animation根据 Meta 公司分享的最新动图，用户可以选择一张静态照片，无需在 Instagram Reels 上使用任何现有的视频素材，就能生成创意视频。早期广告客户的反馈积极，图像动画帮助他们克服了资源有限的问题，并为广告创意提供了更长的使用寿命。Video Expan

10/10/2024 9:51:47 AM 故渊

谷歌 DeepMind 新研究：利用 AI 模型为无声视频配音

据谷歌 DeepMind 新闻稿，DeepMind 近日公布了一项利用 AI 为无声视频生成背景音乐的“video-to-audio”技术。IT之家获悉，当前 DeepMind 这款 AI 模型依然存在局限性，需要开发者使用提示词为模型预先“介绍”视频可能的声音，暂时不能直接根据视频画面添加具体音效。据悉，该模型首先会将用户输入的视频进行拆解，此后结合用户的用户文字提示，利用扩散模型反复运算，最终以生成与视频画面协调的背景声音，例如输入一条“在黑暗中行走”的无声视频，再添加“电影、恐怖片、音乐、紧张、混凝土上的脚步

6/18/2024 10:23:41 PM 漾仔

可从单张图像创建多视图 3D 视频，Stability AI 发布 Stable Video 3D 模型

Stability AI 近日发布了 Stable Video 3D 模型，该模型可从单张图像创建多视图 3D 视频。▲ 图源 Stability AI，下同Stable Video 3D 包含两个变体，其中 SV3D_u 能基于单个图像输入生成轨道视频，无需相机调节；而 SV3D_p 扩展了 SVD3_u 的功能，其可容纳轨道视图，允许沿着指定的摄像机路径创建 3D 视频。相较之前的 Stable Zero123 模型或开源替代品 Zero123-XL，Stable Video 3D 在质量上有明显提高，并具有更

3/21/2024 10:53:01 AM 溯波（实习）

资讯热榜

这样在本地搭建DeepSeek可以直接封神：本地部署+避坑指南（升级版） AI应用新纪元：2025中国AI应用排行榜榜单揭晓丨2025年1月免费！让图片放大不失真的位图转矢量图神器 Tmttool Mac也能跑Qwen3，一文看懂本地部署qwen 3配置要求 GGUF 是什么？一文看懂大模型里最火的模型格式后悔没早发现！教你用谷歌Gemini生成精美PPT（附提示词） 6秒视频10秒生成！全新AI视频神器 Grok Imagine 深度体验+元提示词分享 Sora、可灵、即梦哪家强？AI视频软件深度测评！

标签云

AI 人工智能 OpenAI AIGC 模型 ChatGPT 谷歌 DeepSeek AI新词 AI绘画大模型机器人数据 Midjourney 开源 Meta 微软智能用户 GPT 学习英伟达 Gemini 智能体技术马斯克 Anthropic 图像 AI创作训练 LLM 论文 AI for Science 代码腾讯苹果算法 Agent Claude 芯片具身智能 Stable Diffusion xAI 蛋白质人形机器人开发者生成式神经网络机器学习 AI视频 3D 大语言模型字节跳动 RAG Sora 百度研究 GPU 生成华为工具 AGI 计算生成式AI AI设计大型语言模型搜索亚马逊 AI模型视频生成特斯拉 DeepMind 场景 Copilot 深度学习 Transformer 架构 MCP 编程视觉