New GoT-R1 Multimodal Model Released: Making AI Drawing Smarter, the New Era of Image Generation!
Recently, a research team from the University of Hong Kong, The Chinese University of Hong Kong, and SenseTime has released a remarkable new framework - GoT-R1. This innovative multimodal large model significantly enhances AI's semantic and spatial reasoning capabilities in visual generation tasks by introducing reinforcement learning (RL), successfully generating high-fidelity and semantically consistent images from complex text prompts. This advancement marks another leap forward in image generation technology. Currently, although existing multimodal large models have made significant progress in generating images based on text prompts, they still face many challenges when handling instructions involving precise spatial relationships and complex combinations.
6/26/2025 5:01:43 PM
AI在线
"AI Daily Report - June 26"; Doubao AI Programming Launches Major Upgrade; Google Open-Sources AI Agent Gemini CLI
Welcome to the AIbase [AI Daily] column! Learn about the day's major AI events in three minutes, and keep up with AI industry trends and innovative AI product applications. Visit more AI news at:. Doubao AI Programming has been greatly upgraded! No-code beginners can easily create their own web pages, with convenient real-time editing! Doubao AI Programming has been upgraded to "Application Creation 1.0," featuring visual editing, real-time preview, and multi-version management functions, lowering the barriers to web and application development.
6/26/2025 5:01:43 PM
AI在线
High-Level Call Between OpenAI and Microsoft! The Future of Their Cooperation Remains Uncertain
Amid intensifying competition in the field of artificial intelligence, Sam Altman, CEO of OpenAI, recently had a phone call with Satya Nadella, CEO of Microsoft, to discuss future cooperation. Altman revealed this in a podcast interview on Tuesday, where he mentioned that the conversation with Nadella mainly focused on how to revise their investment terms and equity arrangements going forward. Microsoft is a major investor in OpenAI, and differences have recently arisen between the two parties over investment details, particularly the scale of Microsoft's future shareholding. If no consensus can be reached on these key issues, Microsoft has even considered suspending further discussions with OpenAI.
6/26/2025 5:01:43 PM
AI在线
Google Launches Imagen4: Breaking the Text-to-Image Generation Bottleneck, Gemini API Empowers Text-to-Image
Recently, Google officially launched its latest text-to-image model **Imagen4** through the Gemini API, marking a significant milestone in the field of generative AI (AIGC). According to Google's official blog and community feedback, Imagen4 has made breakthroughs in generating text within images, solving long-standing technical bottlenecks in AIGC and providing developers with high-quality tools for visual content creation. It is reported that the model comes in two versions: **Imagen4** and **Imagen4Ultra**, priced at $0.04 and $0.06 per image respectively. Currently, paid previews are available on the Gemini API and Google AI Studio, with some free trial spots open. Compared to its predecessor Imagen3, Imagen4 significantly improves text rendering quality, supports image generation up to 2K resolution, and covers a diverse range of artistic styles from realistic to abstract.
6/26/2025 5:01:43 PM
AI在线
Doubao AI's Gaokao Score Reaches the Threshold for Tsinghua and Peking University Admission! Liberal Arts Score of 683 Leads Top Models at Home and Abroad
ByteDance's Seed team recently announced results from a 2025 Gaokao full-subject test: the Doubao Seed 1.6-Thinking model scored 683 in liberal arts and 648 in science, meeting the admission line for Tsinghua and Peking University and standing out among AI models from China and abroad. The test used the New National Paper I and Shandong's provincially set examination papers. Doubao competed against five top AI models, including Google's Gemini 2.5 Pro, DeepSeek R1, and OpenAI o3. Doubao achieved the highest score of 683 in liberal arts, while its science score of 648 was second to Google's Gemini 2.5 Pro with 655.
6/26/2025 5:01:43 PM
AI在线
Google Launches Gemini CLI! AI Assistant Touches the Developer Terminal
Recently, Google officially launched a new command-line tool - Gemini CLI. This tool is based on Google's self-developed Gemini 2.5 Pro AI model, aiming to provide developers with convenient AI Q&A and content generation services. With Gemini CLI, developers can directly call the powerful capabilities of AI in their own terminal interface, thus improving programming efficiency and work convenience. One of the highlights of Gemini CLI is its support for a context window of up to 1 million tokens, which means the AI model can process large amounts of information and has stronger understanding capabilities.
6/26/2025 5:01:43 PM
AI在线
Doubao AI Programming Gets a Major Upgrade! No-Code Beginners Can Easily Create Their Own Websites, Real-Time Editing Is Very Convenient!
Recently, Douyin's AI smart assistant, Doubao, has undergone a major functional upgrade. Its AI programming feature has officially evolved into "Application Creation 1.0," offering users a new experience with visual editing, real-time preview, and multi-version management. This update significantly lowers the barriers to web and application development, enabling users with no programming background to easily create personalized digital products.
Visual Editing, Web Design Without Code
The core highlight of Doubao's AI programming "Application Creation 1.0" is its powerful visual editing function.
6/26/2025 5:01:43 PM
AI在线
AI Hacker Rises to Power! XBOW's Autonomous AI Tool Dominates HackerOne, Revealing Thousands of Vulnerabilities and Intimidating the Cybersecurity Industry
Recently, AI security company XBOW announced that its self-developed AI tool "XBOW" has outperformed other participants on the globally renowned bug bounty platform HackerOne, ranking first in the United States. This is the first time an AI tool has surpassed human security researchers to top the HackerOne vulnerability disclosure ranking, marking a milestone breakthrough for AI in the field of vulnerability detection.
XBOW AI: Pioneering Fully Automated Penetration Testing
XBOW's AI tool is a fully autonomous penetration testing (pentest) system that simulates the operations of human security researchers without any human intervention, identifying and exploiting software vulnerabilities. It is reported that the tool can complete comprehensive penetration tests within hours, covering various types of vulnerabilities such as remote code execution (RCE), SQL injection, cross-site scripting (XSS), server-side request forgery (SSRF), and information leakage.
6/26/2025 5:01:43 PM
AI在线
ChatGPT iOS App Monthly Downloads Near 30 Million, Surpassing Every Individual Social App
The ChatGPT iOS app has been downloaded 29.6 million times in the past 28 days, making it the most downloaded app worldwide. That total approaches the combined downloads of four major social apps (TikTok, Facebook, Instagram, and X), which had approximately 32.9 million downloads during the same period, a gap of 10.6%. Although the social apps have been on the market far longer, ChatGPT has reached this level in a short time, showing its strong appeal. According to data from the analytics company Similarweb, ChatGPT ranked first in global app downloads in April, ahead of competitors such as Instagram and TikTok.
6/26/2025 5:01:39 PM
AI在线
Image Giant Getty Images Drops Core Copyright Claims Against Stability AI, UK Case Continues
Recently, Getty Images announced in the High Court in London that it had withdrawn its main copyright infringement claims against Stability AI, further narrowing the focus of this closely watched legal battle. The core of the lawsuit is how AI companies use copyrighted content to train their models. The withdrawal does not end the case: the company is still pursuing other claims and has filed a separate lawsuit in the United States. This development highlights the ambiguous territory between content ownership and usage rights in the era of generative AI.
6/26/2025 5:01:39 PM
AI在线
Google Launches AlphaGenome: AI Technology Helps Crack the Code of Life in One Click
In the field of life sciences, Google DeepMind has once again set a new trend by introducing a revolutionary artificial intelligence tool - AlphaGenome. This new AI model can not only read up to 1 million DNA bases but also accurately predict how gene mutations affect molecular functions, marking a major breakthrough in biological research.
Decoding the Blueprint of Life
The genome is the blueprint of life, recording the DNA instructions for each cell.
6/26/2025 5:01:39 PM
AI在线
WhatsApp Launches AI Message Summary Feature, Meta AI Can Summarize Personal Chat Records
WhatsApp has recently launched a new AI message summary feature, allowing users to get intelligent summaries of their personal chat records through Meta AI. This feature is currently available in English in the United States and is planned to be expanded to more countries and languages later this year. Users can access this feature by clicking the button to expand all unread messages in a chat. Instead of displaying the original messages directly, WhatsApp uses Meta AI to generate a bullet-point summary of the content users have missed, avoiding the burden of reading long messages. Meta specifically emphasizes that this feature employs "privacy-preserving" technology, creating a "secure cloud environment" to hide the interaction process between users and the AI model, claiming it can prevent Meta itself and other third parties from eavesdropping on user messages.
6/26/2025 5:01:39 PM
AI在线
OpenAI Rewrites Codex CLI Entirely in Rust to Improve Performance
Recently, OpenAI announced that it will rewrite its Codex CLI tool, completely abandoning TypeScript and switching to the Rust language. This decision aims to provide developers with a more efficient and stable AI terminal interaction experience. Codex CLI was initially designed to simplify developers' interactions with AI in the terminal, and was built with TypeScript and the React-based Ink framework.
6/26/2025 5:01:39 PM
AI在线
New Oriental Launches Its First Original AI Education Product - New Oriental AI 1-on-1, Revolutionizing the Traditional Learning Model
On June 25, 2025, New Oriental officially launched its first original AI educational product for consumers - New Oriental AI 1-on-1. This is not only a major breakthrough in teaching methods but also marks a key step in New Oriental's "Education AI" strategic layout. The core competitiveness of New Oriental AI 1-on-1 lies in providing learners with a high-frequency interactive 1-on-1 learning experience. The AI teacher can realistically recreate the learning environment, achieving real interaction and real Q&A.
6/26/2025 5:01:39 PM
AI在线
Gaokao College Applications Drive Surge in Quark Deep Search, Each Student Uses It an Average of 4 Times
With college entrance exam (Gaokao) score lines released across 31 provinces and cities, demand for help with college application choices has surged. On June 26, AIbase learned from Quark that in the past three days, tens of millions of candidates and their parents have used Quark's college entrance exam services. Over 5 million application reports were generated by the Gaokao application model, with nearly half of the demand coming from third-tier and lower cities. June 25 was the day with the heaviest system load.
6/26/2025 5:01:38 PM
AI在线
Ant Group Accelerates the Promotion of AI Healthcare and Launches a New Large Model Application "AQ"
Ant Group officially launched the AI health app "AQ" on June 26, offering more than 100 AI functions such as health education, medical consultation, and report interpretation. It connects over 5,000 hospitals nationwide, nearly one million doctors, and nearly 200 AI avatars of renowned doctors. The app is now available on major app stores.
6/26/2025 5:01:38 PM
AI在线
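Fourth Wave! June 2025 Roundup of Handpicked Practical Design Resources
Hello everyone, this is the fourth resource roundup for June 2025! This issue opens with 3 rather good directory sites for AI and design/development tools. They have collected and organized a large number of usable tools for designers, developers, and AI enthusiasts, with clear labels that make the tools easy to locate, search, and use. The roundup also includes a highly practical online editor for pixel-art illustration and animation, plus 2 websites about the popular designer toy Labubu.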
6/26/2025 4:49:59 PM
陈子木
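Ant Group Releases AI Health App AQ: Check Symptoms, Find Doctors, Read Reports
Ant Group has released the AI health app AQ, which offers more than 100 functions such as health education, medical consultation, and report interpretation, and connects over 5,000 hospitals and nearly one million doctors nationwide. AQ is built on Ant's medical large model, supports multimodal interaction, and has passed authoritative evaluations. #AntGroup #AIHealthAppAQ #HealthcareServiceUpgrade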
6/26/2025 3:21:39 PM
远洋