AI在线 AI在线

AWS Intensifies Infrastructure in AI Competition, SageMaker Platform Receives Major Upgrade

AWS has made a major upgrade to its machine learning and AI model training and inference platform, SageMaker, aiming to enhance user experience and strengthen its market competitiveness. This upgrade adds new observability features, connection to coding environments, and GPU cluster performance management, among other new capabilities.Since 2024, the SageMaker platform has become a unified data source integration center, integrating various machine learning tools.

AWS has made a major upgrade to its machine learning and AI model training and inference platform, SageMaker, aiming to enhance user experience and strengthen its market competitiveness. This upgrade adds new observability features, connection to coding environments, and GPU cluster performance management, among other new capabilities.

Since 2024, the SageMaker platform has become a unified data source integration center, integrating various machine learning tools. The main goal of this update is to help users better understand the reasons for model performance degradation and provide greater control over the allocation of computing resources.

AWS, Amazon, cloud service, Amazon, cloud computing, server

Ankur Mehrotra, manager of AWS's SageMaker, said in an interview with VentureBeat that many of the new features were inspired by user feedback. He mentioned that customers who develop generative AI models often face the problem of not being able to identify the specific layer where an issue occurs.

To address this, the introduction of the SageMaker HyperPod observability feature allows engineers to check the status of different layers, such as the compute layer and network layer. When model performance decreases, the system can issue alerts immediately and display related metrics on the dashboard.

Aside from the observability features, SageMaker has also added a local integrated development environment (IDE) connection feature, allowing engineers to seamlessly deploy AI projects written locally to the platform. Mehrotra noted that previously, locally coded models could only run locally, which posed significant challenges for developers wanting to scale their work. Now, AWS has introduced secure remote execution, enabling users to develop on their local machines or managed IDEs and connect to SageMaker, offering flexibility for different tasks.

AWS launched SageMaker HyperPod in December 2023, aiming to help customers manage server clusters for training models. HyperPod can schedule GPU usage based on demand patterns, helping customers effectively balance resources and costs. AWS stated that many customers hope to achieve similar services for inference tasks. Since inference tasks are usually performed during the day, while training tasks are often done during off-peak hours, this new feature will offer developers greater flexibility.

Although Amazon may not be as prominent as Google and Microsoft in foundational models, AWS continues to provide solid infrastructure support for enterprises building AI models, applications, or agents. In addition to SageMaker, AWS also launched the Bedrock platform, specifically designed for building applications and agents. With the continuous upgrades to SageMaker, AWS's competitiveness in the enterprise AI field becomes increasingly evident.

Key Points:

🌟 AWS has made a major upgrade to the SageMaker platform, adding observability and local IDE connection features.

⚙️ The SageMaker HyperPod feature helps users better manage server clusters and improve resource utilization.

🚀 AWS's layout in the AI infrastructure field will enhance its competitive advantage in the market.

相关资讯

​AWS 在 AI 竞争中加码基础设施,SageMaker 平台迎来重大升级

亚马逊网络服务(AWS)对其机器学习和 AI 模型训练与推理平台 SageMaker 进行了重磅升级,旨在提升用户体验并增强其市场竞争力。 这一升级增加了新型可观察性功能、连接编码环境以及 GPU 集群性能管理等多项新特性。 SageMaker 平台自2024年起,已转变为一个统一的数据源集成中心,集成了多种机器学习工具。
7/11/2025 2:41:05 PM
AI在线

Suno Launches v4.5+ Introduces Vocal Replacement Feature, Allows Replacing Original Vocals with Other Voices

Suno has officially launched the latest version of its AI music generation model, v4.5 , bringing unprecedented innovative features to music creators. This update not only optimizes audio quality and generation speed but also introduces the highly anticipated vocal replacement feature, further enhancing the flexibility and personalized experience of music creation.Vocal Replacement Feature: A Leap from Accompaniment to Full SongsSuno v4.5 introduces three core features, with the most notable being the "Add Vocals" vocal replacement feature.
7/18/2025 2:52:22 PM
AI在线

亚马逊投资500亿美元推动美国政府 AI 与超级计算发展

亚马逊近日宣布,将投入高达500亿美元,致力于提升美国政府在人工智能(AI)和超级计算领域的基础设施。 此项重大投资旨在支持美国政府的 AI 行动计划,帮助各政府机构加速数据发现、决策过程及任务工作流,尤其是通过更快的分析和自动化技术实现效率提升。 根据计划,从2026年开始,亚马逊网络服务(AWS)将增加约1.3吉瓦的新计算能力,覆盖其 “绝密”、“秘密” 及 “政府云(美国)” 区域。
11/25/2025 9:57:05 AM
AI在线