
    DeepSeek Releases Updated R1 AI Model on Hugging Face Under MIT License

    By EchoCraft AI | May 29, 2025 | 4 Mins Read

    DeepSeek has released an updated version of its R1 reasoning model, now available on Hugging Face under the permissive MIT license.

    Highlights

    • DeepSeek has released R1-0528, which the company describes as a “minor” upgrade to its flagship R1 model, now hosted on Hugging Face under the MIT license.
    • Massive scale: The model has 685 billion parameters, making it one of the largest open-source AI models to date; it is aimed at enterprise and research applications.
    • Open and permissive licensing: The MIT license allows unrestricted commercial use, modification, and integration into proprietary products.
    • Competitive performance: Ranks just behind OpenAI’s o4-mini and o3 on LiveCodeBench, outperforming Grok-3-mini and Qwen-3 in code generation tasks.
    • RL-centered training: Builds on DeepSeek’s reinforcement learning approach, first demonstrated without supervised fine-tuning in R1-Zero, to strengthen chain-of-thought reasoning and multi-step capabilities.
    • Distilled variants available: Includes Llama and Qwen-based versions for more accessible deployment, with one outperforming OpenAI’s o1-mini in multiple tests.
    • Documentation gaps: The Hugging Face repository ships model weights and configuration files but lacks in-depth documentation, deployment examples, and fine-tuning guidelines.
    • Strategic move: DeepSeek aims to position itself as a global open-source AI leader while navigating geopolitical scrutiny from U.S. regulators.
    • Broader implication: The release signifies the growing maturity of China’s open-source AI scene and the expanding competitive landscape beyond the West.
    • More than a “minor” update: R1-0528 reinforces DeepSeek’s momentum and provides a strong alternative for high-performance, open AI development.

    While the company describes the release as a “minor” upgrade, it reflects continued progress in its open-source AI efforts and growing presence in the global AI ecosystem.

    The announcement was shared via WeChat and highlights incremental improvements to the R1 model, which has been positioned as a notable open-source alternative to proprietary models from larger U.S.-based organizations such as OpenAI.

    Model Details

    The updated version, referred to as R1-0528, features a substantial 685 billion parameters, placing it among the largest open-source AI models currently available.

    This scale suggests the model is primarily intended for enterprise and research-grade applications, rather than consumer-level use, due to significant hardware requirements.
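
    To make the hardware point concrete, here is a back-of-the-envelope sketch of the memory needed just to hold the weights. The 685 billion figure comes from the article; the precision table and the 80 GB device size are illustrative assumptions, and the estimate ignores activations, KV cache, and framework overhead.

```python
import math

# Parameter count reported in the article.
PARAMS = 685e9

# Assumed storage formats (bytes per parameter); illustrative only.
BYTES_PER_PARAM = {"fp32": 4, "bf16": 2, "fp8": 1, "int4": 0.5}

# Hypothetical accelerator memory size in GB (assumption, not from the article).
DEVICE_GB = 80


def weight_memory_gb(num_params: float, bytes_per_param: float) -> float:
    """Memory for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


def min_devices(total_gb: float, device_gb: float) -> int:
    """Lower bound on devices needed if weights are split evenly."""
    return math.ceil(total_gb / device_gb)


if __name__ == "__main__":
    for name, nbytes in BYTES_PER_PARAM.items():
        gb = weight_memory_gb(PARAMS, nbytes)
        print(f"{name}: {gb:,.1f} GB, at least {min_devices(gb, DEVICE_GB)} devices")
```

    Under these assumptions, 16-bit weights alone come to roughly 1,370 GB, or at least 18 devices of 80 GB each, which is why a model of this size is out of reach for typical consumer hardware.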

    While the Hugging Face repository includes core configuration files and model weights, it currently lacks detailed documentation, performance insights, or deployment guidelines.

    Licensing and Commercial Use

    One key aspect of this release is its MIT licensing, which enables developers, researchers, and businesses to freely use, modify, and integrate the model into proprietary or commercial products.

    This move may broaden adoption, especially among enterprise users seeking high-performance, customizable models without restrictive licensing terms.

    Performance and Benchmarks

    In code generation tasks, R1-0528 has demonstrated competitive performance.

    According to LiveCodeBench—a benchmark developed collaboratively by UC Berkeley, MIT, and Cornell—the model ranks just below OpenAI’s o4-mini and o3 models, while outperforming xAI’s Grok-3-mini and Alibaba’s Qwen-3.
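
    Code-generation leaderboards commonly rank models by pass@k, the probability that at least one of k sampled solutions passes all tests. The article does not say which metric underlies the ranking above, so treating it as pass@1 is an assumption; the sketch below shows the widely used unbiased estimator, with hypothetical per-problem counts.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate for one problem.

    n: samples generated, c: samples that passed all tests, k: sample budget.
    """
    if not (0 <= c <= n) or not (1 <= k <= n):
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    if n - c < k:
        # Fewer than k failures: any draw of k samples contains a pass.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


def benchmark_score(results: list[tuple[int, int]], k: int) -> float:
    """Mean pass@k over problems; results holds (n, c) pairs."""
    return sum(pass_at_k(n, c, k) for n, c in results) / len(results)


if __name__ == "__main__":
    # Hypothetical outcomes for three problems, 10 samples each.
    hypothetical = [(10, 3), (10, 10), (10, 0)]
    print(f"pass@1 = {benchmark_score(hypothetical, k=1):.3f}")
```

    For k = 1 the estimator reduces to the fraction of samples that pass, averaged over problems, so small differences between neighboring models on a leaderboard can come down to a handful of problems.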

    Reinforcement Learning Without Supervised Fine-Tuning

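    A notable feature of DeepSeek’s approach is its training strategy. The R1 line relies heavily on reinforcement learning (RL): the company’s R1-Zero experiment applied RL with no initial supervised fine-tuning (SFT) phase, while R1 itself adds a small cold-start SFT stage before RL.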

    This method enables more autonomous learning and enhances the model’s chain-of-thought (CoT) reasoning, allowing for abilities such as self-verification, iterative reflection, and the generation of complex multi-step outputs.
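
    One way to see how RL can shape reasoning without labeled reasoning traces is a rule-based reward that scores only the output format and the final answer, leaving the intermediate steps unconstrained. The sketch below is a toy illustration, not DeepSeek’s training code: the tag names, the reward weights, and the exact-match check are all assumptions.

```python
import re

# Assumed output template: reasoning in <think>, final result in <answer>.
TEMPLATE = re.compile(
    r"^<think>(.+?)</think>\s*<answer>(.+?)</answer>$", re.DOTALL
)

FORMAT_REWARD = 0.5    # hypothetical weight for following the template
ACCURACY_REWARD = 1.0  # hypothetical weight for a correct final answer


def reward(completion: str, reference: str) -> float:
    """Score a completion by format and final answer only.

    The reasoning text itself is never graded, so the policy is free to
    discover strategies such as re-checking its own work.
    """
    match = TEMPLATE.match(completion.strip())
    if match is None:
        return 0.0  # malformed output earns nothing
    answer = match.group(2).strip()
    correct = answer == reference.strip()
    return FORMAT_REWARD + (ACCURACY_REWARD if correct else 0.0)


if __name__ == "__main__":
    good = "<think>2 + 2 = 4. Check: 4 - 2 = 2.</think><answer>4</answer>"
    print(reward(good, "4"))
```

    Because the reward needs only a verifiable final answer, it can be computed automatically for math or coding tasks, which is what makes large-scale RL of this kind practical.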

    Distilled Versions

    To increase accessibility and support academic research, DeepSeek has also released several distilled versions of the R1 model.

    These include adaptations based on Llama and Qwen architectures. One version, DeepSeek-R1-Distill-Qwen-32B, has surpassed OpenAI’s o1-mini in multiple benchmarks, achieving new performance highs among dense models in the open-source community.

    Open-Source Strategy

    The release of R1-0528 aligns with DeepSeek’s broader strategy to offer transparent and accessible AI tools.

    The company’s rise has also drawn regulatory attention, particularly from U.S. agencies concerned about the geopolitical implications of advanced AI development outside Western institutions.

    While the current update does not introduce radically new features, its availability on Hugging Face reinforces DeepSeek’s long-term commitment to open-source AI. It also signals a steady escalation in global competition over AI capabilities and influence.
