Alibaba AI model-based agentic framework tops global ranking

By Ann Cao

Alibaba Group Holding's open-source Qwen artificial intelligence (AI) model has enabled the agentic framework DeepSWE to outperform rival systems in this growing segment, according to the software platform's developers.
Jointly developed by the open-source initiative Agentica and San Francisco-based start-up Together AI, DeepSWE was trained on the Qwen3-32B large language model (LLM), part of Alibaba Cloud's third-generation family of AI models. It topped the leaderboard of the latest SWE-bench Verified test, scoring 59 per cent accuracy against other so-called open-weight models such as DeepSeek's V3-0324, the developers said in a blog post on Wednesday.
Agentic frameworks are software platforms that provide the structure, tools and functionalities to build, deploy and manage AI agents. They enable AI agents to collaborate, make decisions and automate complex tasks.
AI agents, such as Chinese start-up Butterfly Effect's Manus, are software programs capable of autonomously performing tasks on behalf of a user or another system. Essentially, these agents create a plan of specific tasks and subtasks to complete a goal using available resources.
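How such an agent works can be pictured as a toy planning loop: the model is asked to break a goal into subtasks, and the agent then works through them with the tools available to it. The Python sketch below is purely illustrative; the call_llm stub, TOOLS table and run_agent function are hypothetical names, not Manus's or DeepSWE's actual code.

```python
# Minimal illustrative sketch of an agentic loop: the agent asks a model to
# break a goal into subtasks, then works through them with available "tools".
# All names here (call_llm, TOOLS, run_agent) are hypothetical placeholders.

def call_llm(prompt: str) -> str:
    """Stand-in for a real LLM call (e.g. a hosted Qwen endpoint)."""
    return "1. read the bug report\n2. reproduce the failure\n3. patch the code"

TOOLS = {
    "read": lambda task: f"read notes for: {task}",
    "run": lambda task: f"executed: {task}",
}

def run_agent(goal: str) -> list[str]:
    # Step 1: plan - turn the goal into an ordered list of subtasks.
    plan = call_llm(f"Break this goal into numbered subtasks: {goal}")
    subtasks = [line.split(".", 1)[1].strip() for line in plan.splitlines() if "." in line]

    # Step 2: act - execute each subtask with whichever tool fits.
    results = []
    for task in subtasks:
        tool = TOOLS["run"] if "patch" in task or "reproduce" in task else TOOLS["read"]
        results.append(tool(task))
    return results

if __name__ == "__main__":
    for step in run_agent("fix a failing unit test in a GitHub repository"):
        print(step)
```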
DeepSWE marks the latest example of Hangzhou-based Alibaba's growing leadership position in the global open-source community. Alibaba owns the South China Morning Post.
The open-source approach gives public access to a program's source code, allowing third-party software developers to modify or share its design, fix bugs or scale up its capabilities.

DeepSWE was developed by post-training the Qwen3-32B model using rLLM, Agentica's modular reinforcement learning (RL) system.
"We've open-sourced everything – our data set, code, training and eval logs – for everyone to progress on scaling and improving agents with RL," the developers' blog post said.
Trained for six days on a computing facility powered by Nvidia's H100 graphics processing units, DeepSWE was built to solve complex software engineering tasks such as implementing new code features, debugging and resolving issues on the online developer platform GitHub.
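One common design in RL post-training for software engineering agents is a sparse reward based on whether the project's test suite passes after the agent's candidate patch is applied. The Python sketch below illustrates that general idea; it is not rLLM's API or DeepSWE's training code, and the reward_for_patch helper is a hypothetical name.

```python
# Conceptual sketch of a sparse reward for software-engineering RL: apply the
# policy's candidate patch, run the project's tests, and grant reward 1.0 only
# if every test passes. Illustrative only; not rLLM's or DeepSWE's actual code.
import os
import subprocess
import tempfile

def reward_for_patch(repo_dir: str, patch: str) -> float:
    """Apply a candidate patch and return 1.0 if the test suite passes, else 0.0."""
    with tempfile.NamedTemporaryFile("w", suffix=".diff", delete=False) as f:
        f.write(patch)
        patch_path = f.name
    try:
        applied = subprocess.run(
            ["git", "apply", patch_path], cwd=repo_dir, capture_output=True
        )
        if applied.returncode != 0:
            return 0.0  # unparseable or conflicting patch earns no reward
        tests = subprocess.run(["pytest", "-q"], cwd=repo_dir, capture_output=True)
        return 1.0 if tests.returncode == 0 else 0.0
    finally:
        os.unlink(patch_path)
```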
The development of DeepSWE comes more than two months after Alibaba Cloud made Qwen3 available on more developer platforms online, as the company pushed for wider international adoption of its open-source systems. Alibaba Cloud started open-sourcing Qwen models in August 2023.
Released in April, the Qwen3 AI models can be deployed via LLM platforms Ollama, LM Studio, SGLang and vLLM, according to the Qwen team's post on their X account.
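For developers, trying one of these routes typically takes only a few lines of code. The sketch below loads a Qwen3 model with vLLM's offline inference API; the model identifier Qwen/Qwen3-4B, and the assumption that local hardware can hold it, are illustrative rather than drawn from the article.

```python
# Minimal sketch: run a Qwen3 model locally with vLLM's offline inference API.
# Assumes the Hugging Face model ID "Qwen/Qwen3-4B" and a GPU large enough to
# hold it; check the official Qwen3 model cards before running.
from vllm import LLM, SamplingParams

llm = LLM(model="Qwen/Qwen3-4B")  # weights are downloaded on first use
params = SamplingParams(temperature=0.7, max_tokens=128)

outputs = llm.generate(["Explain what an agentic framework does."], params)
print(outputs[0].outputs[0].text)
```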
Benchmark tests cited by Alibaba in April said models such as Qwen3-235B and Qwen3-4B either matched or exceeded the performance of advanced models from both overseas and domestic competitors, including ChatGPT creator OpenAI's o1, Google's Gemini and DeepSeek's R1, in areas like instruction following, coding assistance, text generation, maths skills and complex problem solving.
Qwen was now "the world's largest open-source model family", Alibaba chairman Joe Tsai and CEO Eddie Wu Yongming said in a letter to shareholders last month. They added that the company had open-sourced more than 200 Qwen models as of April, generating more than 300 million global downloads and over 100,000 derivative models.
Alibaba Cloud on Thursday announced that it would invest more than US$60 million before the end of its current financial year in March to accelerate AI innovation via its partner ecosystem.
That followed Alibaba CEO Wu's commitment in February to "aggressively invest" in AI and cloud computing infrastructure, with an outlay of at least 380 billion yuan (US$53 billion) over the next three years, the largest-ever computing project financed by a single private business in China.
