计元 · 模型效率排行榜

通用计算单位 · 用计元量化不同AI模型的真实推理成本?

数据正常更新
更新于:2026-05-28T20:49:16
已接入模型:11

鼠标悬停模型名查看信息 · 点击?查看术语说明 · 计元 = 1 GFLOPs ?

#模型架构活跃参数?词表? 综合效率?
chars/tok
中文?
chars/tok
英文?
chars/tok
代码?
chars/tok
输入?
4K(计元)
输出?
4K(计元)
输入?
131K(计元)
输出?
131K(计元)
#1DeepSeek-V3MoE37.0B128,8153.742.036.683.6788.8222.0159.8399.6
#2DeepSeek-R1MoE37.0B128,8153.742.036.683.6788.8222.0159.8399.6
#3InternLM2.5-7BDense7.0B92,5443.711.806.684.0828.070.050.4126.0
#4Qwen2.5-72BDense72.0B151,6653.711.886.684.16144.0360.0259.2648.0
#5Qwen2.5-7BDense7.2B151,6653.711.886.684.1614.436.025.964.8
#6Qwen2.5-0.5BDense0.5B151,6653.711.886.684.162.05.03.69.0
#7Baichuan2-7BDense7.0B125,6963.482.006.602.7728.070.050.4126.0
#8Yi-1.5-6BDense6.0B63,9923.481.866.602.9824.060.043.2108.0
#9MiniCPM3-4BDense3.6B73,4483.381.806.462.8914.436.025.964.8
#10Phi-4Dense14.0B100,3523.120.936.684.1856.0140.0100.8252.0
#11Mistral-7BDense7.0B32,7682.740.955.992.9828.070.050.4126.0
计元协议 v1.0 · 锚定: 1计元 = 1GFLOPs (FP8) · 数据每周自动更新