| Buying a MacBook Pro to run local models: which configuration gives the best value for money? sjmcefc2 • 54 mins ago • Last reply by homonym | 29 |
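| llama.cpp currently has a major performance bug: the checkpoint traversal logic does not work for hybrid models (e.g. qwen3.6-27B), so most conversation turns have to prefill the full context again, which severely slows things down sentinelK • 1 day ago • Last reply by coefu | 15 |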
| Do GPUs also overclock when running LLMs? mingtdlb • 20h 29m ago • Last reply by mingtdlb | 4 |
| DiffusionGemma Livid PRO | 31 |
| Gemma4 12b is actually faster than Qwen3.5 9b, which I did not expect yuping913 • 1 day ago • Last reply by lifechan | 3 |
| Which NVIDIA GPUs are mainstream for large models these days? mingtdlb • 1 day ago • Last reply by lifechan | 20 |
| What? The Apple Watch can run Qwen locally now too? ericterminal • 3 days ago • Last reply by ericterminal | 7 |
| On the share of total time taken by prefill during inference on low-compute GPUs zzutmebwd • 3 days ago • Last reply by coefu | 8 |
| Need to buy a domestic (Chinese-made) GPU for local LLM deployment: which vendor is best? Flagship9945 • 3 days ago • Last reply by zomco | 115 |
| Can a MacBook Air M5 with 32 GB + 1 TB run local LLMs? TGOcc PRO | 17 |
| How can Gemma4 12B run on 16 GB of VRAM? CatCode • 4 days ago • Last reply by zzutmebwd | 25 |
| What configuration does a Mac mini need to run local models? kakalulin • 4 days ago • Last reply by kennylam777 | 18 |
| Which local LLMs can a Mac with 64 GB deploy? followadc • 7 days ago • Last reply by coefu | 19 |
| Are consumer GPUs (16 GB AMD cards) a poor fit for vllm and sglang? Inference with transformer seems faster than both frameworks and uses less VRAM zhengfan2016 • 4 days ago • Last reply by zzutmebwd | 20 |
| Choosing the best Mac configuration for local LLMs SteveRogers • 11 days ago • Last reply by SteveRogers | 26 |
| Want to build an AI workstation; experts, please weigh in davidyin • 4 days ago • Last reply by coefu | 82 |
| Thoughts on 5070ti model inference speed and local deployment tootfsg • May 20 • Last reply by tootfsg | 9 |
| Are there any offline model tools compatible with Win7? faketemp • May 19 • Last reply by tairan2006 | 12 |
| Is there a PC configuration suited to training AI locally? linxiaojialin • 2 days ago • Last reply by linxiaojialin | 2 |
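| Sharing a hammer-looking-for-a-nail project: a hypothetical middle layer that lets an enterprise, after deploying locally, connect its databases to an llm without manual data cleaning. KaiWuBOSS • May 10 • Last reply by yijihu | 2 |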