

Just yesterday, I had a studymate try to open an xlsx file on their phone - they had the Microsoft 365 app installed that would do this, but a recent update to that app just decided to change it to a Copilot only app.
That’s hilarious. I had the 365 app installed on my phone, because the company I work for uses Microsoft everything and I had to install that POS in the sandbox so I could open up any word or excel docs that were sent to me. Well I just opened it up, and sure enough, it’s now just a chat bot, all other functionality is gone.
This reeks of those scam companies on Amazon that sell one product under a listing for a while to get the ratings up, then swap to a completely different product while keeping the listing the same, in order to fool people into buying the new product with its high star rating and good reviews. I guarantee that’s why MS decided to go this route, to fool people into installing it because of its userbase and star rating, and to cook their books on adoption numbers.





In general, you take the model size in billions of parameters (eg: 397B), divide it by 2 and add a bit for overhead, and that’s how much RAM/VRAM it takes to run it at a “normal” quantization level. For Qwen3.5-397B, that’s about 220 GB. Ideally that would be all VRAM for speed, but you can offload some or all of that to normal RAM on the CPU, you’ll just take a speed hit.
So for something like Qwen3.5-397B, it takes a pretty serious system, especially if you’re trying to do it all in VRAM.