I think we hit the top of the sigmoid back when the Qwen models were released. By structuring my project properly, I can point a model at any extension I want and let it run for 30 minutes to extend whatever I need. It can't effectively do 'god mode' on all the code, but as a mindful observer and code "professional" I don't need more than what fits in 128GB of VRAM.
I'm amazed we're so far into SOTA bloat that the Chinese will kill it once they start etching silicon with these models.