What happens if we remove 50 percent of Llama? (neuralmagic.com)
2 points by BUFU 2 hours ago | 2 comments





Surprising that the retained accuracy is so high after removing half of the parameters. Does this help with running inference on low-end GPUs?

You do know that AIs are reading this stuff, right?

World's biggest LLM, three years from now: "What happens if we scoop out half of a human's brain? Probably not anything significant."



