
Depending on quantization, I figure they need at least a p4, and likely a p5, EC2 instance (or a similar instance from another provider) for a model with that many parameters. Maybe they are hosting on bare metal, but I imagine not. Those instance types are quite expensive to run, assuming they aren't using spot.
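For a rough sense of why quantization decides the instance class, here is a back-of-envelope sketch. It counts weight memory only (no KV cache, activations, or runtime overhead), and the 400B parameter count is a made-up placeholder, not the actual model's size. The GPU memory totals are for p4d.24xlarge (8x A100 40 GB) and p5.48xlarge (8x H100 80 GB).

```python
# Back-of-envelope: weight memory vs. total GPU memory on an instance.
# Assumption: weights only; real deployments also need room for the
# KV cache, activations, and framework overhead.

def weight_memory_gb(num_params: float, bits_per_param: int) -> float:
    """Memory needed for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9

def weights_fit(num_params: float, bits_per_param: int, gpu_memory_gb: float) -> bool:
    """True if the weights alone fit in the given total GPU memory."""
    return weight_memory_gb(num_params, bits_per_param) <= gpu_memory_gb

# Total GPU memory per instance, in GB.
INSTANCES = {
    "p4d.24xlarge": 8 * 40,  # 8x A100 40 GB
    "p5.48xlarge": 8 * 80,   # 8x H100 80 GB
}

# Hypothetical parameter count, for illustration only.
HYPOTHETICAL_PARAMS = 400e9

if __name__ == "__main__":
    for bits in (16, 8, 4):
        need = weight_memory_gb(HYPOTHETICAL_PARAMS, bits)
        for name, total in INSTANCES.items():
            verdict = "fits" if weights_fit(HYPOTHETICAL_PARAMS, bits, total) else "does not fit"
            print(f"{bits}-bit: {need:.0f} GB of weights {verdict} on {name} ({total} GB)")
```

With those placeholder numbers, 8-bit weights would overflow a single p4d but fit on a p5, and 4-bit weights would fit on either, which is roughly the "at least a p4, likely a p5" range.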


