Running large language models like ChatGPT on a single GPU
631 by _nhynes | 230 comments on Hacker News.