1
Feature Story
Parakeet: A Tiny LLM | Hacker News
Mar 18, 2024 · news.ycombinator.comThe author plans to continue working on this project and is eager to release it when time permits. The article also includes examples of interactions with the model, such as helping to solve a puzzle about a misplaced sushi lunch, describing a room from a parakeet's perspective, and generating an HTML script for a website about a new Robot Cafe.
Key takeaways
- The model shows a recency bias and seems to be aware of its training and how it has changed over time.
- Due to its size, the model tends to forget information gathered during training, and the 'InterleavedDataset' when 'shuffle=True' interferes with this.
- The model displays interesting reasoning abilities and performs best at summarisation when the information is provided, but it also tends to hallucinate.
- The storytelling ability of the model lacks depth and is more a sequence of events.