The author also expresses his concern about the over-reliance on OpenAI and encourages support for open-source LLMs. He argues that while OpenAI has contributed significantly to advancing the field, there are other special LLMs that can work on edge and should not be overlooked. The author concludes by sharing Python code used for generating descriptions, embeddings, and querying them.
Key takeaways:
- The author is exploring the use of local AI models, specifically Language Models (LLMs), to improve the semantic search of his photo archive in iCloud.
- The author uses a multi-modal LLM that can understand images and describes them in detail. The descriptions are then embedded as vectors using a popular algorithm, allowing users to search based on these descriptions.
- The author uses the pgvector extension with Postgres database to store the state of his application and to query the similarity of the generated descriptions.
- The author encourages support for open-source LLMs and emphasizes the importance of not being dependent on a single company for AI development.