The company has a roadmap for future developments, including more action support, clever tab management, and the ability to control the browser by text or voice. Other plans include the introduction of workflows, a chat feature, the ability to share workflows, and a cloud version of AI Employe. The company also plans to support open-source models and community-shared workflows.
Key takeaways:
- AI Employe is a browser automation tool that can automate tasks requiring human-like intelligence such as understanding emails, receipts, invoices, etc.
- The tool uses a unique technique to find the right element on a webpage by indexing the entire DOM in MeiliSearch, which allows GPT-4-vision to generate commands for actions.
- To prevent GPT from derailing from tasks, AI Employe uses a technique called Actions Augmented Generation, which records the DOM element changes for every action a user takes.
- The roadmap for AI Employe includes features like workflows, chat with what you see, more actions support, clever tab management, open source models support, and a cloud version of AI Employe.