Sign up to save tools and stay up to date with the latest in AI
bg
bg
1

Microsoft unveils ‘LeMa’: A revolutionary AI learning method mirroring human problem solving

Nov 04, 2023 - venturebeat.com
Researchers from Microsoft Research Asia, Peking University, and Xi’an Jiaotong University have developed a new technique, Learning from Mistakes (LeMa), to enhance the problem-solving abilities of large language models (LLMs). The strategy trains AI to learn from its own errors, similar to human learning processes. The researchers used flawed reasoning paths generated by LLaMA-2 and corrected by GPT-4 to further train the models, leading to improved performance across multiple LLMs and mathematical reasoning tasks.

The LeMa technique also showed impressive results on challenging datasets, with specialized LLMs like WizardMath and MetaMath surpassing the performance of non-execution open-source models. This development marks a significant step towards AI systems that can learn from their mistakes, potentially revolutionizing sectors like healthcare, finance, and autonomous vehicles. The researchers have made their code, data, and models publicly available on GitHub, encouraging further exploration and advancements in machine learning.

Key takeaways:

  • Researchers from Microsoft Research Asia, Peking University, and Xi’an Jiaotong University have developed a new technique, Learning from Mistakes (LeMa), to improve large language models’ (LLMs) ability to solve math problems by learning from their mistakes.
  • The LeMa strategy involves having models generate flawed reasoning paths for math problems, identifying errors, explaining them, and providing corrected reasoning paths. This corrected data is then used to further train the original models.
  • LeMa has shown impressive results, improving performance across five backbone LLMs and two mathematical reasoning tasks, and achieving high accuracy on challenging datasets with specialized LLMs like WizardMath and MetaMath.
  • The development of LeMa signifies a significant step towards AI systems that can learn and improve from their mistakes, similar to human learning processes. This could revolutionize sectors heavily reliant on AI, such as healthcare, finance, and autonomous vehicles.
View Full Article

Comments (0)

Be the first to comment!