According to Cointelegraph, a new artificial intelligence chatbot named 'Reflection 70B' has been introduced by HyperWrite AI. CEO Matt Shumer announced the development on Sept. 5, claiming it to be the world's top open-source model. The AI was trained using a technique called 'Reflection-Tuning,' designed to enable large language models (LLMs) to correct their own mistakes.
Reflection 70B is built on Llama 3.1 70B, the open-source model Meta launched in July, and can compete with top closed-source models like Anthropic's Claude 3.5 Sonnet and OpenAI's GPT-4o on several benchmarks, according to Shumer. He explained that current AI models often hallucinate, meaning they generate inaccurate outputs by perceiving patterns or objects that do not exist. Reflection-Tuning aims to address this by allowing the AI to recognize and correct its mistakes before finalizing an answer.
Reflection-Tuning involves feeding the AI's responses back into the model and asking it to evaluate its own outputs, identifying strengths, weaknesses, and areas for improvement. This iterative process helps the AI continuously refine its answers, making it better at critiquing and improving its own performance. Shumer noted that with the right prompting, the new model is highly effective for a variety of use cases.
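The draft-critique-revise loop described above can be sketched in a few lines. This is a toy illustration under stated assumptions, not HyperWrite's actual implementation: the `draft` and `critique` functions here are hypothetical stand-ins for what would, in a real system, be separate calls to the language model itself.

```python
# Toy sketch of a reflection-style loop: draft an answer, critique it,
# and revise before finalizing. In a real system all three steps would
# be performed by the LLM; here they are simple hand-written stand-ins.

def draft(question):
    # First-pass "model" output, with a deliberate arithmetic slip.
    answers = {"17 * 24": 398}  # wrong draft answer
    return answers.get(question)

def critique(question, answer):
    # Reflection step: independently re-derive the result and compare.
    a, b = (int(x) for x in question.split(" * "))
    correct = a * b
    return answer == correct, correct

def answer_with_reflection(question):
    candidate = draft(question)
    ok, corrected = critique(question, candidate)
    return candidate if ok else corrected

print(answer_with_reflection("17 * 24"))  # reflection catches the slip -> 408
```

The key idea is that the critique step is run before the answer is returned, so the user only ever sees the revised output.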
In 2023, Microsoft-backed OpenAI released a research paper discussing methods to prevent AI hallucinations. One proposed idea was 'process supervision,' which involves training AI models to reward themselves for each correct step of reasoning rather than just the final correct conclusion. Karl Cobbe, a researcher at OpenAI, emphasized the importance of detecting and mitigating logical mistakes or hallucinations as a critical step towards building aligned artificial general intelligence (AGI).
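The contrast between rewarding only the final answer and rewarding each reasoning step can be made concrete with a small sketch. This is a simplified illustration, not OpenAI's method: the per-step "checker" here is a hand-written comparison, whereas in practice a trained reward model scores each step.

```python
# Toy contrast between outcome supervision (reward only the final answer)
# and process supervision (reward each correct reasoning step).

def outcome_reward(final_answer, target):
    # Outcome supervision: all-or-nothing signal on the final answer.
    return 1.0 if final_answer == target else 0.0

def process_reward(steps):
    # Process supervision: fraction of reasoning steps that are correct.
    # steps: list of (claimed_value, correct_value) pairs.
    return sum(1.0 for claimed, correct in steps if claimed == correct) / len(steps)

# Example: solving (3 + 4) * 2, with a slip in the second step.
steps = [(7, 7),    # step 1: 3 + 4 = 7 (correct)
         (15, 14)]  # step 2: 7 * 2 claimed as 15 (wrong)

print(outcome_reward(final_answer=15, target=14))  # 0.0 -> no signal about step 1
print(process_reward(steps))                       # 0.5 -> credits the correct step
```

Under outcome supervision the model gets no credit for the correct first step; process supervision localizes the error, which is the property the paper argues helps reduce hallucinated reasoning.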