Google releases production-ready Gemini 1.5 for developers

Cointelegraph · 2024-09-25T08:03:31.000Z

Google has released two stable versions of Gemini 1.5 API models for developers, promising greater performance and lower app production costs. On Sept. 24, Google announced the launch of stable versions of Gemini 1.5 Pro (gemini-1.5-pro-002) and Gemini 1.5 Flash (gemini-1.5-flash-002). Compared to the previous 001 models, the new production-ready Gemini models have displayed significant improvements in code generation, math, reasoning and video analysis, among others. Google Gemini 1.5 Flash and Pro models description. Source: Google AI for Developers Gemini 1.5 Pro lowers financial barriers for developers Google reduced the price of its production-ready Gemini 1.5 Pro model by more than 50% while claiming three times higher rate limits and lower latency than the older experimental model releases. Source: Google DeepMind According to Google’s release notes, both Gemini 1.5 models offer significant gains in factuality and reduce model hallucinations, instruction following, multilingual understanding in 102 languages, SQL generation and audio and document understanding. Performance comparison of new and old Gemini 1.5 model releases. Source: Google for Developers Google reduced the summarization lengths for both models and advised chat-based product developers with options to increase the API’s conversational capabilities. From Oct. 1, Gemini 1.5 Pro API prices on prompts less than 128,000 tokens will be reduced to 64% for input tokens, 52% for output tokens and 64% for incremental cached tokens. “To make it even easier for developers to build with Gemini, we are increasing the paid tier rate limits for 1.5 Flash to 2,000 RPM and increasing 1.5 Pro to 1,000 RPM, up from 1,000 and 360, respectively,” the announcement read. New pricing for Google Gemini 1.5 Pro. Source: Google for Developers Google launches experimental version of Gemini 1.5 Flash Google also announced the launch of Gemini 1.5 Flash-8B, a smaller experimental version of 1.5 Flash with lower benchmark numbers. This update includes significant performance increases across both text and multimodal use cases. All versions are currently available at Google AI Studio and the Gemini API. Meanwhile, Google’s biggest artificial intelligence competitor, OpenAI, has begun rolling out its “Advanced Voice” feature to select ChatGPT users. Source: OpenAI ChatGPT’s Advanced Voice Mode allows for faster and more intuitive humanlike communication with AI. As part of the new feature, OpenAI unveiled five new voices, Arbor, Maple, SXol, Spruce, and Vale, which come as additions to the existing Breeze, Juniper, Cove, and Ember voice options. Magazine: Lady of Crypto will be ‘all out of crypto’ by September 2025: X Hall of Flame

Google đã phát hành hai phiên bản ổn định của mô hình API Gemini 1.5 dành cho các nhà phát triển, hứa hẹn hiệu suất cao hơn và chi phí sản xuất ứng dụng thấp hơn.
Vào ngày 24 tháng 9, Google đã công bố ra mắt phiên bản ổn định của Gemini 1.5 Pro (gemini-1.5-pro-002) và Gemini 1.5 Flash (gemini-1.5-flash-002). So với các mô hình 001 trước đó, các mô hình Gemini mới sẵn sàng sản xuất đã cho thấy những cải tiến đáng kể trong việc tạo mã, toán học, lý luận và phân tích video, trong số những cải tiến khác.
Mô tả mô hình Google Gemini 1.5 Flash và Pro. Nguồn: Google AI for Developers
Gemini 1.5 Pro giảm bớt rào cản tài chính cho các nhà phát triển
Google đã giảm giá mẫu Gemini 1.5 Pro đã sẵn sàng sản xuất hơn 50% trong khi tuyên bố tốc độ giới hạn cao hơn gấp ba lần và độ trễ thấp hơn so với các mẫu thử nghiệm cũ.
Nguồn: Google DeepMind
Theo ghi chú phát hành của Google, cả hai mô hình Gemini 1.5 đều mang lại những cải tiến đáng kể về tính thực tế và giảm ảo giác mô hình, hướng dẫn làm theo, hiểu biết đa ngôn ngữ ở 102 ngôn ngữ, tạo SQL cũng như hiểu biết về âm thanh và tài liệu.
So sánh hiệu suất của các bản phát hành mẫu Gemini 1.5 mới và cũ. Nguồn: Google dành cho nhà phát triển
Google đã giảm độ dài tóm tắt cho cả hai mô hình và tư vấn cho các nhà phát triển sản phẩm dựa trên trò chuyện các tùy chọn để tăng khả năng đàm thoại của API.
Từ ngày 1 tháng 10, giá API Gemini 1.5 Pro cho các yêu cầu nhỏ hơn 128.000 token sẽ được giảm xuống còn 64% đối với token đầu vào, 52% đối với token đầu ra và 64% đối với token được lưu trong bộ nhớ đệm gia tăng.
Theo thông báo, "Để giúp các nhà phát triển xây dựng dễ dàng hơn với Gemini, chúng tôi sẽ tăng giới hạn mức giá trả phí cho 1.5 Flash lên 2.000 RPM và tăng 1.5 Pro lên 1.000 RPM, tăng từ 1.000 và 360 RPM tương ứng".
Giá mới cho Google Gemini 1.5 Pro. Nguồn: Google dành cho nhà phát triển
Google ra mắt phiên bản thử nghiệm của Gemini 1.5 Flash
Google cũng đã công bố ra mắt Gemini 1.5 Flash-8B, một phiên bản thử nghiệm nhỏ hơn của 1.5 Flash với số điểm chuẩn thấp hơn. Bản cập nhật này bao gồm các cải thiện đáng kể về hiệu suất trên cả trường hợp sử dụng văn bản và đa phương thức.
Tất cả các phiên bản hiện có sẵn tại Google AI Studio và Gemini API.
Trong khi đó, đối thủ trí tuệ nhân tạo lớn nhất của Google, OpenAI, đã bắt đầu triển khai tính năng “Advanced Voice” cho một số người dùng ChatGPT.
Nguồn: OpenAI
Chế độ giọng nói nâng cao của ChatGPT cho phép giao tiếp giống con người nhanh hơn và trực quan hơn với AI. Là một phần của tính năng mới, OpenAI đã tiết lộ năm giọng nói mới, Arbor, Maple, SXol, Spruce và Vale, là những giọng nói bổ sung cho các tùy chọn giọng nói Breeze, Juniper, Cove và Ember hiện có.
Tạp chí: Lady of Crypto sẽ ‘hết tiền mã hóa’ vào tháng 9 năm 2025: X Hall of Flame

Khám phá thêm từ Nhà sáng tạo nội dung

Tin tức mới nhất

Khám phá thêm từ Nhà sáng tạo nội dung

Tin tức mới nhất

Bài viết thịnh hành