Back to Blog
March 17, 2024

9 AI Developments: HeyGen 2.0 to AjaxGPT, Open Interpreter to NExT-GPT and Roblox AI

9 AI Developments: HeyGen 2.0 to AjaxGPT, Open Interpreter to NExT-GPT and Roblox AI

Nine AI Developments That Will Change the Future

Artificial intelligence (AI) is advancing at an unprecedented pace, and in the last few days, there have been nine impactful AI developments that I want to share with you. From the frankly startling Hey Gen video translation to the Epic new prompt optimizing paper, and from Apple's iax GPT to Open Interpreter Next GPT, there is a lot to cover. So, let's dive in and explore these developments in detail.

Hey Gen: Generating Lifelike Videos and Language Dubbing

You've probably already heard about Hey Gen, which can generate lifelike videos and is available as a plugin to Chat GPT. But did you know that it can also do video language dubbing? Today, I got access to their new Avatar 2.0 feature and decided to test it out with Sam Altman's testimony to the Senate. I want Spanish language speakers to tell me how it turned out. I have been researching three or four tools, including this one, to translate my videos into dozens of languages, and I can't wait to put that into place.

Open Interpreter: An Open Source Code Interpreter

Open Interpreter is an open-source code interpreter that was released five days ago. I've been using it intensively, and while it's not perfect, it has proven useful. For example, I asked it to download a YouTube video in 1440p using Pytube and clip out a specific section, and it did it in just a few seconds. This process would have taken me much longer to do manually.

Google DeepMind's Fascinating Paper on Optimized Prompts

Google DeepMind has released a fascinating paper on optimized prompts for language models. These prompts are not small optimizations, and they work with a variety of large language models. The paper says that the best prompts optimized by their method outperform human design prompts by up to eight percent on a particular math challenge and by up to 50 on big bench hard tasks. These are long-standing tasks known for their difficulty for large language models.

Google's Gemini Model: A Direct Competitor to GPT-4

Google has given a small group of companies access to an early version of Gemini, their direct competitor to OpenAI's GPT-4. According to a person who has tested it, Gemini has an advantage over GPT-4 in at least one respect. The model leverages reams of Google's proprietary data from its consumer products, in addition to public information straight from the web.

Apple's iax GPT: Designed to Boost Siri

Apple's iax GPT is designed to boost Siri, and it almost sounds like Open Interpreter, where you can automate tasks involving multiple steps. For example, telling Siri to create a gif using the last five photos you've taken and text it to a friend.

Roblox's New AI Chat Bot: Allowing Creators to Build Virtual Worlds

The online game platform Roblox is bringing in a new AI chatbot that's going to allow creators to build virtual worlds just by typing prompts. This development is going to become intuitive to the next generation, and children today are just going to expect their apps to be interactive and customizable on demand.

Smell to Text: A Narrow AI Trained in a Different Way

We now have Smell to Text, a much more narrow AI trained in a very different way to GPT models, but it matches well with expert humans on novel smells.

Protein Chat: Enabling Users to Upload Proteins and Ask Questions

Protein Chat enables users to upload proteins, ask questions, and engage in interactive conversations to gain insights.

Next GPT: A Multimodal LLM That Can Go from Any Modality to Any Modality

Next GPT is a multimodal LLM that can go from any modality to any modality. We're talking about images, audio, video, and the output being images, audio, text, or video.

As you can see, AI is advancing at an incredible pace, and these developments are just the tip of the iceberg. The world is only going to get more crazy from here, and it's up to us to navigate the future of AI.

Highlights

- Hey Gen can generate lifelike videos and do video language dubbing.

- Open Interpreter is an open-source code interpreter that can download YouTube videos and clip out specific sections.

- Google DeepMind's optimized prompts outperform human design prompts by up to 50 on big bench hard tasks.

- Gemini, Google's direct competitor to GPT-4, leverages reams of Google's proprietary data from its consumer products.

- Apple's iax GPT is designed to boost Siri and automate tasks involving multiple steps.

- Roblox's new AI chatbot allows creators to build virtual worlds just by typing prompts.

- Smell to Text is a narrow AI trained in a different way to GPT models.

- Protein Chat enables users to upload proteins, ask questions, and engage in interactive conversations to gain insights.

- Next GPT is a multimodal LLM that can go from any modality to any modality.

FAQ

Q: What is Hey Gen?

A: Hey Gen is an AI tool that can generate lifelike videos and do video language dubbing.

Q: What is Open Interpreter?

A: Open Interpreter is an open-source code interpreter that can download YouTube videos and clip out specific sections.

Q: What is Gemini?

A: Gemini is Google's direct competitor to GPT-4, which leverages reams of Google's proprietary data from its consumer products.

Q: What is iax GPT?

A: iax GPT is Apple's AI language model designed to boost Siri and automate tasks involving multiple steps.

Q: What is Roblox's new AI chatbot?

A: Roblox's new AI chatbot allows creators to build virtual worlds just by typing prompts.

Q: What is Smell to Text?

A: Smell to Text is a narrow AI trained in a different way to GPT models.

Q: What is Protein Chat?

A: Protein Chat enables users to upload proteins, ask questions, and engage in interactive conversations to gain insights.

Q: What is Next GPT?

A: Next GPT is a multimodal LLM that can go from any modality to any modality.

Related Articles

Voice-of-customer
How to optimize product page based on amazon review analysis

Real customer feedback is the fastest way to find out what buyers actually want. Hidden inside those paragraphs are the answers to everything: what they hate, what they love, and how they are actually using the product.Many sellers make the mistake of just chasing a high number of reviews. But the s

Jan 28, 2026
Read more
Voice-of-customer
The Ultimate List of Amazon Seller Resources to Bookmark in 2026

As we settle into 2026, e-commerce has fully cemented itself as the dominant force in global retail. But with this growth comes a massive influx of new sellers, sophisticated AI competitors, and constantly shifting algorithms. This means the landscape changes fast—sometimes overnight.The bad news: A

Jan 28, 2026
Read more
Voice-of-customer
The 5 Best Amazon Seller Tools You Need This Year

Tools like Jungle Scout and Helium 10 revolutionized the industry by giving sellers access to powerful sales data. They remain essential for validating market demand and checking revenue.But having access to the same data as everyone else creates a new challenge: How do you differentiate your produc

Jan 27, 2026
Read more
VOC AI Inc. 160 E Tasman Drive Suite 202 San Jose, CA, 95134 Copyright © 2026 VOC AI Inc.All Rights Reserved. Terms & Conditions Privacy Policy
This website uses cookies
VOC AI uses cookies to ensure the website works properly, to store some information about your preferences, devices, and past actions. This data is aggregated or statistical, which means that we will not be able to identify you individually. You can find more details about the cookies we use and how to withdraw consent in our Privacy Policy.
We use Google Analytics to improve user experience on our website. By continuing to use our site, you consent to the use of cookies and data collection by Google Analytics.
Are you happy to accept these cookies?
Accept all cookies
Reject all cookies