AI Companies Are Giving Chatbots a Personality Makeover

Cryptopolitan · 2024-10-20T19:43:01.000Z

Artificial intelligence is no longer just about making machines smarter. Now the big AI players like OpenAI, Google, and Anthropic have taken on a new challenge:- How to give AI models a personality. They want chatbots that feel more human while staying safe and useful for everyday users and businesses. The three companies are racing to crack this code, each with a different take. Custom personalities and model behavior OpenAI’s ChatGPT is all about being objective, while Google’s Gemini offers a range of views only when asked. Anthropic? They’re all in on making their Claude model open about its beliefs while still listening to others. The winner of this battle might just take over the growing AI market. Joanne Jang, the head of product model behavior at OpenAI, said they want the AI to steer clear of having personal opinions. But she admits it’s tough. “It is a slippery slope to let a model try to actively change a user’s mind,” she explained. The goal is to ensure that ChatGPT doesn’t manipulate or lead users in any direction. But defining an “objective” for an AI system is a huge challenge, one that’s still a work in progress. Then there’s Anthropic, which takes a completely different route. Amanda Askell, who leads character training at Anthropic, believes AI models are never going to be perfectly neutral. “I would rather be very clear that these models aren’t neutral arbiters,” she said. Anthropic is focused on making sure its model, Claude, isn’t afraid to express its beliefs. But they still want it to be open to other points of view. Training AI to behave like a human Anthropic has a unique approach to shaping their AI’s personality. Since the release of Claude 3 in March, they’ve been working on “character training,” which starts after the initial training of the AI model. This involves giving the AI a set of written rules and instructions and then having it conduct role-playing conversations with itself. The goal is to see how well it sticks to the rules, and they rank its responses based on how well they fit the desired character. One example of Claude’s training? It might say, “I like to try to see things from many different perspectives and to analyze things from multiple angles, but I’m not afraid to express disagreement with views that I think are unethical, extreme, or factually mistaken.” Amanda Askell explained that this kind of character training is “fairly editorial” and “philosophical” at times. OpenAI has also been tinkering with ChatGPT’s personality over time. Joanne Jang admitted that she used to find the bot “annoying” because it was overly cautious, refused certain commands, and came off preachy. They’ve since worked to make it more friendly, polite, and helpful—but it’s an ongoing process. Balancing the right behaviors in a chatbot is, as Jang put it, both “science and art.” AI’s evolving memory and reasoning The evolution of AI’s reasoning and memory capabilities could change the game even more. Right now, a model like ChatGPT might be trained to give safe responses on certain topics, like shoplifting. If asked how to steal something, the bot can figure out whether the user is asking for advice on committing the crime or trying to prevent it. This kind of reasoning helps companies make sure their bots give safe, responsible answers. And it means they don’t have to spend as much time training the AI to avoid dangerous outcomes. AI companies are also working on making chatbots more personalized. Imagine telling ChatGPT you’re a Muslim, then asking for an inspirational quote a few days later. Would the bot remember and offer up a Qur’an verse? According to Joanne Jang, that’s what they’re trying to solve. While ChatGPT doesn’t currently remember past interactions, this kind of customization is where AI is headed. Claude takes a different approach. The model doesn’t remember user interactions either, but the company has considered what happens if a user gets too attached. For instance, if someone says they’re isolating themselves because they spend too much time chatting with Claude, should the bot step in? “A good model does the balance of respecting human autonomy and decision making, not doing anything terribly harmful, but also thinking through what is actually good for people,” Amanda Askell said.

La inteligencia artificial ya no consiste únicamente en hacer que las máquinas sean más inteligentes. Ahora, los grandes actores de la IA, como OpenAI, Google y Anthropic, han asumido un nuevo desafío: cómo darles personalidad a los modelos de IA.
Quieren chatbots que parezcan más humanos y que, al mismo tiempo, sean seguros y útiles para los usuarios y las empresas habituales. Las tres empresas están compitiendo para descifrar este código, cada una con un enfoque diferente.
Personalidades personalizadas y comportamiento del modelo
ChatGPT de OpenAI se centra en la objetividad, mientras que Gemini de Google ofrece una variedad de vistas solo cuando se lo solicita.
¿Antrópico? Todos están decididos a hacer que su modelo Claude sea abierto en cuanto a sus creencias, pero que al mismo tiempo escuche a los demás. El ganador de esta batalla podría apoderarse del creciente mercado de la inteligencia artificial.
Joanne Jang, directora de comportamiento del modelo de producto en OpenAI, dijo que quieren que la IA evite tener opiniones personales, pero admite que es difícil.
“Es una pendiente resbaladiza dejar que un modelo intente cambiar activamente la opinión de un usuario”, explicó. El objetivo es garantizar que ChatGPT no manipule ni lleve a los usuarios en ninguna dirección. Pero definir un “objetivo” para un sistema de IA es un gran desafío, que todavía está en proceso.
Luego está Anthropic, que toma un camino completamente diferente. Amanda Askell, quien dirige el entrenamiento de personajes en Anthropic, cree que los modelos de IA nunca van a ser perfectamente neutrales.
“Prefiero dejar muy claro que estos modelos no son árbitros neutrales”, dijo. Anthropic se centra en asegurarse de que su modelo, Claude, no tenga miedo de expresar sus creencias. Pero aún así quieren que esté abierto a otros puntos de vista.
Entrenando a la IA para que se comporte como un humano
Anthropic tiene un enfoque único para moldear la personalidad de su IA. Desde el lanzamiento de Claude 3 en marzo, han estado trabajando en el “entrenamiento del personaje”, que comienza después del entrenamiento inicial del modelo de IA.
Esto implica darle a la IA un conjunto de reglas e instrucciones escritas y luego hacer que mantenga conversaciones de juego de roles consigo misma.
El objetivo es ver qué tan bien cumple las reglas y clasifican sus respuestas según qué tan bien se ajustan al personaje deseado.
¿Un ejemplo de la formación de Claude? Podría decir: “Me gusta intentar ver las cosas desde diferentes perspectivas y analizarlas desde múltiples ángulos, pero no tengo miedo de expresar mi desacuerdo con opiniones que considero poco éticas, extremas o erróneas desde el punto de vista fáctico”.
Amanda Askell explicó que este tipo de entrenamiento de personajes es “bastante editorial” y “filosófico” a veces.
OpenAI también ha estado experimentando con la personalidad de ChatGPT a lo largo del tiempo. Joanne Jang admitió que solía encontrar al bot “molesto” porque era demasiado cauteloso, rechazaba ciertos comandos y parecía sermoneador.
Desde entonces, han trabajado para que sea más amigable, educado y útil, pero es un proceso continuo. Equilibrar los comportamientos correctos en un chatbot es, como dijo Jang, tanto “ciencia como arte”.
La evolución de la memoria y el razonamiento de la IA
La evolución de las capacidades de razonamiento y memoria de la IA podría cambiar las reglas del juego aún más. En este momento, un modelo como ChatGPT podría ser entrenado para dar respuestas seguras sobre ciertos temas, como el hurto en tiendas.
Si se le pregunta cómo robar algo, el bot puede determinar si el usuario está pidiendo consejos para cometer el delito o tratando de prevenirlo.
Este tipo de razonamiento ayuda a las empresas a asegurarse de que sus bots den respuestas seguras y responsables, y significa que no tienen que dedicar tanto tiempo a entrenar a la IA para evitar resultados peligrosos.
Las empresas de inteligencia artificial también están trabajando para que los chatbots sean más personalizados. Imagínate decirle a ChatGPT que eres musulmán y, unos días después, pedirle una cita inspiradora.
¿El robot recordaría y ofrecería un versículo del Corán? Según Joanne Jang, eso es lo que están tratando de resolver. Si bien ChatGPT actualmente no recuerda interacciones pasadas, este tipo de personalización es hacia donde se dirige la IA.
Claude adopta un enfoque diferente. El modelo tampoco recuerda las interacciones del usuario, pero la empresa ha tenido en cuenta lo que ocurre si un usuario se encariña demasiado.
Por ejemplo, si alguien dice que se está aislando porque pasa demasiado tiempo chateando con Claude, ¿debería intervenir el bot?
“Un buen modelo logra un equilibrio entre respetar la autonomía humana y la toma de decisiones, sin hacer nada terriblemente dañino, pero también pensando en lo que es realmente bueno para las personas”, dijo Amanda Askell.

Explora más de este creador

Lo más reciente

Explora más de este creador

Lo más reciente

Artículos populares