Monthly Archives: May 2024
OpenAI pauses use of “Sky” voice after threat of legal action.
OpenAI has paused a voice mode option for ChatGPT-4o, Sky, after backlash accusing the AI company of intentionally ripping off Scarlett Johansson’s critically acclaimed voice-acting performance in the 2013 sci-fi film Her.
In a blog defending its casting decision for Sky, OpenAI went into great detail explaining its process for choosing the individual voice options for its chatbot. But ultimately, the company seemed pressed to admit that Sky’s voice was just too similar to Johansson’s to keep using it, at least for now. — Read More
The ‘dead internet theory’ makes eerie claims about an AI-run web. The truth is more sinister
If you search “shrimp Jesus” on Facebook, you might encounter dozens of images of artificial intelligence (AI) generated crustaceans meshed in various forms with a stereotypical image of Jesus Christ.
Some of these hyper-realistic images have garnered more than 20,000 likes and comments. So what exactly is going on here?
The “dead internet theory” has an explanation: AI and bot-generated content has surpassed the human-generated internet. But where did this idea come from, and does it have any basis in reality? — Read More
Filmmakers Launch AI Studio Late Night Labs
A group of filmmakers are launching an AI film and animation studio and has snagged some A-list advisors.
Eric Day, Benjamin Michel, and Nick Confalone have launched LA-based Late Night Labs with Poker Face star Natasha Lyonne and Blue Beetle director Angel Manuel Soto among its advisors.
The trio are using generative AI in the creative process but are hoping that the new technology can also provide artists with “tangible ownership” with what they create. — Read More
How does ChatGPT ‘think’? Psychology and neuroscience crack open AI large language models
David Bau is very familiar with the idea that computer systems are becoming so complicated it’s hard to keep track of how they operate. “I spent 20 years as a software engineer, working on really complex systems. And there’s always this problem,” says Bau, a computer scientist at Northeastern University in Boston, Massachusetts.
But with conventional software, someone with inside knowledge can usually deduce what’s going on, Bau says. If a website’s ranking drops in a Google search, for example, someone at Google — where Bau worked for a dozen years — will have a good idea why. “Here’s what really terrifies me” about the current breed of artificial intelligence (AI), he says: “there is no such understanding”, even among the people building it. — Read More
AI eats the web
Google’s shift toward AI-generated search results, displacing the familiar list of links, is rewiring the internet — and could accelerate the decline of the 30+-year-old World Wide Web.
Why it matters: A world where Google answers most questions in a single machine voice makes online life more convenient — and duller.
— The change also threatens to cut into Google’s revenue from search ads, and starve future AIs of the human data they’ll need. — Read More
Newspaper conglomerate Gannett is adding AI-generated summaries to the top of its articles
Gannett, the media company that owns hundreds of newspapers in the US, is launching a new program that adds AI-generated bullet points at the top of journalists’ stories, according to an internal memo seen by The Verge.
The AI feature, labeled “key points” on stories, uses automated technology to create summaries that appear below a headline. The bottom of articles includes a disclaimer, reading, “The Key Points at the top of this article were created with the assistance of Artificial Intelligence (AI) and reviewed by a journalist before publication. No other parts of the article were generated using AI.” The memo is dated May 14th and notes that participation is optional at this point. — Read More
Communicative Agents for Software Development
Software engineering is a domain characterized by intricate decision-making processes, often relying on nuanced intuition and consultation. Recent advancements in deep learning have started to revolutionize software engineering practices through elaborate designs implemented at various stages of software development. In this paper, we present an innovative paradigm that leverages large language models (LLMs) throughout the entire software development process, streamlining and unifying key processes through natural language communication, thereby eliminating the need for specialized models at each phase. At the core of this paradigm lies ChatDev, a virtual chat-powered software development company that mirrors the established waterfall model, meticulously dividing the development process into four distinct chronological stages: designing, coding, testing, and documenting. Each stage engages a team of “software agents”, such as programmers, code reviewers, and test engineers, fostering collaborative dialogue and facilitating a seamless workflow. The chat chain acts as a facilitator, breaking down each stage into atomic subtasks. This enables dual roles, allowing for proposing and validating solutions through context-aware communication, leading to efficient resolution of specific subtasks. The instrumental analysis of ChatDev highlights its remarkable efficacy in software generation, enabling the completion of the entire software development process in under seven minutes at a cost of less than one dollar. It not only identifies and alleviates potential vulnerabilities but also rectifies potential hallucinations while maintaining commendable efficiency and cost-effectiveness. The potential of ChatDev unveils fresh possibilities for integrating LLMs into the realm of software development. Our code is available at this https URL. – Read More
Researchers publish largest-ever dataset of neural connections
A cubic millimeter of brain tissue may not sound like much. But considering that that tiny square contains 57,000 cells, 230 millimeters of blood vessels, and 150 million synapses, all amounting to 1,400 terabytes of data, Harvard and Google researchers have just accomplished something stupendous.
Led by Jeff Lichtman, the Jeremy R. Knowles Professor of Molecular and Cellular Biology and newly appointed dean of science, the Harvard team helped create the largest 3D brain reconstruction to date, showing in vivid detail each cell and its web of connections in a piece of temporal cortex about half the size of a rice grain. — Read More
The Study
ChatGPT 4o vs Gemini 1.5 Pro: It’s Not Even Close
OpenAI introduced its flagship GPT-4o model at the Spring Update event and made it free for everyone. Just after a day, at the Google I/O 2024 event, Google debuted the Gemini 1.5 Pro model for consumers via Gemini Advanced. Now that two flagship models are available for consumers, let’s compare ChatGPT 4o and Gemini 1.5 Pro and see which one does a better job. On that note, let’s begin.
We have performed many commonsense reasoning and multimodal tests on both ChatGPT 4o and Gemini 1.5 Pro. ChatGPT 4o performs much better than Gemini 1.5 Pro in a variety of tasks including reasoning, code generation, multimodal understanding, and more. — Read More