• AI Research Insights
  • Posts
  • 🚀 AI News: Meet CipherChat + How Joint Speech-Text Encoders Overcome Sequence-Length Mismatch in Cross-Modal Representations....(Aug 19, 2023 Edition)

🚀 AI News: Meet CipherChat + How Joint Speech-Text Encoders Overcome Sequence-Length Mismatch in Cross-Modal Representations....(Aug 19, 2023 Edition)

This newsletter brings AI research news that is much more technical than most resources but still digestible and applicable

🔥 Trending AI Research: Let’s learn something new from the trending papers.

🛎️ Trending Tools: Check out some cool AI tools picked up by our editorial team.

Read Time: 3 Minutes

Sponsored

🔥Trending AI Research

1️⃣ CMU Researchers Developed a Simple Distance Learning AI Method to Transfer Visual Priors to Robotics Tasks: Improving Policy Learning by 20% Over Baselines [Paper] [Blog]

In recent years, visual representation learning has rapidly expanded. However, its application in robotics has been limited. Typically, these visual representations are used to understand visual data, while robotic action policies have been learned separately from costly robot-specific data. This paper introduces a method that uses visual representations to directly predict robotic actions. The key realization is that vision encoders illustrate relationships between images as distances, which can be harnessed to plan robot behavior efficiently. By adjusting a pre-trained model using human-collected video sequences, a novel algorithm is developed. This approach significantly surpasses traditional methods, with a success rate of 70% compared to 50% in traditional behavior cloning tasks. Impressively, the method can adapt to new objects without requiring any robot demonstrations during training.

2️⃣ This Paper from NYU and Google Explains How Joint Speech-Text Encoders Overcome Sequence-Length Mismatch in Cross-Modal Representations [Blog] [Paper]

The recent surge in text-prompted image generation hinges on a cross-modal representation, seamlessly merging text and image domains. Applying this to ASR, joint speech-text encoders have emerged, capable of adapting to vast parameter models by utilizing unpaired speech and text. However, managing sequence-length disparities in speech and text remains a challenge, often relying on up-sampling tactics or specific alignment models. This study reveals that these encoders can uniformly represent both modalities, overlooking sequence length differences. We propose that consistency losses can potentially reconcile length variations, presupposing optimal alignment. Evidence suggests this method enhances the WER in extensive monolingual and multilingual setups.

Sponsored
AI Tool ReportLearn AI in 5 minutes a day. We'll teach you how to save time and earn more with AI. Join 500,000+ free daily readers from Tesla, Apple, A16z, Meta, & more.

3️⃣ Meet CipherChat: An AI Framework to Systematically Examine the Generalizability of Safety Alignment to Non-Natural Languages-Specifically Ciphers [Paper] [Blog]

The research examines the vulnerabilities of Large Language Models (LLMs) in the context of safety alignment, revealing that LLMs like ChatGPT and GPT-4 can be bypassed using chat in cipher. The novel framework, CipherChat, was introduced to assess how LLMs interact with non-natural languages or ciphers, especially focusing on safety. When tested with different ciphers across 11 safety domains in English and Chinese, some ciphers almost entirely evaded GPT-4's safety protocols. Intriguingly, the study discovered a unique "secret cipher" innate to LLMs, named SelfCipher. This SelfCipher, which uses role play and natural language demonstrations, surpassed traditional human ciphers in effectiveness.

4️⃣ Google DeepMind Researchers Propose 6 Composable Transformations to Incrementally Increase the Size of Transformer-based Neural Networks while Preserving Functionality [Paper] [Blog]

In this study, the challenges of training advanced neural networks due to computational and time constraints are highlighted. The importance of model scale in achieving superior performance is underscored. Conventionally, upscaling a neural network meant initiating from zero, a process complicated by a change in the architecture's parameters that hinders the seamless transfer of knowledge. This paper introduces six flexible transformations to gradually augment the size of transformer-based neural networks while retaining their functionality. These methods guarantee exact function maintenance with limited initialization prerequisites for each transformation. Such approaches can pave the way for effective training procedures, permitting the progressive expansion of the model during the training phase.

Sponsored

🛎️ Trending Tools

Recall: This AI tool recalls, summarizes, categorizes, and link various online content like Podcasts, PDFs, YouTube Videos, News Articles, and Blog posts.

AdCreative AI: This AI Tool can help you boost your advertising and social media game with.

Hostinger AI Website Builder: The Hostinger AI Website Builder offers an intuitive interface combined with advanced AI capabilities, designed for crafting websites for any purpose

Pecan AI: Pecan AI automates predictive analytics to solve today’s business challenges: shrinking budgets, rising costs, and limited data science and AI resources.

Claude 2: Claude 2 is an AI chatbot that rivals ChatGPT in terms of functionality. While both tools are comparable, Claude 2 offers certain benefits over ChatGPT.

Sponsored
AI Tool ReportLearn AI in 5 minutes a day. We'll teach you how to save time and earn more with AI. Join 500,000+ free daily readers from Tesla, Apple, A16z, Meta, & more.

Taplio: Transform your LinkedIn presence with Taplio's AI-powered platform.

Equals: Equals, the ultimate tool for startups to swiftly analyze data, stands out as the singular spreadsheet equipped with seamless integration to any database.

Robin: In the realm of project management, Robin stands out. It offers collaborative features, Gantt charts, and task management

Cresta AI: Cresta AI harnesses the power of AI to empower sales teams.

Notion: Notion AI, is a robust generative AI tool that assists users with tasks like note summarization, identifying action items in meetings, and creating and modifying text.

Ferret AI: Ferret provides deep insights into customer behavior through web analytics.

Sponsored
CXOTalk UpdatesConversations on Leadership, Enterprise AI, and the Digital Economy