• AI Research Insights
  • Posts
  • đŸ”„ What is Trending in AI Research?: Large Language Models Surprise Meta AI Researchers at Compiler Optimization! + BlindChat + Kosmos-2.5 + How Green is AI? + DeciDiffusion 1.0...

đŸ”„ What is Trending in AI Research?: Large Language Models Surprise Meta AI Researchers at Compiler Optimization! + BlindChat + Kosmos-2.5 + How Green is AI? + DeciDiffusion 1.0...

This newsletter brings AI research news that is much more technical than most resources but still digestible and applicable

Hey Folks!

This newsletter will discuss some cool AI research papers and AI tools. Happy learning!

👉 What is Trending in AI/ML Research? 

How can we harness the capabilities of Large Language Models (LLMs) for code optimization? In this study by Meta AI, the authors dive into this emerging application, developing a 7B-parameter transformer model tailored to optimize LLVM assembly code for size. Unlike conventional methods, this model analyzes unoptimized assembly to suggest optimal compiler options. During training, the model is fine-tuned to forecast instruction counts pre- and post-optimization and the optimized code itself. This training strategy enriches the model's optimization prowess. Remarkably, on tested programs, the approach trims instruction counts by 3.0% more than the compiler, surpassing two top-tier benchmarks, while showcasing a profound code understanding—yielding compilable code in 91% of instances and matching the compiler's output 70% of the time.

BlindChat, an open-source and privacy-first alternative to ChatGPT, was just launched by MithrilSecurity. BlindChat is an open-source AI initiative aiming to create the world’s first conversational AI that operates entirely within a web browser without any third-party access. Today’s prevalent everyday AI solutions typically include sharing user data with AI service providers in exchange for AI model usage. Users risk having their data stolen if they let this happen. Since data is a valuable resource for enhancing LLMs, several approaches implicitly adjust users’ data to train the model better. Users run the danger of having LLMs memorize private information in this way. By performing local inference or employing secure, isolated environments called secure enclaves, BlindChat ensures that users’ data is kept private at all times and that they retain complete control over it.

Message from our sponsor

Project management is necessary — it’s how we keep teammates in sync, minimize downtime and bottlenecks, and hit deadlines. However, it’s often tedious. Stressful. Complicated. And most of all, far from your true ambition.

Enter Height, the first and only AI project manager. Height is offloading the drudgery of project management, so you can get back to bringing ideas to life.

AI is woven right into the tasks and projects you’re working on. With Height, it’s frictionless to keep your workflows and projects organized on autopilot.

Project management shouldn’t create more work for you. It should enable you to do the work that matters most — with Height, you’ll be able to do more while doing less.

With Height, experience both cutting-edge AI features and dynamic workflow features you can't find anywhere else.

Enter a new era of project management: https://bit.ly/3sMn6kB 

How can machines effectively read and interpret text-rich images? Microsoft researchers Introduce Kosmos-2.5, a state-of-the-art multimodal literate model adept at processing text-intensive images. Trained on a vast dataset of such images, it serves two main purposes: First, it can generate spatially-aware text blocks, pinpointing the location of each text section within the image. Secondly, it translates these texts into a structured markdown format, preserving styles and structures. This dual functionality is powered by a shared Transformer design combined with specialized prompts and dynamic text representations. Assessed on extensive text recognition and image-to-markdown tasks, Kosmos-2.5 showcases potential as a universal tool for deciphering text-heavy images and represents a leap towards more advanced multimodal language models.

How do the greenhouse gas emissions of AI systems compare to those of humans performing the same tasks? This study compared the CO2e emissions of various AI systems, including ChatGPT, BLOOM, DALL-E2, and Midjourney, with human counterparts. Findings revealed that an AI writing a page of text produces 130 to 1500 times less CO2e than a human, and an AI generating an image emits 310 to 2900 times less. However, this analysis doesn't factor in societal effects such as job displacement, legal concerns, and potential increased AI usage. While AI can't replace all human tasks, it offers the possibility of conducting certain activities with significantly reduced emissions.

Deci AI introduces DeciDiffusion 1.0 – A New Approach To solve the text-to-image generation problem; a research team introduced DeciDiffusion 1.0, a groundbreaking model representing a significant leap forward in this domain. DeciDiffusion 1.0 builds upon the foundations of previous models but introduces several key innovations that set it apart. One of the key innovations is the substitution of the traditional U-Net architecture with the more efficient U-Net-NAS. This architectural change reduces the number of parameters while maintaining or even improving performance. The result is a model that is not only capable of generating high-quality images but also does so more efficiently in terms of computation. The model’s training process is also noteworthy. It undergoes a four-phase training procedure to optimize sample efficiency and computational speed. This approach is crucial for ensuring the model can generate images with fewer iterations, making it more practical for real-world applications.

👉 What is Trending in AI Tools? 

  • Height 2.0 — The autonomous project collaboration tool powered by AI. [Productivity]

  • Adcreative AI: Boost your advertising and social media game with AdCreative.ai - the ultimate Artificial Intelligence solution. [Marketing and Sales]

  • Pixelicious: Pixelicious is an online image-to-pixel art converter. [Image Generator]

  • Canva: Canva has thousands of logo templates you can customize to make your own. [Design]

  • SocialBee: SocialBee is an AI-powered social media management tool that allows you to generate captivating captions and images effortlessly [Social Media]

  • Parsio (OCR + AI chat): Automate your data extraction with an AI-powered document parser. [Productivity]

  • Sanebox: SaneBox's powerful AI automatically organizes your email for you. [Email]

  • Rask AI: a one-stop-shop localization tool that allows content creators and companies to translate their videos into 130+ languages quickly and efficiently. [Speech and Translation]

Sponsored
Bagel Bots20,000+ people read Bagel Bots weekly to learn how to use AI to make more money and save more time.