• About
  • Privacy Policy
  • Disclaimer
  • Contact
Soft Bliss Academy
No Result
View All Result
  • Home
  • Artificial Intelligence
  • Software Development
  • Machine Learning
  • Research & Academia
  • Startups
  • Home
  • Artificial Intelligence
  • Software Development
  • Machine Learning
  • Research & Academia
  • Startups
Soft Bliss Academy
No Result
View All Result
Home Machine Learning

SeedLM: Compressing LLM Weights into Seeds of Pseudo-Random Generators

softbliss by softbliss
April 7, 2025
in Machine Learning
0
0
SHARES
1
VIEWS
Share on FacebookShare on Twitter


Large Language Models (LLMs) have transformed natural language processing, but face significant challenges in widespread deployment due to their high runtime cost. In this paper, we introduce SeedLM, a novel post-training compression method that uses seeds of a pseudo-random generator to encode and compress model weights. Specifically, for each block of weights, we
find a seed that is fed into a Linear Feedback Shift Register (LFSR) during inference to efficiently generate a random matrix. This matrix is then linearly combined with compressed coefficients to reconstruct the weight block. SeedLM reduces memory access and leverages idle compute cycles during inference, effectively speeding up memory-bound tasks by trading compute for fewer memory accesses. Unlike state-of-the-art methods that rely on calibration data, our approach is data-free and generalizes well across diverse tasks. Our experiments with
Llama3 70B, which is particularly challenging, show zero-shot accuracy retention at 4- and 3-bit compression to be on par with or better than state-of-the-art methods, while maintaining performance comparable to FP16 baselines. Additionally, FPGA-based tests demonstrate that 4-bit SeedLM, as model size increases, approaches a 4x speed-up over an FP16 Llama 2/3 baseline.

† Meta

Tags: CompressingGeneratorsLLMPseudoRandomSeedLMSeedsWeights
Previous Post

Academic Mean Girls

Next Post

Deel’s comms chief departs amidst spying lawsuit from Rippling

softbliss

softbliss

Related Posts

Machine Learning

Beyond Text Compression: Evaluating Tokenizers Across Scales

by softbliss
June 5, 2025
Teaching AI models the broad strokes to sketch more like humans do | MIT News
Machine Learning

Teaching AI models the broad strokes to sketch more like humans do | MIT News

by softbliss
June 4, 2025
NotebookLM introduces public notebooks for sharing
Machine Learning

NotebookLM introduces public notebooks for sharing

by softbliss
June 4, 2025
8 FREE Platforms to Host Machine Learning Models
Machine Learning

8 FREE Platforms to Host Machine Learning Models

by softbliss
June 4, 2025
RLHF 101: A Technical Tutorial on Reinforcement Learning from Human Feedback – Machine Learning Blog | ML@CMU
Machine Learning

RLHF 101: A Technical Tutorial on Reinforcement Learning from Human Feedback – Machine Learning Blog | ML@CMU

by softbliss
June 3, 2025
Next Post
Deel’s comms chief departs amidst spying lawsuit from Rippling

Deel’s comms chief departs amidst spying lawsuit from Rippling

Premium Content

Function Calling at the Edge – The Berkeley Artificial Intelligence Research Blog

Function Calling at the Edge – The Berkeley Artificial Intelligence Research Blog

April 3, 2025
10 Surprising Benefits of Video Games for Kids

10 Surprising Benefits of Video Games for Kids

May 15, 2025
A Guide for Choosing Between F# vs C#

A Guide for Choosing Between F# vs C#

May 16, 2025

Browse by Category

  • Artificial Intelligence
  • Machine Learning
  • Research & Academia
  • Software Development
  • Startups

Browse by Tags

Amazon API App Artificial Blog Build Building Business Data Development Digital Framework Future Gemini Generative Google Guide Impact Intelligence Key Language Large Learning LLM LLMs Machine Microsoft MIT model Models News NVIDIA Official opinion OReilly Research Science Series Software Startup Startups students Tech Tools Video

Soft Bliss Academy

Welcome to SoftBliss Academy, your go-to source for the latest news, insights, and resources on Artificial Intelligence (AI), Software Development, Machine Learning, Startups, and Research & Academia. We are passionate about exploring the ever-evolving world of technology and providing valuable content for developers, AI enthusiasts, entrepreneurs, and anyone interested in the future of innovation.

Categories

  • Artificial Intelligence
  • Machine Learning
  • Research & Academia
  • Software Development
  • Startups

Recent Posts

  • Phishing attacks are evolving, but schools can fight back
  • Gemini 2.5’s native audio capabilities
  • Beyond Text Compression: Evaluating Tokenizers Across Scales

© 2025 https://softblissacademy.online/- All Rights Reserved

No Result
View All Result
  • Home
  • Artificial Intelligence
  • Software Development
  • Machine Learning
  • Research & Academia
  • Startups

© 2025 https://softblissacademy.online/- All Rights Reserved

Are you sure want to unlock this post?
Unlock left : 0
Are you sure want to cancel subscription?