Menu

How to randomly select rows from a large table in MySQL super fast?

Let's see how to randomly select rows from a large table quick and efficiently in SQL

Written by Selva Prabhakaran | 2 min read

Problem

How to randomly select rows quickly from a large table in MySQL?

Input

employee_idfirst_namelast_namedepartmentsalary
1JohnDoeHR50000.00
2JaneSmithEngineering60000.00
3AliceJohnsonFinance55000.00
4BobBrownSales52000.00
5EvaWilliamsMarketing48000.00
6ChrisDavisEngineering62000.00
7SarahWilsonFinance56000.00
8MikeJonesHR51000.00
9LindaMartinezSales53000.00
10TomMooreMarketing49000.00
11b0a93c51eaEngineering59930.00
121a7d657e07Engineering56147.00
134d3667775bEngineering54771.00
146fb08b77e7HR59799.00
154a00a88ec8Finance56144.00
1677288d1c5cMarketing50731.00
173fdcae51acSales58556.00
187331337117Engineering52198.00
19d90a8f6b7cFinance53655.00
208ec6a833c8Finance53730.00

Try Hands-On: Fiddle

Create Input Table: Gist

Desired Output

employee_idfirst_namelast_namedepartmentsalary
9LindaMartinezSales53000.00
199bc593de01Finance55591.00
11ff441bf511Finance55456.00
2JaneSmithEngineering60000.00
3AliceJohnsonFinance55000.00

Solution 1:

Using generated Random number

sql
SELECT * FROM employees
ORDER BY RAND()
LIMIT 5;

Explanation:

SELECT * FROM employees: This part of the query selects all columns from the employees table.

ORDER BY RAND(): This clause orders the rows randomly. RAND() generates a random number between 0 and 1 for each row, effectively shuffling the rows.

LIMIT 5: This limits the result set to the first 5 rows. You can change the number to the desired number of randomly selected rows.

Solution 2:

More efficient / faster approach.

Using generated Random number and WHERE

sql
SELECT * FROM employees
WHERE RAND() <= 0.1
ORDER BY RAND()
LIMIT5;

Explanation:

  1. SELECT * FROM employees: This part of the query selects all columns from the employees table.

  2. WHERE RAND() <= 0.1: This condition filters rows with a random number less than or equal to 0.1. By adjusting the value 0.1, you can control the probability of selecting rows. For example, setting it to 0.1 means there’s a 10% chance of selecting each row.

  3. ORDER BY RAND(): This clause still orders the filtered rows randomly to ensure randomness within the selected subset.

  4. LIMIT 5: This limits the result set to the first 5 rows from the filtered and randomly ordered subset.

Solution 3:

sql
SELECT * FROM employees
WHERE RAND() <= (
    SELECT (5 / COUNT(*)) FROM employees
)
ORDER BY RAND()
LIMIT 5;

Explanation:

WHERE RAND() <= (…): Here, we use a subquery to calculate a random selection probability. (5 / COUNT(*)) calculates the probability based on the number of rows in the table. In this case, it selects approximately 5 rows out of the total count.

You can adjust the 5 to control the desired number of rows to select.

  1. SQL for Data Science – Level 1
  2. SQL for Data Science – Level 2
  3. SQL for Data Science – Level 3
  1. Introduction to SQL
  2. SQL Window Functons – Made Simple and Easy
  3. SQL Subquery

More SQL Questions

  1. How to efficiently convert rows to columns in SQL?
  2. How to transpose columns to rows in SQL?
  3. How to select first row in each GROUP BY group?
Free Course
Master Core Python — Your First Step into AI/ML

Build a strong Python foundation with hands-on exercises designed for aspiring Data Scientists and AI/ML Engineers.

Start Free Course
Trusted by 50,000+ learners
Related Course
Master SQL — Hands-On
Join 5,000+ students at edu.machinelearningplus.com
Explore Course
Free Callback - Limited Slots
Not Sure Which Course to Start With?
Talk to our AI Counsellors and Practitioners. We'll help you clear all your questions for your background and goals, bridging the gap between your current skills and a career in AI.
10-digit mobile number
📞
Thank You!
We'll Call You Soon!
Our learning advisor will reach out within 24 hours.
(Check your inbox too — we've sent a confirmation)
⚡ Before you go

Python.
SQL. NumPy.
All free.

Get the exact 10-course programming foundation that Data Science professionals use.

🐍
Core Python — from first line to expert level
📈
NumPy & Pandas — the #1 libraries every DS job needs
🗃️
SQL Levels I–III — basics to Window Functions
📄
Real industry data — Jupyter notebooks included
R A M S K
57,000+ students
★★★★★ Rated 4.9/5
⚡ Before you go
Python. SQL.
All Free.
R A M S K
57,000+ students  ★★★★★ 4.9/5
Get Free Access Now
10 courses. Real projects. Zero cost. No credit card.
New learners enrolling right now
🔒 100% free ☕ No spam, ever ✓ Instant access
🚀
You're in!
Check your inbox for your access link.
(Check Promotions or Spam if you don't see it)
Or start your first course right now:
Start Free Course →
Scroll to Top
Scroll to Top
Course Preview

Machine Learning A-Z™: Hands-On Python & R In Data Science

Free Sample Videos:

Machine Learning A-Z™: Hands-On Python & R In Data Science

Machine Learning A-Z™: Hands-On Python & R In Data Science

Machine Learning A-Z™: Hands-On Python & R In Data Science

Machine Learning A-Z™: Hands-On Python & R In Data Science

Machine Learning A-Z™: Hands-On Python & R In Data Science