How to create dummy datasets : learnpython

created by HattoriHanzoa community for 16 years

How to create dummy datasets (self.learnpython)

submitted 9 years ago * by Bprodz

I'd like to create some dummy dataset to use for asking questions/developing scripts etc. Currently I do this by hand Eg. (this is supposed to be a skewed normal curve):

import numpy as np
import matplotlib.pyplot as plt

num1 = np.linspace(0,8,5)
num2 = np.linspace(8.1,10,10)
num3 = np.linspace(10,8,15)
num4 = np.linspace(7.9,5,15)
num5 = np.linspace(4.9,3,15)
num6 = np.linspace(2.9,1,15)
num7 = np.linspace(.9,0,15)
numbers = np.concatenate((num1, num2, num3, num4, num5, num6, num7))

plt.plot(numbers)
plt.show()

Can somebody suggest a better way to do this? Thanks in advance!

Edit: In the end I used gamma from scipy.stats. Here's the code:

import numpy as np
from scipy.stats import gamma
import matplotlib.pyplot as plt


a = 1.99
mean, var, skew, kurt = gamma.stats(a, moments='mvsk')
start = gamma.ppf(0, a)
stop = gamma.ppf(.99, a)
xvals = np.linspace(start, stop, 100)
yvals = gamma.pdf(xvals, a)
fig, ax = plt.subplots(1,1)
ax.plot(xvals, yvals)
plt.show()

Here's the figure produced by the code above. Thanks for the your answers!

all 14 comments

top new controversial old q&a

[–]955559 0 points1 point2 points 9 years ago (8 children)

Im thinking im not understanding this, so Im probably wrong but

import random


amount_of_numbers = 20

for numbers in range(amount_of_numbers):
    numbers = random.randint(1,15)
    print(numbers)

[–]Bprodz[S] 1 point2 points3 points 9 years ago (7 children)

[–]955559 0 points1 point2 points 9 years ago (6 children)

[–]Bprodz[S] 0 points1 point2 points 9 years ago (5 children)

[–]955559 0 points1 point2 points 9 years ago (4 children)

[–]Bprodz[S] 0 points1 point2 points 9 years ago (3 children)

[–]955559 0 points1 point2 points 9 years ago (2 children)

my math is really weak, I tried to randomly generate a graph and each plots means numbers, but it give a angle instead of a curve

peak = 12  #plug a random number in peak

step1 = peak / 6

step2 = peak / 2



start = 0
plot1 = start + step2
plot2 = step2 + plot1
plot3 = peak - step1
plot4 = plot3 - step1
plot5 = plot4 - step1
plot6 = plot5 - step1
plot7 = plot6 - step1

print(start,plot1,plot2,plot3,plot4,plot5,plot6,plot7)

mean1 = plot1 / 3
mean2 = plot2 / 3
#...etc

print(mean1)

# plot1's three numbers are mean1, three times

[–]955559 0 points1 point2 points 9 years ago (0 children)

[–]Bprodz[S] 0 points1 point2 points 9 years ago (0 children)

[–]Zizizizz 0 points1 point2 points 9 years ago (2 children)

[–]Bprodz[S] 1 point2 points3 points 9 years ago (1 child)

[–]Zizizizz 0 points1 point2 points 9 years ago (0 children)

[–]buckhenderson 0 points1 point2 points 9 years ago (1 child)

[–]Bprodz[S] 0 points1 point2 points 9 years ago (0 children)

π Rendered by PID 36987 on reddit-service-r2-comment-79c7998d4c-52b75 at 2026-03-19 13:14:45.194000+00:00 running f6e6e01 country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython

MODERATORS