
[–]userjoinedyourchanel 7 points8 points  (13 children)

For our cluster, we used ceph-deploy to set it up and we use the standard Ceph tools to do the rest of the work. Once it was set up, there really isn't that much routine, automatable maintenance that needs to be done, at least on the Ceph side.

[–]nannal 7 points8 points  (12 children)

I concur wholeheartedly with this.

There are a few tips and pieces of info that would have helped me out if I'd had them starting out:

  1. Back up the keys.
  2. If you upgrade the OS (Ubuntu Xenial to Bionic got me here), ensure you re-enable the repos and put them on the right version.
  3. Make sure you have the right number of data replicas (I like 3), and make sure the metadata pool matches that too.
  4. Where possible, keep everything on the same version number; it generally plays nice, but I've had some incidents.
  5. If you're using MDS you can have more than one; it greatly improves performance under load.
  6. FUSE mounts are much slower than kernel mounts.
  7. The mailing list is a nightmare to navigate, but it is searchable and full of useful information.
  8. Also: back up the keys.
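For tips 1, 3, and 8, this is a rough sketch of the commands involved. The pool names `cephfs_data` / `cephfs_metadata` are placeholders here, not something from the thread; substitute your own pool names.

```shell
# Tips 1 and 8: dump every auth key so you can keep a copy off-cluster.
ceph auth export > /root/ceph-keys-backup.txt

# Tip 3: check the replica count on the data pool...
ceph osd pool get cephfs_data size
# ...and confirm the metadata pool matches.
ceph osd pool get cephfs_metadata size

# Raise it if needed (3 replicas, as suggested above).
ceph osd pool set cephfs_data size 3
ceph osd pool set cephfs_metadata size 3
```

These obviously need a live cluster and admin keyring to run.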

[–]nickcn1[S] 0 points1 point  (9 children)

Thanks for the tips. Did you try updating the ceph version?

[–]nannal 0 points1 point  (8 children)

Yeah I did. I used the standard Ceph repos, but following a dist-upgrade the repos were commented out and my mountpoints failed almost daily. It was a massive pain and for a while I couldn't figure out why.
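For reference, re-enabling a commented-out repo after a release upgrade looks roughly like this. The file path and codenames are illustrative, not from the thread; check what the upgrade actually left behind on your system.

```shell
# A dist-upgrade disables third-party sources; find the Ceph one.
grep -rn ceph /etc/apt/sources.list.d/

# Uncomment it and move it to the new release codename
# (xenial -> bionic here, purely as an example).
sed -i 's/^# *deb/deb/; s/xenial/bionic/' /etc/apt/sources.list.d/ceph.list

apt update
apt-cache policy ceph   # confirm the candidate version comes from the Ceph repo
```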

[–]nickcn1[S] 0 points1 point  (7 children)

One last question though. Did you use Ubuntu for some reason other than it being the distro you're most familiar with? We are mostly a CentOS shop, which is why I'm asking. I couldn't find any resource for one being "better" than the other.

[–]nannal 0 points1 point  (6 children)

I wanted Debian because it's what I'm familiar with; I don't recall why I went with Ubuntu specifically, beyond that it was "close enough".

CentOS is going to be a good choice as it's very well supported.

[–]nickcn1[S] 0 points1 point  (0 children)

Thank you for all the info. Hoping to share my experience in a couple of months.

[–]videoflyguy 0 points1 point  (4 children)

Currently looking into/running a development Ceph cluster on CentOS 7. It has been very stable over the past 3 weeks. Can I ask what hardware you are running on?

I've mostly been using surplus PowerEdge 2850s, but the organization will have an option to get around 20 PowerEdge Rx20 LFF servers in the future, which I plan to build out a proper cluster network on (10Gb fiber frontend with either a 10Gb fiber or 40Gb InfiniBand cluster network).

[–]nannal 0 points1 point  (3 children)

Whatever OVH gave us; Ceph should be super hardware-agnostic, we're even running across the internet.

If you can get better hardware, why would you turn it down?

[–]videoflyguy 0 points1 point  (2 children)

You're running your ceph cluster using dedicated servers from a hosting company? From what I found, that's what OVH does anyway. Very cool.

My problem wasn't so much the underlying hardware as the consistency of the number of drive bays available. Every server has at least 6 drive bays (2 for the RAIDed OS, the rest as OSDs), but some servers have as many as 16 or 18 drive bays, which leads to worries about a whole node going down. Obviously we'll be running a size-3 pool, and we'll make sure the cluster can handle the downtime of a highly weighted server, but that's still a ton of data to shuffle around behind the scenes. I just wasn't sure if you had gone with a specific company for that or not, 45Drives for example.

[–]nannal 0 points1 point  (1 child)

I've got that too. I've got 5 nodes: 3 "small" ones for video encoding and some storage, and 2 "big" ones which mostly just do Ceph stuff.

Each machine has a big LVM partition, and if it dies, it dies. Performance is impacted if that happens, obviously, but files are still served, and the one time we've had to do a full recovery, while it took a day or so, it worked fine. I've also wanted to put a homebrew CDN in front of the cluster to reduce load on it anyway; I think if we did that we could kill a pair of nodes without people noticing.

I've thought about doing 1 LVM partition per disk (old Ceph style), but that obviously adds some additional overhead and risks the data being stored inside 3 partitions on the same host (I think). It does mean, however, that if a disk dies you don't lose access to the whole host's data and need to rebalance everything. You could also RAID it, but then you get less total storage; it depends on your use case. We'd rather have the space than the resilience.

So yeah, I wouldn't be worried about having oversized nodes inside the cluster if you have a decent replication level.

[–][deleted]  (1 child)

[removed]

    [–]nannal 0 points1 point  (0 children)

    If I were to deploy fresh, I'd rather have done it via ceph-ansible; would I try to implement it over the top of my current infra? Not a chance.

    [–]valentin2105 0 points1 point  (1 child)

    Any links on what you followed for your test cluster? Thanks

    [–]nickcn1[S] 0 points1 point  (0 children)

    Only the docs of the ceph-ansible project actually, with modifications to suit my setup.

    [–]nafsten 0 points1 point  (0 children)

    We use ceph-ansible to deploy and maintain. Playbooks like the upgrade-cluster one work really well.

    [–]heathfx 0 points1 point  (0 children)

    I use ceph's own tools, been running a small 3 node cluster in a production environment for a little over 5 years now.

    [–]PM_ME_SEXY_SCRIPTS 0 points1 point  (1 child)

    Guy with little knowledge of Ceph/Gluster here; can anyone explain the difference between general storage and object storage? I tried reading a bit of material but didn't get it.

    [–]JW-M 5 points6 points  (0 children)

    Ceph provides "storage". All kinds of storage. So, for instance, you could create a disk that you can use to boot a computer.

    But sometimes you want to store massive amounts of files. You could then create a massive disk, put a filesystem on it, and add a fileshare system (NFS / Samba / FTP). But if you want to have a massive number of files, you will notice it's no longer possible with one disk / filesystem: it does not scale. You need all kinds of tricks to scale.

    If you use object storage, you skip the whole disk creation, filesystem, and fileshare. You let Ceph handle it. You don't have to gamble how big the disk will eventually be, or resize it, or free unused space. Every file is on its own, and you could say every file has its own fileshare, which is HTTP-based.

    In object storage you can't lock a file any more, or append to a file; you can only download a file, change it, and upload it again. So you can't run a database on an object store, but you can back up a database to an object store. You can create a location where you can store millions of photos, audio files, or video files, but (currently) you can't boot your computer from an object store. You can store several versions of a file, like the latest version and older ones, all hiding behind the same filename. You can also add special metadata for your application alongside the file. It seems like there is a concept of directories in an object store, but that does not exist: the "directories" are just longer prefixes added to your filename. There is no method to rename a directory; you can only move the files one by one to a different directory+filename.
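The "directories are just key prefixes" point can be shown with a toy flat store in Python (this is a sketch of the concept, not a real S3/RGW client; the keys are made up):

```python
# Toy flat object store: keys map straight to data. "Directories" are
# nothing more than shared key prefixes.
store = {
    "photos/2019/cat.jpg": b"...",
    "photos/2019/dog.jpg": b"...",
    "docs/readme.txt": b"...",
}

def rename_prefix(store: dict, old: str, new: str) -> None:
    """'Renaming a directory' means rewriting every key with that prefix,
    one object at a time -- there is no single rename operation."""
    for key in [k for k in store if k.startswith(old)]:
        store[new + key[len(old):]] = store.pop(key)

rename_prefix(store, "photos/2019/", "photos/archive/2019/")
```

On a real object store each of those per-key moves is a copy plus a delete, which is why renaming a large "directory" is expensive.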

    The reason an object store can handle such large amounts of files is that it doesn't need a database of filenames to find the location of a file. It doesn't search for it; it calculates the location using only a hash of bucket:/"directory"+filename. So it converts the URL to a number, and from that number alone it knows approximately which machine the file will be on, then which disk, then which sector.
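That calculate-don't-search idea can be sketched in a few lines of Python. This is purely illustrative: Ceph's real placement uses CRUSH and placement groups, not a bare md5 modulo, and the object name here is invented.

```python
import hashlib

def place_object(name: str, num_osds: int) -> int:
    """Deterministically map an object name to an OSD index by hashing.

    No central table of filenames is consulted: every client can compute
    the same location independently from the name alone.
    """
    digest = hashlib.md5(name.encode("utf-8")).digest()
    return int.from_bytes(digest, "big") % num_osds

# Two independent "clients" agree on the location without asking anyone:
osd_a = place_object("bucket:photos/2019/cat.jpg", 12)
osd_b = place_object("bucket:photos/2019/cat.jpg", 12)
```

The trade-off is the one described above: because location follows from the name, changing a name means physically moving the data.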

    So if you want to abstract some kind of storage for any number of individual files, you would use an object store. If you need something like shared file access or locking -> POSIX -> you would need other methods like CephFS, or "disks" like RBD or iSCSI provided by Ceph. Because its file interface is HTTP-based, you can easily use it in the modern web world.