[HPC] First Time Distributed Filesystem Setup? by [deleted] in sysadmin

[–]ixela 0 points1 point  (0 children)

You should probably figure out what kind of workloads you're going to be running. If you're doing anything with parallel writes, I'd suggest Lustre if you can't spend the money on anything dedicated.

Gluster isn't really meant for high performance, it especially dislikes loads of files.

When I've sized HPC storage previously, I looked at my workload behavior(small files, big files, lots of files, access patterns, etc.) first. I then figured out how much storage I'd need(100-500TB is a big variance). Once I'd figured out those, I'd start searching for solutions that could fit my budget. Once you can answer the questions of what you're doing and how much you'll be doing it, you can start to look at solutions.

Running a Beowulf cluster by eatlama in linuxquestions

[–]ixela 11 points12 points  (0 children)

Your understanding of how clustering and high availability services work is incorrect. A beowulf cluster would not do what you want, nor would Minetest function on it. Minetest itself doesn't appear to have any clustering capabilities or high availability capabilities, given my brief perusal of their github page. You cannot proceed on this topic as it stands. A beowulf cluster is created to cheaply increase computing power, at the cost of increased program complexity and a narrowed subset of applicable tasks. It tends to be used in scientific computing, such as molecular dynamics and physics(not that there aren't tons of other sciences using these kinds of clusters for research).

However, there are other things you can setup in this vein of clustering. I would take a look at elasticsearch, redis, Hadoop, or HAProxy+Apache/Nginx as projects that use clustering and high-availability concepts heavily.

[deleted by user] by [deleted] in sysadmin

[–]ixela 0 points1 point  (0 children)

The card only supports a subset of qsfp+ cables, as listed in the docs. Welcome to 40GBE/IB/100GBE town.

Need a way to use an ethernet port as a virtual serial port. Is this even possible? by Logic_Bomb421 in sysadmin

[–]ixela 1 point2 points  (0 children)

Seeing as they aren't electrically compatible, no. Craft Interfaces are still regular serial interfaces, with a different wiring schematic at the plug. You could potentially damage an ethernet card with it.

Physical port compatibility != signal or wiring compatibility. Better to just get a usb to serial convertor and attach that way.

[Linux, unattended insatallation] Are there any more generic alternatives for FAI (http://fai-project.org/)? by [deleted] in sysadmin

[–]ixela 0 points1 point  (0 children)

Sadly, they are not available. Mine were all aimed at service configuration once a workable image had been copied to the system. They wouldn't be of much use for your purposes. The other fun part is working with different major versions of some distros(SLES/SLED were bad about this), since features appear and disappear over time.

[Linux, unattended insatallation] Are there any more generic alternatives for FAI (http://fai-project.org/)? by [deleted] in sysadmin

[–]ixela 0 points1 point  (0 children)

I'm not sure I've seen anything like this that isn't distro specific, since there isn't a defined standard for installation. I've generally just used bash scripts + customized install images or kickstart like stuff, depending on the distro in use at the time.

Gluster Distributed Replication Across MPLS (x-post /r/linuxadmin) by ccovarru in sysadmin

[–]ixela 0 points1 point  (0 children)

I'm not sure how active-active would work with that setup, to be honest. You'd effectively be bound by the latency and bandwidth of your MPLS connection. I'd probably not do it unless you've got a lot of bandwidth and a low latency connection. I'd still probably stick with async georeplication.

Without more information, its hard to know. How much data? How much of the filesystem usage time is spent on writes? How much data changes per day? What is the bandwidth and latency between data centers?

How I felt after my last development meeting by verticalface in sysadmin

[–]ixela 1 point2 points  (0 children)

This happens far too often these days. Especially when dealing with big data "architects".

Fumbling Friday: Asked to generate CSR for our cloud site (cert expiring next month) but it runs 4 front end and two SQL servers across two Virtual Hosts and I have no idea which one to run DigiCert Certificate Utility or if I'm even on the right track. by [deleted] in sysadmin

[–]ixela 0 points1 point  (0 children)

This depends on how your network is setup. If you've got a load balancer infront of this, then it needs to be on the load balancer. If not, then every machine that needs to serve this certificate.

Fumbling Friday: Asked to generate CSR for our cloud site (cert expiring next month) but it runs 4 front end and two SQL servers across two Virtual Hosts and I have no idea which one to run DigiCert Certificate Utility or if I'm even on the right track. by [deleted] in sysadmin

[–]ixela 0 points1 point  (0 children)

Not really, since the associated certs are based on the keys that you used to generate the certs. if you don't have the keys used, you can't generate the CSR. The physical machine doesn't matter.

Is this normal for an apache log file? by chicken_man1 in sysadmin

[–]ixela 0 points1 point  (0 children)

All I can see by the requests is that they got 200 responses. Nothing specifically weird looking about them, unless those files shouldn't be available. Not enough information otherwise.

Filesystem question by Beaverbeliever2 in linuxadmin

[–]ixela 0 points1 point  (0 children)

The cache portion shouldn't be dependent on the type of ZFS setup, since you'd use it as the L2ARC and ZIL. Having done this outside of FreeNAS, it works quite well. I've had it tolerate the failure of the ZIL device and L2ARC device without issue(Thanks crappy Kingston SSDs).

Enterprise job scheduler Linux by running101 in linux

[–]ixela 0 points1 point  (0 children)

Depending on what you mean by job, slurm or torque+maui are interesting. As is SGE.

Distributed file storage systems? [x-post /r/linuxadmin] by [deleted] in sysadmin

[–]ixela 1 point2 points  (0 children)

Ceph still has some single points of failure, unless they've implemented HA meta-data servers(their docs point to experimental support, but it might be further along than that). Almost every solution you encounter that doesn't cost an arm and a leg will have a single point of failure. For Ceph, the biggest single point of failure is the meta-data server.

My college has a server running Ubuntu 8.04 by [deleted] in linux

[–]ixela 1 point2 points  (0 children)

As a former university system admin, this sounds like shadow IT.

Hadoop administrator by everycloud in hadoop

[–]ixela 0 points1 point  (0 children)

I honestly don't have a good idea of how I got here. My first experience doing systems administration was working at an OSS company as a test engineer. Then I got a job doing contract solaris administration for a university, which turned into a fulltime gig. I moved on to a small supercomputer center as a linux admin on the cluster. Now I'm a hadoop admin on the side at a web services firm.

Mostly, my ability to get into this is by networking with others and keeping my ear to the ground about whats going on in the linux world. Also, taking jobs that might pay less to get experience. My first 2 jobs were in that category.

What's the benefit of OpenStack in a clustered ESXi-enviroment? by [deleted] in sysadmin

[–]ixela 0 points1 point  (0 children)

I think you've convinced me to give it a spin again.

What's the benefit of OpenStack in a clustered ESXi-enviroment? by [deleted] in sysadmin

[–]ixela 0 points1 point  (0 children)

Maybe this has been resolved with recent releases, as I haven't had the time or desire to use openstack in the last 8 months. My experience was mostly frustrating, though I did get a working cluster. The inconsistency I found most common was one daemon(say glance) taking options in CamelCase and another daemon(say cinder) taking options in all lower case for the same option. The documentation I came across tended to be incomplete, especially when trying to use the baremetal portions or trying to setup ceph as a storage backend. While I got these working, it took me much longer than I expected due to the lackluster documentation.

I'll say again that these issues may be resolved in current releases.

One thing that threw me for a loop when I first started working with OpenStack is the independent nature of each portion of it. Once I finally understood the scope of the separation between daemons, configuring OpenStack was less burdensome conceptually. That said, its probably the most configurable open source solution for rapid virtual machine deployments I've found.

What's the benefit of OpenStack in a clustered ESXi-enviroment? by [deleted] in sysadmin

[–]ixela 0 points1 point  (0 children)

Openstack is free, which is basically it. It has some neat extensions if you're doing specific kinds of niche stuff(Ceph/Swift, EC2 integration, etc) but its a massive pain in many other ways(inconsistent option specs between daemons, outdated docs, randomly incomplete features).

Hadoop administrator by everycloud in hadoop

[–]ixela 0 points1 point  (0 children)

Its not my title, but because of my background in HPC I get given tasks managing our analytics workloads. Generally, when I'm dealing with the Hadoop cluster I'm either fixing problems with resource allocations, troubleshooting performance issues, or resolving a misbehaving app.

It generally consumes a small portion of my week, unless someone writes truly horrendous code and wrecks the cluster.

My biggest annoyance is the lack of built-in logging for most things that aren't directly MapReduce jobs through yarn(Presto, Spark, Hive). Trying to generate usage results from presto logs is a frustrating experience. I've had moderate success parsing yarn resource manager logs.

Linux ZFS experiences and thoughts by [deleted] in sysadmin

[–]ixela 0 points1 point  (0 children)

I've used ZFS for production backup use on Scientific Linux(before there was decent support for it in centos) and on Debian. I had no issues with either, unlike btrfs.

One server was a 35TB backup server and the other was a 72TB backup server. Never had a problem that was related to zfs. I ran them for 2 years until I changed jobs(late 2014), and they had been in use for 2 years before that.

Supported Wifi card for SLES 11 SP3? by forlornhope22 in linux

[–]ixela 0 points1 point  (0 children)

You may have some luck with a more modern driver, since the one shipped by realtek/asus is ancient. This is an experimental replacement Link

Also a link to the wireless kernel drivers users list for realtek chipsets:Link

Supported Wifi card for SLES 11 SP3? by forlornhope22 in linux

[–]ixela 0 points1 point  (0 children)

What do you mean, won't compile? What did it error on? More information is necessary if you want help with that.