A Fast Start For openMosix 83
axehind writes "Dr. Moshe Bar recently announced the creation of openMosix, a new OpenSource project. The project has quickly attracted a team of volunteers developers from around the globe and is off to a very fast start. openMosix, is an extension of the Linux kernel. openMosix is a Linux kernel extension for single-system image clustering. openMosix is perfectly scalable and adaptive. Once you have installed openMosix, the nodes in the cluster start talking to one another and the cluster adapts itself to the workload.
"
Imagine a Beowulf Cluster of... (Score:5, Funny)
Re:Imagine a Beowulf Cluster of... (Score:1)
Re:Imagine a Beowulf Cluster of... (Score:2, Informative)
So the practical upshot is.... (Score:2)
(also: so how's this differ from Appleseed?)
Re:So the practical upshot is.... (Score:2)
Beowolf usually uses PVM which requires you to do some explicit work with the PVM library. With PVM you also get a better control of something (how tasks distribute to what machines is one). The other thing is that PVM is entirely userspace, and can use a cluster of computers that say has x86 linux boxes and sparcs in it.
Mosix is almost entirely kernel space. When you use mosix you fork off a process and then communicate using unix domain sockets. You can't use threads beacuse there isn't shared memory support, but you don't have to make any calls to a mosix library to make your program work. The downside is that you can't really controll where you proccess goes and the cluster has to be all the same arch(beacuse the entire proccess migrates rather than the correct executable being called on the remote machine).
Sounds like an advertisment to me. (Score:4, Interesting)
Re:Sounds like an advertisment to me. (Score:1)
I don't get the same impressions from Daniel Robbins of Gentoo [gentoo.org], who wrote Advanced filesystem implementor's guide [ibm.com] for IBM's developerWorks.
Re:Sounds like an advertisment to me. (Score:2)
I think it was more. But who cruted these Open Source Developers in the first place?
Hype? (Score:5, Insightful)
Nothing like a 'perfectly' statement to discredit a story.
Re:Hype? (Score:2, Insightful)
Maybe interpret it as "as perfectly as can be done with existing (read common, cheap) technology" instead.
Sure, we know Amdahl's law is pretty much like the laws of thermodynamics (the best you can do is break even, and you can't even break even).
However, unless you are talking about high-budget professional solutions (e.g. Cray, HP Superdomes, most big shit fom Sun, other highly integrated solutions with custom inter-processor/memory communications), you're always going to take this hit, and openMosix has no reason to be worse than other simple solutions. And if it can reach a state where there seems to be no performance improvement without throwing hardware at it, then surely it could be said to have reached perfection? However, that's a lot of "if"s, and is all pie in the sky at the moment; whether it achieves this 'perfection' target remains to be seen.
YAWAIW.
Re:Hype? (Score:1)
This + Linux Terminal Servers = Cool (Score:5, Interesting)
Re:This + Linux Terminal Servers = Cool (Score:2)
I had to read that sentence twice just to work it out!
Re:This + Linux Terminal Servers = Cool (Score:1)
Wait! Where is X-windows plug-in for Mozilla?
Great for older hardware (Score:5, Interesting)
Re:Great for older hardware (Score:1)
Re:Great for older hardware (Score:1)
Though yes, there certainly are cases where this will be unable to handle the load (as with any computer), but the load level an openMosix cluster can handle will be, for almost all uses, it would seem, much larger than any of the nodes in single.
-Knots
Re:Great for older hardware (Score:1)
However, you're right, second time they do it, it should be instant from the cache.
YAWIAR.
Does MOSIX handle open socket migration yet? (Score:2, Interesting)
Re:Does MOSIX handle open socket migration yet? (Score:3, Insightful)
Mosix (Score:2)
I have found This link [debian.org], but that suggests a code fork rather than a revival-after-closing-source.
Re:Mosix (Score:1)
http://foundries.sourceforge.net/clusters/index
Re:Mosix (Score:1)
Original author can create closed source (Score:1)
Re:Original author can create closed source (Score:1)
Re:Original author can create closed source (Score:1)
Re:Original author can create closed source (Score:1)
Re:Original author can create closed source (Score:2)
obviously, IANAL, so if anyone wants to take a stab at answering this one..
Re:Mosix (Score:2, Insightful)
Re:Mosix (Score:1)
I love this (Score:2)
Re:I love this (Score:1)
Not without limitations... (Score:5, Informative)
Under some workloads, I can go along with the assertion that a MOSIX cluster is just like having a big machine with a lot of CPU's. It seems to be great for those workloads and I would love to try it out. Those loads tend to be multiple long running (more than a few seconds) and not multithreaded. For MOSIX to be most efficient, there also needs to be fewer jobs than there are CPUs to run them.
Other workloads, however, will not benefit from MOSIX. These statements are based on reading the docs a couple weeks back, not on actual experience.
Under the MOSIX model, when a process forks, the child may run on the current machine or it may migrate somewhere else. If the job is short lived (ls, echo whatever | sed s/blah/baz, you get the point) MOSIX will perform poorly because it will spend more time trying to figure out where the process should run than would have if it had just run the program on the local host.
If you need more CPU time than one CPU can provide and your program is multi-threaded, a single multiprocessor machine will also work better. This is because MOSIX does not yet support threads running on different machines. A 128-node cluster of 386's is going to run Netscape slower than a single 486 because you will only be using one 386 CPU.
For cases where you just have too many jobs for the resources available (CPU or memory), you may be better off with something like Condor [wisc.edu]. It is great for submitting batch jobs, migrating those jobs around, and only running the number of jobs that the system can handle.
Re:Not without limitations... (Score:3, Informative)
Other workloads, however, will not benefit from MOSIX. These statements are based on reading the docs a couple weeks back, not on actual experience.
Speaking from experience, you are pretty much correct. Jobs that use lots of CPU, but have little IO are good for mosix clusters, but jobs that have high IO are bad. The mosix filesystem and other things can partly get around the IO problems if the users plan carefully, but mostly they just want to start 30 jobs and forget about it for a few days.
There is no reason that a mosix cluster can't be combined with a batch/queueing system. This lets lazy/stupid users run their CPU bound jobs and lets mosix distribute them, but more savy users can script their IO jobs to run on particular machines and use local disk for IO.
It took a few months for the users of the cluster I setup to get trained into what jobs work well, and which kill the cluster. The problem is that launching 40 "good" jobs on a single machine is not a problem, because they just shoot out to the other nodes, but launching 40 "bad" jobs on a single machine will make that machine almost unusable.
This can have adverse effects on the cluster if the good jobs were started from the overloaded machine; for example the good jobs might have to check back with their originating machine every few minutes to update a checkpoint file.
Basically, mosix isn't some magic bullet to solve machine limitations, but it is a very cheap and effective way to solve certain problems.
Re:Not without limitations... (Score:1)
> the child may run on the current machine or
> it may migrate somewhere else. If the job is
> short lived (ls, echo whatever | sed
> s/blah/baz, you get the point) MOSIX will
> perform poorly because it will
> spend more time trying to figure out where
> the process should run than would have if it
> had just run the program on the local host.
No. openMOSIX keeps statistisc on what proccess do and use it to decide whether migrating them will be usefull, so short lived jobs will never be considered for migration.
As a matter of fact the greedy algorythm that openMOSIX use (developed by Prof. Amnon Barak and his team from the Hebrew university) will pretty much avoid migration at all cost unless it is absoultly positive that migration will be usefull: Most of the time the problem is that proccess you want migrate don't migrate and not the other way around.
It is of course true that openMOSIX has some limitations. For example, proccess using shared memory can't migrate at the moment.You just can't win them all
Re:Not without limitations... (Score:1)
For example, we run a LAM [lam-mpi.org] MPI implementation on our cluster which allows us to carefully arrange which parts of each job go on each CPU and maximize efficiency. What's more, if we miscalculate, and one job on a heavily loaded machine tears off on a long cpu-burst, mosix will step in and migrate it over to a less loaded system for the duration of the burst.
None-the-less, it helps to inform your users not to start up 50 I/O bound jobs on one node and expect them to migrate. You end up having to give users access to multiple nodes to help balance load and this reduces security.
All in all, I've found mosix is very useful if your users know how to code for it. Standard software will typically not benefit too much. That said, if you have a couple of cd-rom drives in your machine, grip performs quite nicely: the ripping takes place wherever the drives are, but the mp3 encoding tends to migrate across the cluster beautifully
openMOSIS (Score:1)
Darn... I thought this said openMOSIS [mosis.com].
I don't think anyone would mind a sourceforge for chip building (especially free nightly builds!)
More on topic and to the point - it is good to see that MOSIX tech is now available opensource (stable anyway). Now we have yet another viable option for speeding up our Beowulfs (MOSIX is generally run with PVM/MPI - not as a replacement).
Migratable sockets/filehandles yet? (Score:3, Informative)
I don't know if this is the case any longer, I heard rumor that all these things were going to be implimented, so it'll be an interesting project to watch.
Good Luck Open Mosix!
-The JungleBoy
Re:Migratable sockets/filehandles yet? (Score:1)
From what I gathered from the OpenMosix Internals [sf.net] paper (which is very informative, BTW), when a process that has been migrated to another machine wants to perform network or file I/O, it communicates over the network to the UHN (Unique Home-Node), where the actual I/O operation will occur. The same goes for machine-specific system calls (gettimeofday() was used as an example).
"One drawback of the deputy approach is the extra overhead in the execution of system calls. Additional overhead is incurred on file and network access operations. For example, all network links (sockets) are created in the UHN, thus imposing communication overhead if the processes migrate away from the UHN." There's probably a more specific quote in the paper.
You seem to be right about processes using shared memory: "For applications using shared memory, such as Web servers or database servers, there will not be any benefit from OpenMosix because all processes accessing said shared memory must resided on the same node."
openMosix? (Score:3, Funny)
Can I make use of this? (Score:1)
If I want to set up apache and FTP access to my network, could I use a Mosix cluster to help distribute the load? Maybe I could use a mosix cluster to speed the rendering of video that I edit on my workstation somehow?
This stuff is just too cool to have absolutely no practical application in my life.
Re:Can I make use of this? (Score:1)
Kernel compiles (Score:1)
Wasn't it already open? (Score:1)
is this a good way to run a desktop environment (Score:2)
if anyone has tried using a cluster for 'end user desktop apps' how does it work out fsater/slower/no diff?
Re:is this a good way to run a desktop environment (Score:2)
The problem is that I'm not sure I'd get any benefit from all that work (learning Mosix or Beowulf, implementing it, etc).
Well, since nobody else seems to be doing it, I guess I'll have to break down and do it one of these days...
Anyone want to donate a gigabit switch?
What apps runs faster/slower/equal (Score:1)
From reading posts, it seems graphic rendering is faster. Darn, I'm interested in Web servers and Database servers
FreeBSD (Score:1)
Mosix for BSD (Score:1)
From the emails I swapped with the fellow in charge of the project (I'm going from memory, and this was towards the end of 1998), they really liked BSD, and all the code was written for the BSD kernel, but had, in the end, decided to rewrite for linux.
I was crushed, as I had just setup a nice small network of BSD machines (for bandwidth and QoS testing), and really wanted to try it... but, I got over it, and decided clustering wasn't going to address any of my issues anyway.
Video processing (Score:1)
Or, render farms needed for those thousands of figures in battles in Lord of The Rings