Goliath: Non-blocking, Ruby 1.9 Web Server

gregwebs · on March 8, 2011

From the FAQ: http://postrank-labs.github.com/goliath/doc/index.html

    How do I deploy Goliath in production?

* We recommend deploying Goliath behind a reverse proxy such as HAProxy, Nginx or equivalent. Using one of the above, you can easily run multiple instances of the same application and load balance between then within the reverse proxy.

I am still wondering about how Goliath fits into both deployment architecture and application development. Traditionally these 2 have always been separated out.

* Thread safety. It is explicitly mentioned that middleware used must be thread safe. Doesn't this hold for all code?

* Can Goliath use multiple cores, or does one instance need to be spun up for each core?

* does it make sense to say, server a Sinatra app from Goliath?

igrigorik · on March 8, 2011

In all cases where we've deployed goliath, its running with a single thread. In theory, nothing stops you from creating a thread to do some background task (or EM.defer), but then you're on your own to make sure you have all the right synchronization logic.

As far as middleware goes, because you are reusing the same "app chain" between multiple requests, you just have to make sure that your middleware does not rely on any instance variables since those will get clobbered by other requests. Check the wiki page on middleware, we have a few examples around this.

Last but not least.. the EM reactor runs on a single core, so you're basically in the same deployment scenario as Thin or node.js. Having said that (always a caveat! ;)), Goliath can run on JRuby and Rubinius.. so in theory we have non-GILed environments there, which means we can start multiple reactors and run across multiple cores from within the same proc. This is not something I've experimented with in practice yet, but in theory its possible.

astrodust · on March 9, 2011

2011 is either going to suck or be awesome depending on how quickly the new Ruby 1.9 features are taken up by Rubinius and JRuby.

Having to choose between multiple threads and a better language is really not a decision you should have to make.

igrigorik · on March 9, 2011

--1.9 mode works really well under JRuby now - that was a big push within 1.6 and they've made a lot of progress. I'd call it "almost there".

Rubinius is a little bit further behind, but I've been able to run our Goliath stack on it, and that exercises quiet a few syntactic changes. So, I think both are close.

getsat · on March 8, 2011

> I am still wondering about how Goliath fits into both deployment architecture

Probably in a manner not entirely dissimilar to Unicorn: https://github.com/blog/517-unicorn

I generally do 2x the number of cores (e.g., eight Unicorn workers on a quadcore box), but it depends on your application. A heavyweight application with 150 models and lots of cpu-intensive tasks will suffer more from context switches than a lightweight one which spends most of its time idle.

mrinterweb · on March 8, 2011

Just wanted to thank Ilya for em-synchrony, em-mysqlplus, em-http-request, goliath, and other gems that help to make developing evented ruby applications much easier.

vamsee · on March 9, 2011

Pardon me if this is a dumb question: how do I hookup Rails (3.x) with it?

mnutt · on March 9, 2011

I'd say that generally, you don't hook it directly into rails. I think it may fill a role similar to node.js, with the added benefits of fibers and the ability to load your rails app's models and libraries.

For instance, I have a 20-line node.js app that does nothing but serve websockets for my rails app. I may replace that with Goliath in the interest of consistency.

yatsyk · on March 8, 2011

Very timely release: I'm creating facade to other http API. Something like the sample in the article. I faced problem with slow responses from other party so I need to move to async request handling. Would you compare Goliath to EM HttpServer or other options?

igrigorik · on March 8, 2011

evma_httpserver is definitely an alternative and one we used early on at PR. Having said that, we migrated away from it because it didn't provide us with the flexibility we needed to do keepalive, pipelining, etc - it basically provides just a very thin layer on top of an EM connection, hence the minimal API and functionality.

Thin would be an alternative to Goliath as well, although we chose to switch to a different parser and also to add the Fiber logic/wrappers right into the framework.

The way I think about it is: if Thin is an app server, then Goliath is more of a minimal framework which you can use to go from start to finish in if you need to bring up an API endpoint. That includes, configuration, routing if you need it, validation, etc.

boundlessdreamz · on March 8, 2011

A noob question: If I have an expensive background job (say pdf generation) is there a way to make it async and use it with Goliath ?

igrigorik · on March 8, 2011

That's a great question actually.. In any evented app, blocking your "reactor" is a big performance problem, since everyone will be waiting for you to complete that operation before anything else can happen. In general, you want to turn any CPU intensive work into an IO-bound operation, where the reactor is "waiting" for the IO notification that the computation is complete.

Now.. How you actually achieve that is a whole different story. You could, in theory, throw a job into some external work queue and poll that, or if your runtime permits, spawn some threadpool and periodically check that, or.. spawn a process and wait on that. In other words, it all depends on the actual operation.

In the case of PDF generation, if you rely on some external tool, you could use a mechanism like EM.system('shell cmd') to spawn a process and wait for it to return you the data.

tptacek · on March 9, 2011

If you're going to be forking off processes to do work for you, I think it's a bad idea to hide the implicit state by using EventMachine to try to manage the pipe I/O and the process state. The reason for that is, sidecar processes screw up, backlog, crash, and eventually start consuming resources you want to track. You end up reinventing the same wheel the Github people did with Resque. Better to reify all those processes from the start with a real queue.

I only make this pedantic comment because EventMachine makes it really easy to start down the path of "just event the process management and I/O", and it seems like you're almost always better off not doing that.

tptacek · on March 9, 2011

This used to be a trickier question, but especially in the ruby world it has a very straightforward answer right now: Resque. Redis is especially amenable to evented clients. So, while there probably are reasonable ways to do an async fork+monitor-a-pipe, the simpler and sounder way to do it in 2011 would be to queue up a job in Resque (or any other "blpop jobs off a Redis list" scheme).

boundlessdreamz · on March 9, 2011

You misunderstood the question. I specifically say "background job". I'm using delayed_job / resque already. But if the results of the job is needed client side immediately then I've to poll the server periodically and fetch the results when done.

What I asked was whether these types of jobs can be done without the server blocking. igrigorik's answered that. The queue might still be needed but the polling can happen from inside the server code rather than from browser (the server will hold a connection from browser, keep polling queue and on success return the response)

tptacek · on March 9, 2011

You're right. It was a noob question. It is a bad idea to hold open an HTTP connection waiting for a long-running job to complete. Good luck with your design, though.

boundlessdreamz · on March 9, 2011

Why would it be a bad idea ? Event servers do not to fall over if there are lot of clients holding http connections open. This is used quite widely. http://en.wikipedia.org/wiki/Push_technology#Long_polling

tptacek · on March 9, 2011

Long-polling, in app designs based from the start on long-polling, is a fine thing (or at least, I'll stipulate that).

Connections that you hold open while forking off and waiting for a process that will take many seconds to complete is just bad UX. There are better ways to do it.

You'll excuse me if after arguing with my response to your one-sentence "noob question: how do I make PDF generation asynchronous from an evented Ruby webserver" I am not chomping at the bit to get into a long architecture debate with you. I meant it: good luck with your design. Sorry I couldn't be more helpful.

kqueue · on March 9, 2011

You can use rpc + message queues for that.

client request1 --> web server --> rpc server --> work queue

client request1 <-- web server <-- rpc server <-- done queue

Of course you need to create consumers to operate on the work queue, process the jobs and put them in a different queue (eg. 'done queue') signaling the rpc server that you finished the job so that the rpc server in turn will reply back to the web server.

otterley · on March 9, 2011

I'm still not convinced that Fiber-related gymnastics are massively superior to callback-related gymnastics, especially if you ever have to debug the magic under the hood (which I inevitably end up doing).

Can someone enlighten me?

gregwebs · on March 17, 2011

With fiber pause resume the onus is put on the IO library writer. Someone has to write a MySQSL library that used threads. The advantage is that this is completely transparent to the library user. A goliath user need not be concerned about events or fibers when using MySQL.

In contrast, a node user must specify a callback every time they make a query. I don't know if the library writer has to do anything special.

sshillo · on March 9, 2011

Know of any async postgres drivers out there?

dj2sincl · on March 9, 2011

There is one built into EventMachine. (https://github.com/eventmachine/eventmachine/blob/master/lib...). Never used it, so I can't say if it's good or not.

molecule · on March 8, 2011

sounds awesome.

also noteworthy: postrank has been running ruby19 fiber'd webserver in production "for well over a year"?

cool!

xal · on March 9, 2011

So has Shopify. Serving terabytes of on-demand resized assets.