Crazy that in only two years, using novel techniques, she basically implemented a fully functioning JIT compiler that performs on par with established commercial JavaScript engines on many fronts, yet reviewers reject her papers because they are unconvinced the novel technique has advantages over existing ones? How does that even matter?
If someone discovers a new way of implementing compilers efficiently, then just that has scientific merit. Whether the new technique has different performance characteristics surely is simply ground for further research?
Anyway, awesome that at least the engineering community is very interested in this work of hers.
edit: I noticed the compiler is written in D, awesome choice of tech!
Unfortunate as it is, peer review is quite broken, and it is not at all clear how to fix it. Even if the intended goal is otherwise, in practice peer review in CS turns into an adversarial exercise, though to varying degrees.
Rather than inspecting a submission and asking whether there is anything in the paper that might be of interest to the community, or whether the paper could spur real progress, the review process usually boils down to: can I quickly skim to find something, anything, with which I can shoot down this paper, get away with it, and be done with this review? This is of course a generalization, and I am not implying that this is what happened here. Unfortunately, the prototypical reviewer has become more like a prosecutor. There are various reasons why this has become the norm; people are aware of it, and some are even trying hard to figure out ways to make it better, whereas others believe this is exactly how it should be.
If accuracy of prediction were the concern, the Copernican model would never have been accepted. It took years of polishing for the then-new model to attain the accuracy of the then-prevalent one. Some would argue that such models (the Copernican, for example) should remain unpublished and under covers until they beat the state of the art. I personally do not agree with this view. I would rather have a reasonably well-baked but not yet perfect model out in the ether soon, so that it can recruit other people to work on it. It really boils down to where in the spectrum the community should operate: (i) spammed by advances of questionable merit, with the community spending effort to winnow the good from the bad, or (ii) making sure that only the rock-solid ones get through, at the risk of delaying or extinguishing a useful advance and waiting for a rediscovery.
I'm guessing the paper was submitted to PLDI or POPL (wordpress is blocked here, so I can't read the context), the tier one PL conferences. The communities are not adversarial so much as they are enamored with science. I think from that perspective, rejection is understandable: they want more numbers and comparisons, and might accept the paper even if these numbers are not very enlightening (call it going through the motions). You also don't get many points just for wild experimentation.
For the same reason, many of the papers accepted seem very incremental and disappointing from a "new idea" perspective; and once you get the paper in, you have to go to the conference (I'm not a PLDI/POPL person)! One can always publish in a "new idea" PL conference (like my personal choice, Onward!, and there is even a new one called SNAPL from the people that brought us PLDI and POPL), but the points you get for publishing at those venues aren't worth as much as at the true technical conferences (I don't care, since I have a job, but for a grad student, it's a big deal).
My advice would be to suck it up and write a good PLDI paper. Be very thorough and honest with numbers and comparisons, and try to do it "right" (no pointless numbers that don't contribute to the story even if they might "work"). Then with a little bit of luck in PC reviewer selection (most PLDI reviewers are reasonable, some are not), the paper would probably get in. And if done right, it could be one of the really good papers that are lacking these days. Actually, that is another point: sometimes rejection is good, because the paper that gets in later is much, much better. Patience.
This, I think, is a major weakness of the typical conference publication process: a single-pass accept-or-reject decision. Journals have a much more humane process. Take a potential sentence from a reviewer:
"This work has significant novelty, but is lacking a comparison against technique XYZ."
For a Tier 1 conference submission, that's a rejection. For a journal, that's a revision request. Of course, in computer science, conferences are where the action is. But there are attempts to split the difference. VLDB has moved to a more journal-like acceptance process (http://www.vldb.org/2015/submission-guidelines.html; rolling deadline at the start of each month through the year, and reviewers can request revisions), which some co-authors and I are currently going through. It is much more reasonable.
SIGPLAN is a conference-oriented community where the journals are not very interesting :). They also seem to be playing with more shepherding, but in my experience, once a paper has been tentatively accepted, it is pretty much going to get in unless the authors royally screw it up. There are also papers that are never going to make it in no matter what, and rolling deadlines can provide some false sense of hope there.
Concerning your critique of what you see as an overly harsh scientific review process: according to her blog, the paper was rejected because "Reviewers at conferences [...] have been very skeptical, and pressed us multiple times to produce some kind of comparison of basic block versioning against tracing."[1] I think this is a very valid concern. When you describe a new scientific approach (basic block versioning), you should compare it to the state of the art. Otherwise it is very hard for the reader to judge the merit of the new approach. However, I agree with your sentiment that a new approach should not only be judged by performance numbers (there is actually a nice article going deeper on this topic on databasearchitects[2]). But there should at least be a more thorough theoretical discussion than only 3-4 sentences. Benchmarks comparing against existing approaches can in this case help to expose pathological cases that might indicate weaknesses of the approach, or to empirically demonstrate its feasibility.
EDIT: Many conferences also allow publishing papers without a deep comparison to existing research in the industrial session. This allows demonstrations of interesting implementation variants or system choices.
EDIT 2: The review criticism in older blog posts, "Conference reviewers criticized us for not discussing compilation times, and raised the issue that perhaps basic block versioning could drastically increase compilation times.", is also very valid for a JIT compiler. Again, discussion does not have to mean that you have to be faster than all existing systems. Paper acceptance is always a somewhat random process, but at least in this case the review comments are valid from my point of view, and her PhD advisor should probably have caught these problems in proofreading before submission. I really hope that the paper will finally be accepted!
Note she's only comparing it with TraceMonkey (which is now gone from SpiderMonkey) and the initial V8 compiler (i.e., not the optimising Crankshaft compiler), which makes competing much easier. It's about the timespan I'd have expected: Carakan (the second-latest in-browser JS VM; Chakra is the latest, but relatively little is known about it) was essentially feature-complete and competitive on perf after nine months of development (with 3.5 developers, and an increasing amount of QA, from 0.5 up to 2). Carakan's big remaining task was finding and fixing ever more obscure bugs (especially in the compiler), which took almost the same time again.
Could it be that this technique is not as broadly applicable as the existing JIT techniques? Would it be possible to use the approach taken with Higgs to implement a language with a more open ended object system such as ruby? In JavaScript there's an upper bound on what a + b could mean, does BBV work well for more general method dispatch?
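To make the dispatch question concrete, here is a toy sketch (in Python, with made-up names; this is not Higgs's actual implementation) of the core idea behind context-driven basic block versioning: each block is compiled lazily, once per incoming type context, so a block entered with a variable already known to be an int can omit the dynamic type check that a generic version would need. For a language with fully open-ended dispatch, the worry would be that the number of distinct contexts, and hence versions, grows much faster.

```python
# Toy model of basic block versioning (BBV). Hypothetical sketch only:
# names and structure are illustrative, not taken from Higgs.

version_cache = {}   # (block_name, type_context) -> specialized code
compile_count = 0    # how many specializations were actually generated

def get_version(block_name, type_context):
    """Return a version of the block specialized for the given type
    context, compiling (here: building a closure) only on first request.
    type_context is a hashable tuple of (variable, type) pairs."""
    global compile_count
    key = (block_name, type_context)
    if key in version_cache:
        return version_cache[key]        # re-entry: no recompilation
    compile_count += 1
    known = dict(type_context)
    if block_name == "add" and known.get("x") is int:
        # Specialized version: x was proven to be an int upstream,
        # so no dynamic type check is emitted here.
        fn = lambda x: x + 1
    else:
        # Generic version: must test the type at run time.
        def fn(x):
            return x + 1 if isinstance(x, int) else float(x) + 1.0
    version_cache[key] = fn
    return fn

# Entering the same block under two different type contexts yields two
# versions; re-entering under a known context compiles nothing new.
int_version = get_version("add", (("x", int),))
gen_version = get_version("add", (("x", object),))
```

The open question for a Ruby-like object system is exactly the one raised above: with monkey-patchable `+`, the context would have to capture more than a primitive type tag, and the version count per block could blow up.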
Some notable features of Higgs include:
- A self-hosted runtime written in extended JavaScript
- Lazy/incremental JIT compilation
- Context-driven versioning of basic blocks
- A Foreign Function Interface (FFI) system to interface with C code
- An interactive shell (REPL) with access to low-level primitives.
- A simple module system and a set of useful libraries.
I'm impressed by the BBV idea, but not by the startup-time overhead, which is even worse than perl6's or the JVM's, even if the run time of big loops later ends up faster than V8's.
$ time ./higgs --e "var x = 4; x = x + 5; print(x)"
9
real 0m1.191s
e.g. my own little jitted language, which also compiles to C-like asm with tagged data, but with inner-loop type checks, does it in
$ time bin/potion -e'x = 4, x = x + 5, x print'
9
real 0m0.005s
So I don't buy the compiler-overhead argument yet; type checks are not that slow.
$ time ./perl6-p -e'my $x=4; $x=$x+5; print $x'
9
real 0m0.818s
$ time ./perl6-m -e'my $x=4; $x=$x+5; print $x'
9
real 0m0.614s
and uncompiled:
$ time perl -e'my $x=4; $x=$x+5; print $x'
9
real 0m0.010s
In some air.mozilla.org talk the author said that this research was very unconventional and that, if successful, it could bring generic yet non-marginal perf improvements to a lot of languages.
It's a research project so it produces experience and, well, research that can be useful to commercial enterprises. Lots of companies have an interest in making dynamic languages faster.
The project also has a secondary goal of being a useful hobbyist platform, and it is making progress on that front.