Perché Facebook usa il C ++ accanto a PHP? [chiuso]

15

Qual è la ragione principale per cui Facebook ha dovuto usare C ++ accanto a PHP? Mi chiedo se faccio un sito web con un sacco di visitatori dovrei usare anche il C ++?

    
posta Goma 05.03.2011 - 19:12
fonte

4 risposte

20

Ecco perché: HipHop per PHP: Move Fast :

One of the key values at Facebook is to move fast. For the past six years, we have been able to accomplish a lot thanks to rapid pace of development that PHP offers. As a programming language, PHP is simple. Simple to learn, simple to write, simple to read, and simple to debug. We are able to get new engineers ramped up at Facebook a lot faster with PHP than with other languages, which allows us to innovate faster.

Today I'm excited to share the project a small team of amazing people and I have been working on for the past two years; HipHop for PHP. With HipHop we've reduced the CPU usage on our Web servers on average by about fifty percent, depending on the page. Less CPU means fewer servers, which means less overhead. This project has had a tremendous impact on Facebook. We feel the Web at large can benefit from HipHop, so we are releasing it as open source this evening in hope that it brings a new focus toward scaling large complex websites with PHP. While HipHop has shown us incredible results, it's certainly not complete and you should be comfortable with beta software before trying it out.

HipHop for PHP isn't technically a compiler itself. Rather it is a source code transformer. HipHop programmatically transforms your PHP source code into highly optimized C++ and then uses g++ to compile it. HipHop executes the source code in a semantically equivalent manner and sacrifices some rarely used features — such as eval() — in exchange for improved performance. HipHop includes a code transformer, a reimplementation of PHP's runtime system, and a rewrite of many common PHP Extensions to take advantage of these performance optimizations.

Scaling PHP as a Scripting Language

PHP's roots are those of a scripting language, like Perl, Python, and Ruby, all of which have major benefits in terms of programmer productivity and the ability to iterate quickly on products. This is compared to more traditional compiled languages like C++ and interpreted languages like Java. On the other hand, scripting languages are known to generally be less efficient when it comes to CPU and memory usage. Because of this, it's been challenging to scale Facebook to over 400 billion PHP-based page views every month.

One common way to address these inefficiencies is to rewrite the more complex parts of your PHP application directly in C++ as PHP Extensions. This largely transforms PHP into a glue language between your front end HTML and application logic in C++. From a technical perspective this works well, but drastically reduces the number of engineers who are able to work on your entire application. Learning C++ is only the first step to writing PHP Extensions, the second is understanding the Zend APIs. Given that our engineering team is relatively small — there are over one million users to every engineer — we can't afford to make parts of our codebase less accessible than others.

Scaling Facebook is particularly challenging because almost every page view is a logged-in user with a customized experience. When you view your home page we need to look up all of your friends, query their most relevant updates (from a custom service we've built called Multifeed), filter the results based on your privacy settings, then fill out the stories with comments, photos, likes, and all the rich data that people love about Facebook. All of this in just under a second. HipHop allows us to write the logic that does the final page assembly in PHP and iterate it quickly while relying on custom back-end services in C++, Erlang, Java, or Python to service the News Feed, search, Chat, and other core parts of the site.

Since 2007 we've thought about a few different ways to solve these problems and have even tried implementing a few of them. The common suggestion is to just rewrite Facebook in another language, but given the complexity and speed of development of the site this would take some time to accomplish. We've rewritten aspects of the Zend Engine — PHP's internals — and contributed those patches back into the PHP project, but ultimately haven't seen the sort of performance increases that are needed. HipHop's benefits are nearly transparent to our development speed.

Hacking Up HipHop

One night at a Hackathon a few years ago (see Prime Time Hack), I started my first piece of code transforming PHP into C++. The languages are fairly similar syntactically and C++ drastically outperforms PHP when it comes to both CPU and memory usage. Even PHP itself is written in C. We knew that it was impossible to successfully rewrite an entire codebase of this size by hand, but wondered what would happen if we built a system to do it programmatically.

Finding new ways to improve PHP performance isn't a new concept. At run time the Zend Engine turns your PHP source into opcodes which are then run through the Zend Virtual Machine. Open source projects such as APC and eAccelerator cache this output and are used by the majority of PHP powered websites. There's also Zend Server, a commercial product which makes PHP faster via opcode optimization and caching. Instead, we were thinking about transforming PHP source directly into C++ which can then be turned into native machine code. Even compiling PHP isn't a new idea, open source projects like Roadsend and phc compile PHP to C, Quercus compiles PHP to Java, and Phalanger compiles PHP to .Net.

Needless to say, it took longer than that single Hackathon. Eight months later, I had enough code to demonstrate it is indeed possible to run faster with compiled code. We quickly added Iain Proctor and Minghui Yang to the team to speed up the pace of the project. We spent the next ten months finishing up all the coding and the following six months testing on production servers. We are proud to say that at this point, we are serving over 90% of our Web traffic using HipHop, all only six months after deployment.

How HipHop Works

The main challenge of the project was bridging the gap between PHP and C++. PHP is a scripting language with dynamic, weak typing. C++ is a compiled language with static typing. While PHP allows you to write magical dynamic features, most PHP is relatively straightforward. It's more likely that you see if (...) {...} else {..} than it is to see function foo($x) { include $x; }. This is where we gain in performance. Whenever possible our generated code uses static binding for functions and variables. We also use type inference to pick the most specific type possible for our variables and thus save memory.

The transformation process includes three main steps:

  1. Static analysis where we collect information on who declares what and dependencies,
  2. Type inference where we choose the most specific type between C++ scalars, String, Array, classes, Object, and Variant, and
  3. Code generation which for the most part is a direct correspondence from PHP statements and expressions to C++ statements and expressions.

We have also developed HPHPi, which is an experimental interpreter designed for development. When using HPHPi you don't need to compile your PHP source code before running it. It's helped us catch bugs in HipHop itself and provides engineers a way to use HipHop without changing how they write PHP.

Overall HipHop allows us to keep the best aspects of PHP while taking advantage of the performance benefits of C++. In total, we have written over 300,000 lines of code and more than 5,000 unit tests.

    
risposta data 05.03.2011 - 19:31
fonte
13

Risposta breve - no, non bisogno di C ++.

Credo che tu ti stia riferendo a HipHop . Se leggi i miglioramenti che dà, è circa il 50%. Questo è molto importante se sei Facebook che ha migliaia di server frontend. Ridurne il carico significa milioni di risparmi sui costi dei data center. Nel caso di una società con server di frontend 10-100, il costo dello sviluppo supererebbe di gran lunga i possibili risparmi. In ogni caso, l'elaborazione più pesante viene eseguita nel back-end, che di solito è una soluzione RDBMS o NoSQL, sviluppata come codice C / C ++ altamente ottimizzato.

Ovviamente si potrebbero ottenere risparmi molto più alti scaricando completamente PHP. Ma non è qualcosa che puoi fare con una base di codice enorme, come nel caso di Facebook.

Ora, per l'altra parte della domanda: se vuoi davvero sapere come gestire enormi carichi, leggi il blog ad alta scalabilità , in particolare la parte di Real Life Architectures.

È possibile con PHP, ma sicuramente non sarebbe la mia scelta. Se vuoi un linguaggio dinamico, Python, Ruby o forse Lua sarebbe una scelta molto migliore.

    
risposta data 05.03.2011 - 19:36
fonte
5

La cosa bella dell'approccio di FB è che non hanno dovuto decidere subito. Dovresti fare lo stesso. Scegli la lingua che ti rende più produttivo, ma assicurati che si interfaccia facilmente con C / C ++.

Una volta che hai milioni di utenti e hai bisogno di ridurre i millisecondi dei tempi di risposta, puoi ottimizzare i percorsi critici usando C / C ++.

    
risposta data 06.03.2011 - 06:40
fonte
0

L'elemento C ++ utilizzato da Facebook è l'HHVM.

Questa è la macchina virtuale HIPHOP. Sebbene scritto in C ++ è essenzialmente un "interprete PHP migliore".

Il codice php viene compilato in un codice byte che viene quindi interpretato dall'HHVM al momento dell'esecuzione ed è soggetto alle ottimizzazioni "Just In Time" durante l'esecuzione.

Gli sviluppatori di face book scrivono ancora per lo più in PHP, e quello che vedi sul tuo browser è quasi certamente prodotto da un programma PHP, il tutto sul quale è stato compilato e interpretato dal runtime di HHVM.

L'attuale HHVM è liberamente disponibile open source. Quindi se hai bisogno di servire un milione di foto di gattini per ora, puoi scaricare HHVM e velocizzare il tuo server con una modifica minima al tuo PHP.

    
risposta data 22.04.2014 - 11:43
fonte

Leggi altre domande sui tag