views:

2053

answers:

12

Everything I read about better PHP coding practices keeps saying don't use require_once because of speed.

Why is this?

What is the proper/better way to do the same thing as require_once?

(If it matters, I'm using PHP 5.)

+3  A: 

A better way to do things is to use an object-oriented approach and use __autoload().
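As a rough, self-contained sketch of that approach: the answer names __autoload(), but since PHP 5.1.2 the preferred way is to register a loader callback with spl_autoload_register(). The temp-directory layout and the Greeter class are invented for the demo, not taken from the manual.

```php
<?php
// Autoload sketch. A real project would map class names to files by a
// naming convention; here we fake a one-class file in a temp directory
// so the example runs on its own.
$GLOBALS['classDir'] = sys_get_temp_dir() . '/autoload_demo';
@mkdir($GLOBALS['classDir']);
file_put_contents($GLOBALS['classDir'] . '/Greeter.php',
    "<?php class Greeter { function hi() { return 'hi'; } }\n");

spl_autoload_register(function ($class) {
    // Called only for classes that aren't defined yet, so no
    // require_once bookkeeping is needed.
    require $GLOBALS['classDir'] . '/' . $class . '.php';
});

$g = new Greeter();   // triggers the loader for 'Greeter'
echo $g->hi(), "\n";  // hi
```

The loader fires at most once per class, which is exactly the de-duplication require_once otherwise does by hand.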

Greg
but the very first example in the autoloading objects page you linked to uses require_once
Shabbyrobe
I think require would work there too (not tested though)
Greg
I don't buy this. There are MANY situations in which OO does not fit as appropriately as other paradigms, so you shouldn't force it just to gain whatever tiny advantages there may be with __autoload().
Bobby Jack
you'd think that autoload would actually take longer than *_once (assuming that you're only require'ing what you need).
nickf
+17  A: 

require_once and include_once require that the system keep a log of what has already been included/required, and each time there's another include_once/require_once statement, it has to check that log.

On a computational basis, I can see there's extra work and resources going into doing that, but is it enough to hurt the speed of the whole app? I really doubt it... not unless you're on really old hardware.

If you're really concerned about it, the alternative is doing the work yourself in as lightweight a manner as you can get away with. For simple apps, just making sure you've only included it once should suffice, but if you're still getting redefine errors, you could do something like this:

if (!defined('MyIncludeName')) {   // has this file been pulled in yet?
    require('MyIncludeName');      // ('MyIncludeName' doubles as the path here)
    define('MyIncludeName', 1);    // mark it as included
}

It's not great and it'll junk up your code but it's light. Personally, I'll stick with the ..._once statements.
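To see the guard actually skip repeat loads, here's a self-contained variant using a temp file (the file name, counter and constant name are invented for the demo):

```php
<?php
// Each load of the temp header bumps a counter; the define() guard
// should let only the first require through.
$file = tempnam(sys_get_temp_dir(), 'hdr');
file_put_contents($file, "<?php \$GLOBALS['hdrLoads']++;\n");
$GLOBALS['hdrLoads'] = 0;

for ($i = 0; $i < 3; $i++) {
    if (!defined('MyHeaderLoaded')) {
        require $file;
        define('MyHeaderLoaded', 1);  // mark it so later passes skip it
    }
}
echo $GLOBALS['hdrLoads'], "\n";  // 1
unlink($file);
```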

Oli
I doubt your defined() method is any faster than the built-in lookup table, but I agree with your overall point - surely a non-issue?!
Bobby Jack
I'm fairly sure you're right Bobby but I'm not advocating the defines over _once. It's just an option. The time it would take to interpret the code might even make it marginally slower but, that said, I don't know how thorough the internal method is. It might do extra work to ensure no duplicates.
Oli
The other downside is APC doesn't cache include_once and require_once calls IIRC
dcousineau
+11  A: 

Can you give us any links to these coding practices which say to avoid it? As far as I'm concerned, it's a complete non-issue. I haven't looked at the source code myself, but I'd imagine that the only difference between include and include_once is that include_once adds that filename to an array and checks over the array each time. It'd be easy to keep that array sorted, so searching over it should be O(log n), and even a medium-largish application would only have a couple of dozen includes.

nickf
One is http://www.chazzuka.com/blog/?p=163. They didn't really say 'not to', but too many 'expensive' things add up. And actually, all included/required files are added to an internal array (there's a function to return it); I guess the _once's have to loop over that array and do strcmp's, which would add up.
Uberfuzzy
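The internal list Uberfuzzy mentions is exposed by get_included_files(). A quick self-contained check (the temp-file name and counter are arbitrary):

```php
<?php
// require_once consults PHP's internal log of included files, which
// get_included_files() exposes; the second require_once below is a no-op.
$tmp  = tempnam(sys_get_temp_dir(), 'inc');
file_put_contents($tmp, "<?php \$GLOBALS['loads']++;\n");
$real = realpath($tmp);          // the log stores resolved paths
$GLOBALS['loads'] = 0;
require_once $tmp;
require_once $tmp;               // skipped: already in the log
echo $GLOBALS['loads'], "\n";    // 1
echo in_array($real, get_included_files()) ? "logged\n" : "not logged\n";
unlink($tmp);
```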
A: 

I think in the PEAR documentation there is a recommendation for require, require_once, include and include_once. I do follow that guideline; it makes your application clearer.

Ekkmanz
A: 

You should test it yourself, using include, Oli's alternative and __autoload(), and test it with something like APC installed. I doubt using a constant will speed things up.

Dinoboff
A: 

My personal opinion is that using require_once (or include_once) is bad practice: require_once checks for you whether you have already included that file, and suppresses the fatal errors (duplicate declarations of functions/classes/etc.) that double-including a file would otherwise cause.

You should know if you need to include a file.

Joe Scylla
+2  A: 

The *_once() functions stat every parent directory to ensure the file you're including isn't the same as one that's already been included. That's part of the reason for the slowdown.

I recommend using a tool like Siege for benchmarking. You can try all the suggested methodologies and compare response times.

More on require_once() at Tech Your Universe.

Adam Backstrom
Thanks for the pointer to the article. require_once() is a good safety belt over double-including files, and we'll continue to use it, but being able to make it clean is nice.
Andy Lester
A: 

Yes, it is slightly more expensive than plain ol' require(). I think the point is that if you can keep your code organized enough to not duplicate includes, don't use the *_once() functions, as it will save you some cycles.

But using the _once() functions isn't going to kill your app. Basically, just don't use it as an excuse to not have to organize your includes. In some cases, using it is still unavoidable, and it's not a big deal.

Lucas Oman
+1  A: 

The PEAR2 wiki lists good reasons for abandoning all the require/include directives in favor of autoloading, at least for library code. These tie you down to rigid directory structures when alternative packaging models like phar are on the horizon.

mrclay
+8  A: 

I got curious and checked out Adam Backstrom's link to Tech Your Universe. The article describes one of the reasons require should be used instead of require_once; however, its claims didn't hold up under my analysis. I'd be interested in seeing where my analysis may have gone wrong. I used PHP 5.2.0 for the comparisons.

I started out by creating 100 header files that used require_once to include another header file. Each of these files looked something like:

<?php
// /home/fbarnes/phpperf/hdr0.php
require_once "../phpperf/common_hdr.php";

?>

I created these using a quick bash hack:

for i in /home/fbarnes/phpperf/hdr{00..99}.php; do
  echo "<?php
// $i" > $i
  cat helper.php >> $i;
done

This way I could easily swap between using require_once and require when including the header files. I then created an app.php to load the one hundred files. This looked like:

<?php

// Load all of the php hdrs that were created previously
for($i=0; $i < 100; $i++)
{
  require_once "/home/fbarnes/phpperf/hdr$i.php";
}

// Read the /proc file system to get some simple stats
$pid = getmypid();
$fp = fopen("/proc/$pid/stat", "r");
$line = fread($fp, 2048);
$array = split(" ", $line);

// write out the statistics; on RedHat 4.5 w/ kernel 2.6.9
// 14 is user jiffies; 15 is system jiffies
$cntr = 0;
foreach($array as $elem)
{
  $cntr++;
  echo "stat[$cntr]: $elem\n";
}
fclose($fp);

?>

I contrasted the require_once headers with require headers that used a header file looking like:

<?php
// /home/fbarnes/phpperf/h/hdr0.php
if(!defined('CommonHdr'))
{
  require "../phpperf/common_hdr.php";
  define('CommonHdr', 1);
}

?>

I didn't find much difference when running this with require vs. require_once. In fact, my initial tests seemed to imply that require_once was slightly faster, but I don't necessarily believe that. I repeated the experiment with 10,000 input files, and there I did see a consistent difference. I ran the test multiple times; the results are close, but require_once averages 30.8 user jiffies and 72.6 system jiffies, while require averages 39.4 user jiffies and 72.0 system jiffies. So the CPU load appears slightly lower with require_once, but the wall-clock time is slightly higher: the 10,000 require_once calls take 10.15 seconds on average, and the 10,000 require calls take 9.84 seconds on average.

Next step is to look into these differences. I used strace to analyse the system calls that are being made.

Before opening a file from require_once the following system calls are made:

time(NULL)                              = 1223772434
lstat64("/home", {st_mode=S_IFDIR|0755, st_size=4096, ...}) = 0
lstat64("/home/fbarnes", {st_mode=S_IFDIR|0755, st_size=4096, ...}) = 0
lstat64("/home/fbarnes/phpperf", {st_mode=S_IFDIR|0755, st_size=4096, ...}) = 0
lstat64("/home/fbarnes/phpperf/h", {st_mode=S_IFDIR|0755, st_size=270336, ...}) = 0
lstat64("/home/fbarnes/phpperf/h/hdr0.php", {st_mode=S_IFREG|0644, st_size=88, ...}) = 0
time(NULL)                              = 1223772434
open("/home/fbarnes/phpperf/h/hdr0.php", O_RDONLY) = 3

This contrasts with require:

time(NULL)                              = 1223772905
lstat64("/home", {st_mode=S_IFDIR|0755, st_size=4096, ...}) = 0
lstat64("/home/fbarnes", {st_mode=S_IFDIR|0755, st_size=4096, ...}) = 0
lstat64("/home/fbarnes/phpperf", {st_mode=S_IFDIR|0755, st_size=4096, ...}) = 0
lstat64("/home/fbarnes/phpperf/h", {st_mode=S_IFDIR|0755, st_size=270336, ...}) = 0
lstat64("/home/fbarnes/phpperf/h/hdr0.php", {st_mode=S_IFREG|0644, st_size=146, ...}) = 0
time(NULL)                              = 1223772905
open("/home/fbarnes/phpperf/h/hdr0.php", O_RDONLY) = 3

Tech Your Universe implies that require_once should make more lstat64 calls. However, they both make the same number of lstat64 calls. Possibly, the difference is that I am not running APC to optimize the code above. However, the next thing I did was compare the output of strace for the entire runs:

[fbarnes@myhost phpperf]$ wc -l strace_1000r.out strace_1000ro.out 
  190709 strace_1000r.out
  210707 strace_1000ro.out
  401416 total

Effectively there are approximately two more system calls per header file when using require_once. One difference is that require_once has an additional call to the time() function:

[fbarnes@myhost phpperf]$ grep -c time strace_1000r.out strace_1000ro.out 
strace_1000r.out:20009
strace_1000ro.out:30008

The other system call is getcwd():

[fbarnes@myhost phpperf]$ grep -c getcwd strace_1000r.out strace_1000ro.out 
strace_1000r.out:5
strace_1000ro.out:10004

This is called because I used relative paths in the hdrXXX files. If I make them absolute references, then the only difference is the additional time(NULL) call made in the code:

[fbarnes@myhost phpperf]$ wc -l strace_1000r.out strace_1000ro.out 
  190705 strace_1000r.out
  200705 strace_1000ro.out
  391410 total
[fbarnes@myhost phpperf]$ grep -c time strace_1000r.out strace_1000ro.out
strace_1000r.out:20008
strace_1000ro.out:30008

This seems to imply that you could reduce the number of system calls by using absolute paths rather than relative paths. The only difference outside of that is the time(NULL) calls which appear to be used for instrumenting the code to compare what is faster.
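A common way to get an absolute reference is to build the include path from dirname(__FILE__) (the PHP 5 spelling of what later became __DIR__); common_hdr.php below stands in for the benchmark's hypothetical header:

```php
<?php
// __FILE__ is always a fully resolved path, so anchoring includes to
// dirname(__FILE__) yields absolute references and sidesteps the
// per-include getcwd() calls seen in the strace output.
$hdr = dirname(__FILE__) . '/common_hdr.php';  // hypothetical header file
echo preg_match('#^(/|[A-Za-z]:)#', $hdr) ? "absolute\n" : "relative\n";
```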

One other note: the APC optimization package has an option called "apc.include_once_override" that claims to reduce the number of system calls made by require_once and include_once calls (see the PHP docs).

Sorry for the long post. I got curious.

terson
That's very interesting - thank you, terson!
HoboBen
And any "optimization" that you have to run 10,000 times to see such a minuscule difference is not even worth worrying about. Use a profiler and find out where the *real* bottlenecks are in your application. I doubt this question is the bottleneck.
DGM
+24  A: 

This thread makes me cringe, because there's already been a "solution posted", and it's, for all intents and purposes, wrong. Let's enumerate:

  1. Defines are really expensive in PHP. You can look it up or test it yourself, but the only efficient way of defining a global constant in PHP is via an extension. (Class constants are actually pretty decent performance wise, but this is a moot point, because of 2)

  2. If you are using require_once() appropriately, that is, for inclusion of classes, you don't even need a define; just check if class_exists('Classname'). If the file you are including contains code, i.e. you're using it in the procedural fashion, there is absolutely no reason that require_once() should be necessary for you; each time you include the file you presume to be making a subroutine call.

So for a while, a lot of people did use the class_exists() method for their inclusions. I don't like it because it's fugly, but they had good reason to: require_once() was pretty inefficient before some of the more recent versions of PHP. But that's been fixed, and it is my contention that the extra bytecode you'd have to compile for the conditional, and the extra method call, would by far overweigh any internal hashtable check.
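For reference, the class_exists() guard described above looks something like this; the class and file names are invented for the demo, and a temp file keeps it self-contained:

```php
<?php
// The guard some codebases used instead of require_once: only include
// the file if its class isn't defined yet. The second argument (false)
// stops class_exists() from triggering the autoloader during the check.
$file = tempnam(sys_get_temp_dir(), 'cls');
file_put_contents($file, "<?php class DemoWidget {}\n");

if (!class_exists('DemoWidget', false)) {
    require $file;
}
if (!class_exists('DemoWidget', false)) {
    require $file;   // never reached; would be a duplicate declaration
}
echo class_exists('DemoWidget', false) ? "loaded once\n" : "missing\n";
unlink($file);
```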

Now, an admission: this stuff is tough to test for, because it accounts for so little of the execution time.

Here is the question you should be thinking about: includes, as a general rule, are expensive in PHP, because every time the interpreter hits one it has to switch back into parse mode, generate the opcodes, and then jump back. If you have 100+ includes, this will definitely have a performance impact. The reason why using or not using require_once is such an important question is because it makes life difficult for opcode caches. An explanation for this can be found here, but what this boils down to is that:

  • If during parse time, you know exactly what include files you will need for the entire life of the request, require() those at the very beginning and the opcode cache will handle everything else for you.

  • If you are not running an opcode cache, you're in a hard place. Inlining all of your includes into one file (don't do this during development, only in production) can certainly help parse time, but it's a pain to do, and also, you need to know exactly what you'll be including during the request.

  • Autoload is very convenient, but slow, for the reason that the autoload logic has to be run every time an include is done. In practice, I've found that autoloading several specialized files for one request does not cause too much of a problem, but you should not be autoloading all of the files you will need.

  • If you have maybe 10 includes (this is a very back of the envelope calculation), all this wanking is not worth it: just optimize your database queries or something.

Edward Z. Yang
A: 

It has nothing to do with speed. It's about failing gracefully.

If require_once() fails, your script is done; nothing else is processed. If you use include_once(), the rest of your script will try to continue to render, so your users could potentially be none the wiser about something that has failed in your script.

ashchristopher
Not necessarily. You can actually hook in an error handler or on shutdown handler to give the user a nice error page (although people rarely do). As a developer, I would much rather things error immediately.
Edward Z. Yang
Or, as the case may be, *not* failing gracefully - if some vital file isn't require()'d properly, it's a good idea just to give up and halt. But that's require vs include, whereas I think the question is more focused on require vs require_once.
HoboBen