English 中文(简体)
Algorithm to parse a config file in php (Doxygen file)
原标题:

I have a conf file like this: http://pastie.org/768582 and my goal is to get in an array the comments and the key/value of each keys.

array( array(

     comment  => "The PROJECT_NAME tag is a single",

     key  => "PROJECT_NAME",

     value  => "JMK",
),

)

I would know what algoritm do I have to use?

I have already transform the content of the configuration file to an array (line by line) with explode() function.

Now I am trying to get all the comment lines while next line begins with # and the couple key/value but it is here that I have trouble.

If someone have an idea it would be nice. Thx.

问题回答

This will get you the key/value pairs, but not the comments:

$options = array();

foreach ($line as $l)
{
  $l = trim($l);
  if (strlen($l) && substr($l, 0, 1) !=  # )
  {
    list($key, $value) = explode("=", $l);

    // remove whitespace from the end of the config key
    $key = rtrim($key);

    $options[$key] = $value;
  }
}

here s one way

$content = file_get_contents("file");
$s = preg_split("/#--*/",$content);
$y = preg_split("/

/",end($s));
for($i=0;$i<count($y)-1;$i++){
    if ($y[$i]){
        if (strpos($y[$i],"#")!==FALSE){
            $comment="$y[$i]
";
            $conf=$y[$i+1];
            $cs = array_map(trim,explode("=",$conf));
            $A["comment"]=$comment;
            $A["key"]=$cs[0];
            $A["value"]=$cs[1];
            $TA[]=$A;
        }
    }
}
print_r($TA);

output

Array
(
    [0] => Array
        (
            [comment] => # This tag specifies the encoding used for all characters in the config file
# that follow. The default is UTF-8 which is also the encoding used for all
# text before the first occurrence of this tag. Doxygen uses libiconv (or the
# iconv built into libc) for the transcoding. See
# http://www.gnu.org/software/libiconv for the list of possible encodings.

            [key] => DOXYFILE_ENCODING
            [value] => UTF-8
        )

    [1] => Array
        (
            [comment] => # The PROJECT_NAME tag is a single word (or a sequence of words surrounded
# by quotes) that should identify the project.

            [key] => PROJECT_NAME
            [value] => JMK
        )

    [2] => Array
        (
            [comment] => # The PROJECT_NUMBER tag can be used to enter a project or revision number.
# This could be handy for archiving the generated documentation or
# if some version control system is used.

            [key] => PROJECT_NUMBER
            [value] => 10
        )

)

You could try parse_ini_file(), the format looks compatible. It won t process the comments, though.





相关问题
How to add/merge several Big O s into one

If I have an algorithm which is comprised of (let s say) three sub-algorithms, all with different O() characteristics, e.g.: algorithm A: O(n) algorithm B: O(log(n)) algorithm C: O(n log(n)) How do ...

Grokking Timsort

There s a (relatively) new sort on the block called Timsort. It s been used as Python s list.sort, and is now going to be the new Array.sort in Java 7. There s some documentation and a tiny Wikipedia ...

Manually implementing high performance algorithms in .NET

As a learning experience I recently tried implementing Quicksort with 3 way partitioning in C#. Apart from needing to add an extra range check on the left/right variables before the recursive call, ...

Print possible strings created from a Number

Given a 10 digit Telephone Number, we have to print all possible strings created from that. The mapping of the numbers is the one as exactly on a phone s keypad. i.e. for 1,0-> No Letter for 2->...

Enumerating All Minimal Directed Cycles Of A Directed Graph

I have a directed graph and my problem is to enumerate all the minimal (cycles that cannot be constructed as the union of other cycles) directed cycles of this graph. This is different from what the ...

Quick padding of a string in Delphi

I was trying to speed up a certain routine in an application, and my profiler, AQTime, identified one method in particular as a bottleneck. The method has been with us for years, and is part of a "...

热门标签