我将网页的源头复制为文本文件,我很难从文件中找到两个数据点:纬度和经度。
我要做的php文件 和扫描文件是这样的:
<?php
$ch = curl_init("http://www.marinetraffic.com/ais/shipdetails.aspx?MMSI=258245000");
$fp = fopen("example_homepage.txt", "w");
curl_setopt($ch, CURLOPT_FILE, $fp);
curl_setopt($ch, CURLOPT_HEADER, 0);
curl_exec($ch);
curl_close($ch);
fclose($fp);
header( Content-Type: text/plain );
$myFile = "example_homepage.txt";
$fh = fopen($myFile, r );
$theData = fread($fh, 9251);
fclose($fh);
echo $theData;
?>
gps 被埋在类似文字的文字中( 从文件示例_ homepage. txt 中) :
<img style="border: 1px solid #aaa" src="flags/NO.gif" />
<br/>
<b>Call Sign:</b>LAJW
<br/>
<b>IMO:</b>9386380,
<b>MMSI:</b>258245000
<br/>
<hr/>
<h2>Last Position Received</h2>
<b>Area:</b>North Sea
<br/>
<b>Latitude / Longitude:</b>
<a href= default.aspx?mmsi=258245000¢erx=5.311533¢ery=60.39997&zoom=10&type_color=9 >60.39997˚ / 5.311533˚ (Map)</a>
<br/>
<b>Currently in Port:</b>
<a href= default.aspx?centerx=5.32245¢ery=60.39085&zoom=14 >BERGEN</a>
<br/>
<b>Last Known Port:</b>
</b>
<a href= default.aspx?centerx=5.32245¢ery=60.39085&zoom=14 >BERGEN</a>
<br/>
<b>Info Received:</b>0d 0h 20min ago
<br/>
<table>
<tr>
<td>
<img src="shipicons/magenta0.png" />
</td>
<td>
<a href= default.aspx?mmsi=258245000¢erx=5.311533¢ery=60.39997&zoom=10&type_color=9 ><b>Current Vessel s Track</b></a>
</td>
</tr>
<tr>
<td>
<img src="windicons/w05_330.png" />
</td>
<td>
<b>Wind:</b>5 knots, 327°, 13°C</td>
</tr>
</table>
<a href= datasheet.aspx?datasource=ITINERARIES&MMSI=258245000 ><b>Itineraries History</b></a>
<br/>
<hr/>
<h2>Voyage Related Info (Last Received)</h2>
<b>Draught:</b>6.8 m
<br/>
<b>Destination:</b>BERGEN HAVN
<br/>
<b>ETA:</b>2012-05-22 18:00
<br/>
<b>Info Received:</b>2012-05-23 18:43 (
我要的两个数字是:
latitude: 60.39085 longitude: 5.32245
我对这种事情没有经验,也许有更好的办法,请告诉我。
使用最后三行代码后, 我就能在文本文件中找到第9251个字符。