file_get_contents伪造user_agent 方法如下:
ini_set('user_agent','Mozilla/4.0 (compatibleMSIE 6.0Windows NT 5.1SV1.NET CLR 2.0.50727http://www.baidu.com)')
curl伪造user_agent的方法:
curl_setopt($c, CURLOPT_USERAGENT, 'Mozilla/4.0 (compatibleMSIE 6.0Windows NT 5.1SV1.NET CLR 2.0.50727http://www.baidu.com)')
附带上curl的完整函数方法:
function curl_get_file_contents($URL)
{
$c = curl_init()
curl_setopt($c, CURLOPT_RETURNTRANSFER, 1)
//curl_setopt($c, CURLOPT_HEADER, 1)//输出远程服务器的header信息
curl_setopt($c, CURLOPT_USERAGENT, 'Mozilla/4.0 (compatibleMSIE 6.0Windows NT 5.1SV1.NET CLR 2.0.50727http://www.baidu.com)')
curl_setopt($c, CURLOPT_URL, $URL)
$contents = curl_exec($c)
curl_close($c)
if ($contents) {return $contents}
else {return FALSE}
}
这样就可以抓取了,user_agent是可以自己修改的。
在curl里可以设置UA
<?php//client
$ch = curl_init()
curl_setopt_array($ch,
array(
CURLOPT_URL => 'http://localhost/ua.php',
CURLOPT_USERAGENT => "YeRenChai_v1.0",
CURLOPT_RETURNTRANSFER => True,
CURLOPT_FOLLOWLOCATION => True,
)
)
$response = curl_exec($ch)
if(!$response) exit(curl_error($ch))
var_dump($response)
?> <?php //server
echo $_SERVER['HTTP_USER_AGENT']
?>
欢迎分享,转载请注明来源:内存溢出
评论列表(0条)