
PHP Web Scraping: Fetching Open Graph (OG) Meta Tags


Chung
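I dug through a lot of articles on Google...

and finally found one that did what I needed.

I then spent ages testing it, puzzled that it could fetch pages on the internal network but not data from external sites.

After all that, it turned out that the 補夢網 shared hosting I use cannot fetch data from external sites.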
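
The post does not say why the host blocks external requests. Two common causes on shared hosting are a disabled `allow_url_fopen` setting and an outbound firewall, and that is only an assumption here. The sketch below checks the first kind of cause from inside PHP. The helper name `describe_fetch_support` is made up for illustration, and a firewall rule would not show up in this report.

```php
<?php
// Report which outbound-fetch mechanisms this PHP installation exposes.
// Hypothetical helper, not part of the original post.
function describe_fetch_support(): array {
    return [
        // file_get_contents() can only open http(s):// URLs when this is on.
        'allow_url_fopen' => filter_var(ini_get('allow_url_fopen'), FILTER_VALIDATE_BOOLEAN),
        // cURL is a common alternative when allow_url_fopen is off.
        'curl' => function_exists('curl_init'),
        // https:// URLs through stream wrappers also need the openssl extension.
        'openssl' => extension_loaded('openssl'),
    ];
}

foreach (describe_fetch_support() as $name => $available) {
    echo $name, ': ', $available ? 'yes' : 'no', PHP_EOL;
}
```

If everything reports "yes" and external fetches still fail, the restriction is probably enforced at the network level, and only the hosting provider can confirm that.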

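Sigh. Overseas hosting is more of a hassle to communicate with, but it comes with far fewer restrictions.

You can test whatever feature you feel like playing with, and it is much cheaper too. Domestic hosting is convenient, but the service still falls short.

Here is the example, recorded for future reference.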

First, the PHP code:

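function get_tags($url) {
    // Start with an empty array so pages without OG tags still return an array.
    $list = array();

    // Fetch the page; file_get_contents() returns false on failure.
    $html = file_get_contents($url);
    if ($html === false || $html === '') {
        return $list;
    }

    // Collect libxml parse warnings internally instead of printing them.
    libxml_use_internal_errors(true);
    $dom = new DOMDocument();
    $dom->loadHTML($html);
    libxml_clear_errors();

    // Select every <meta> element whose property attribute starts with "og:".
    $xpath = new DOMXPath($dom);
    $query = "//meta[starts-with(@property, 'og:')]";
    $result = $xpath->query($query);

    foreach ($result as $meta) {
        $property = $meta->getAttribute('property');
        $content = $meta->getAttribute('content');

        // Drop the leading "og:" prefix so the keys read "title", "image", ...
        $property = substr($property, 3);
        $list[$property] = $content;
    }
    return $list;
}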
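
Since the function above needs network access, it is hard to try out on a host that blocks external requests. As a rough sketch, the parsing step can be exercised on its own with an inline HTML string. The helper name `parse_og_tags` and the sample markup are made up for illustration; the XPath query is the one from the post.

```php
<?php
// Hypothetical helper: the same XPath query as get_tags(), but it takes
// the HTML as a string instead of downloading it.
function parse_og_tags(string $html): array {
    $tags = [];
    if ($html === '') {
        return $tags; // loadHTML() rejects an empty string
    }

    // Silence libxml warnings about imperfect markup, then restore the old setting.
    $previous = libxml_use_internal_errors(true);
    $dom = new DOMDocument();
    $dom->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $xpath = new DOMXPath($dom);
    foreach ($xpath->query("//meta[starts-with(@property, 'og:')]") as $meta) {
        // Strip the 3-character "og:" prefix from the property name.
        $key = substr($meta->getAttribute('property'), 3);
        $tags[$key] = $meta->getAttribute('content');
    }
    return $tags;
}

// Made-up sample page: two OG tags and one ordinary meta tag.
$sample = <<<HTML
<html><head>
<meta property="og:title" content="Example Domain">
<meta property="og:type" content="website">
<meta name="description" content="not an OG tag">
</head><body></body></html>
HTML;

print_r(parse_og_tags($sample));
```

Only the two `og:` entries should come back, keyed as `title` and `type`; the plain `description` tag is skipped because it has no `property` attribute.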

Next, how to use it:

$url = 'http://www.example.com/';
$meta = get_tags($url);

// Print each common OG field; a page may omit some, so fall back to ''.
foreach (array('locale', 'type', 'title', 'description', 'url', 'site_name', 'image') as $key) {
    echo $key . ': ' . ($meta[$key] ?? '') . "\n";
}
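
The tag values come from someone else's page, so if they are echoed into your own HTML it is probably worth escaping them first. A minimal sketch, assuming the output goes into an HTML page; the helper `og_field` and the sample `$meta` array are made up for illustration.

```php
<?php
// Hypothetical result, standing in for what get_tags() might return.
$meta = [
    'title' => 'Tom & Jerry <Official>',
    'type'  => 'website',
];

// Return one field escaped for HTML output, or a default when the tag is absent.
function og_field(array $meta, string $key, string $default = ''): string {
    $value = $meta[$key] ?? $default;
    return htmlspecialchars($value, ENT_QUOTES, 'UTF-8');
}

foreach (['title', 'type', 'image'] as $key) {
    echo $key, ': ', og_field($meta, $key, '(not set)'), PHP_EOL;
}
```

Escaping at output time keeps the stored values untouched, so the same array can still be used for non-HTML purposes.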


Original article: http://9bugs.in/get-facebook-open-graphog-meta-tags-php-234
https://innstory.com/story-PHP網路爬蟲之抓取Open_GraphOG_Meta_tags-195


About the Author

Chung

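I'm Chung,
a web professional.
My main job is developing and building website systems;
my side job is dreaming and writing stories.
For my work, see teme.biz
For my dreams, see innstory.com
Contact: chung.teme@gmail.com

#Some people keep a diary with a pen, some with the passing years, some with photographs. I keep mine with innstory.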
