guardian.co.uk articles not entirely correct
  • steve June 2011

    http://www.guardian.co.uk/commentisfree/2011/jun/17/women-somalia-hell-worst-world

    "title"=>"The women of Somalia are living in hell | Maryan Qasim | Comment is free"

    expecting: "The women of Somalia are living in hell"

    "author" => nil

    expecting: "Maryan Qasim"

    "text"=>"A six-year-old girl undergoes female genital mutilation in Somalia – which 95% of girls aged 4 to 11 face there. Photograph: Jean-Marc Bouju/AP\nI recently learned of a poll showing the worst places in the world to be a woman. To my surprise, Somalia was ranked 5th. For me, the situation of women in Somalia stands as the worst in the world.\nMogadishu is a living hell for women struggling to feed their children amid war, drought, famine and utter devastation.

    Note: the article text continues to the end, but I didn't want to paste it all in here.

    expecting: "For me, the situation of women in Somalia stands as the worst in the world.\nMogadishu is a living hell for women struggling to feed their children amid war, drought, famine and utter devastation."

    If you look at the article itself, you can see that diffbot grabs the caption text from the image included with the article and incorporates it into the article text. Ideally, diffbot would exclude image caption text.
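
Until this is fixed on the diffbot side, a client-side workaround might look like the sketch below. It is only a rough idea, not anything diffbot provides: `clean_guardian_article` is a hypothetical helper name, and it assumes the title always follows the "Headline | Author | Section" layout seen above and that the caption is the first line of the text and contains "Photograph:".

```ruby
# Hypothetical post-processing helper; NOT part of diffbot's API.
# Assumptions (taken from the single example in this post):
#   - the title looks like "Headline | Author | Section"
#   - the image caption is the first line of "text" and contains "Photograph:"
def clean_guardian_article(article)
  parts = article["title"].to_s.split(" | ")
  lines = article["text"].to_s.split("\n")

  # Drop the leading line only when it looks like an image caption.
  lines.shift if lines.first && lines.first.include?("Photograph:")

  # Keep an author diffbot already found; otherwise fall back to the
  # middle title segment, but only when the title has all three segments.
  author = article["author"] || (parts.length >= 3 ? parts[1] : nil)

  article.merge(
    "title"  => parts[0],
    "author" => author,
    "text"   => lines.join("\n")
  )
end
```

Calling it on the hash diffbot returned for the article above should give the headline alone as "title", "Maryan Qasim" as "author", and a "text" that starts at "I recently learned of a poll". Other Guardian sections may format titles or captions differently, so the two assumptions would need checking against more articles.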
