Can anyone please post code for extracting content from an HTML document?
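To make the request concrete, here is a rough sketch of the kind of thing being asked for. It assumes "content extraction" means pulling the visible text out of the markup while skipping `<script>` and `<style>` bodies, and it assumes Python with only the standard library's `html.parser`. The names `TextExtractor` and `extract_text` are made up for illustration. It is a starting point under those assumptions, not a full main-content (boilerplate removal) extractor.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Hypothetical helper: collects visible text, skipping script/style bodies."""

    SKIP_TAGS = {"script", "style"}

    def __init__(self):
        super().__init__()
        self._skip_depth = 0   # > 0 while inside a <script> or <style> element
        self._chunks = []      # non-empty text fragments, in document order

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep text only when we are not inside a skipped element.
        if self._skip_depth == 0:
            text = data.strip()
            if text:
                self._chunks.append(text)

    def text(self):
        # Join fragments with single spaces.
        return " ".join(self._chunks)


def extract_text(html):
    """Return the visible text of an HTML string (assumed interpretation)."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return parser.text()


if __name__ == "__main__":
    sample = (
        "<html><head><title>T</title><style>p{color:red}</style></head>"
        "<body><h1>Hello</h1><p>World &amp; more</p>"
        "<script>var x = 1;</script></body></html>"
    )
    print(extract_text(sample))
```

Note that this keeps the `<title>` text and treats every text node equally. If the goal is to isolate the main article body from navigation and ads, a different approach would be needed.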