Neidio i’r brif dudalen lywio Neidio i chwilio Neidio i’r prif gynnwys

Visual Segmentation-Based Data Record Extraction From Web Documents

  • M. Weatherston
  • , A. Obregon
  • , Longzhuang Li
  • , Yonghuai Liu

Allbwn ymchwil: Pennod mewn Llyfr/Adroddiad/Trafodion CynhadleddTrafodion Cynhadledd (ISBN)

29 Dyfyniadau (Scopus)

Crynodeb

Semi-structured data records contained in the Web pages provide useful information for shopping agents and metasearch engines. In this paper, we present a visual segmentation-based data record extraction (VSDR) method to extract data records from those Web pages. VSDR method first segments a Web page into semantic blocks using the spatial closeness and visual resemblance of data records, then neighboring and non-neighboring data records are extracted based on a compress and collapse technique. Experimental results slum that unlike the existing methods which only generate good results on their test domains, VSDR is a general data record extraction method that is able to produce quite stable and good results on a wide range of Web pages.
Iaith wreiddiolSaesneg
TeitlInternational Conference on Information Reuse and Itegration
Tudalennau502-507
Nifer y tudalennau6
ISBN (Electronig)1-4244-1500-4
Dynodwyr Gwrthrych Digidol (DOIs)
StatwsCyhoeddwyd - 13 Awst 2007
DigwyddiadInternational Conference on Information Reuse and Itegration - Las Vegas, Teyrnas Unedig Prydain Fawr a Gogledd Iwerddon
Hyd: 13 Awst 200715 Awst 2007

Cynhadledd

CynhadleddInternational Conference on Information Reuse and Itegration
Gwlad/TiriogaethTeyrnas Unedig Prydain Fawr a Gogledd Iwerddon
DinasLas Vegas
Cyfnod13 Awst 200715 Awst 2007

Ôl bys

Gweld gwybodaeth am bynciau ymchwil 'Visual Segmentation-Based Data Record Extraction From Web Documents'. Gyda’i gilydd, maen nhw’n ffurfio ôl bys unigryw.

Dyfynnu hyn