Page Areas:



Current Submenu:

Additional Information:

FAW @ Lange Nacht der Forschung 2016

Lange Nacht der Forschung 2016

Campus map

Campusplan JKU Linz

Our location on campus ...  more of Campus map (Titel)

Softwarepark Hagenberg

Hier den Alternativtext zum Bild eingeben!

Our location on Softwarepark Hagenberg ...  more of Softwarepark Hagenberg (Titel)


Position Indication:

Content

Web Information Retrieval and Extraction

(Web) information extraction (IE) is commonly defined as extracting structured data out of unstructured data as it appears on Web pages. Current research issues @ FAW comprise: Ontology-based IE, structural IE including table IE, Web page classification, and IE quality assessment.

People

A.Univ.-Prof. Dr. Birgit Pröll
Dipl.-Ing. Christina Feilmayr
Dipl.-Ing. Christina Buttinger
Claudia Vojinovic
Martin Scharrer
Thomas Katzmaier
Manfred Astl
Severin Linecker

Associate People

Dipl.-Ing. Stefan Parzer
Dipl.-Ing. Michael Guttenbrunner

Projects

  • HybridIE: Development of an IE methodology aiming at the specification and evaluation of application criteria for IE-processes, -methods and tools. Funded as FFG BRIDGE program (August 2010 -)
  •  
  • OntoJob: Within the OntoJob project, FAW realises Ontology Population in the eRecruitment domain. Funded as FFG COIN program (starting September 2010)
  •  
  • Marlies: Marlies aims at the recognition of entities and their relations in the manufacturing domain. Cooperation with Tech2select GmbH, Linz; Funded as FFG BASIS program (April 2008 - ongoing)
  •  
  • TourIE: TourIE focuses on the extraction of tourism information from heterogeneously designed accommodation Web sites. Extracted information includes accommodation’s name, available facilities, room price, location, accommodation category, and images. The developed prototype uses a knowledge engineering approach and i based on a tourism domain ontology. (June 2007 - June 2008)
  •  
  • Jobolize: Ontology Based Web Information Extraction for eRecruitement. Cooperation with JoinVision GmbH, Vienna; Funded as FFG BASIS program (Mar-Sep 2007, Oct-Dec 2008)

Publications/Presentations

Publications:

Feilmayr, C., Buttinger, C., Guttenbrunner, M., Parzer, S., Pröll, B.
Extracting Room Prices from Web Tables - an Ontology-Aware Approach
Proceedings of ENTER 2010, ENTER 2010, Lugano, Switzerland, February 2010.
 
Feilmayr, C., Parzer, S., Pröll, B.
Ontology-based Information Extraction from Tourism Web sites
Journal Information Technology & Tourism, Vol.11, No.3, pp. 183-196, 2009.
 
Feilmayr, C., Pröll, B.
Ontologie-basierte Informationsextraktion im eTourismus
HMD - Praxis der Wirtschaftsinformatik, Heft 270 eTourism, pp. 63-72, December 2009
 
Feilmayr, C., Barta, R., Grün, C., Pröll, B., Werthner, H.
Covering the Semantic Space of Tourism: an Approach based on Modularized Ontologies
Workshop on Context, Information And Ontologies (CIAO 2009), ESWC 2009, Heraklion, Greece, June 2009.
 
Feilmayr, C., Buttinger, C., Guttenbrunner, M., Parzer, S., Pröll, B.
Web Information Extraction
in: "Hagenberg Research" Ed. Bruno Buchberger et al. Chapter 7: "Information and Semantics in Databases and on the Web", Springer July 2009.
 
Buttinger Ch., Palkoska J., Retschitzegger W., Schauer M., Immler R.
JobOlize – Headhunting by Information Extraction in the era of Web 2.0
Workshop Proceedings of 8th International Conference on Web Engineering (ICWE 2008); 7th Workshop on Web-Oriented Software Technologies (IWWOST'08), New York, July 2008.

Presentations:

Buttinger C. Feilmayr, C., Guttenbrunner, M., Pröll, B.
MARLIES – WebIE Supporting the Supply of Demands in Manufacturing
ESTC (European Semantic Technology Conference 2009) Workshop, Vienna, Austria (02.12.2009).
 
Feilmayr, C.
ONTOSOPHIA - Ontology-driven Information Extraction supported by Corrective Feedback
Doctoral Consortium ISWC 2009, Washington D.C., USA (23.10.1009)
 
Feilmayr, C., Parzer, S., Pröll, B.
Ontology-based Information Extraction from Tourism Web sites
Modul University Vienna, Austria (13.11.2008).


Information Extraction @ FAW