On May 29, 2018, the workshop «Expert Search in Google and Scraping Techniques» took place at the Universidad Carlos III de Madrid, a sign of the growing interest in mining data and content from search engines. It follows the publication of the book «Expert Search Strategies in Google» and the data extraction tests «Google Scraping» and «Web Scraping in Google Finance».

The rapid advance of information technologies is pushing documentation professionals to improve their skills with digital tools for extracting information from the Web. It is just as important, however, to develop applications that tailor data mining to each specific source and resource. The workshop gives an overview of information searching with advanced query operators and of web-scraping techniques, and shows how to apply both to search engines such as Google. The program is as follows:

PART 1 – Advanced Google Search

  1. Advanced search strategies in search engines
  2. Search operators
  3. RESTful queries
  4. Examples of advanced search
  5. Applications and process automation
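
As a rough illustration of how search operators and URL-based queries from Part 1 fit together, the sketch below assembles a Google query string from a few well-known operators (a quoted phrase, intitle:, site:, filetype:) and encodes it into a search URL. It is not the workshop software. The function name build_search_url and the sample values are hypothetical, and it assumes only that Google accepts the query in the q parameter of its /search page.

```python
from urllib.parse import urlencode


def build_search_url(terms, exact=None, intitle=None, site=None, filetype=None):
    """Combine plain terms and optional operators into a Google search URL."""
    parts = []
    if exact:
        parts.append(f'"{exact}"')           # exact-phrase match
    parts.extend(terms)                      # free keywords
    if intitle:
        parts.append(f"intitle:{intitle}")   # word must appear in the page title
    if site:
        parts.append(f"site:{site}")         # restrict results to one domain
    if filetype:
        parts.append(f"filetype:{filetype}") # restrict results to one file type
    query = " ".join(parts)
    # urlencode percent-encodes quotes and colons and turns spaces into "+"
    return "https://www.google.com/search?" + urlencode({"q": query})


# Hypothetical example: PDF documents about web scraping on one university domain
url = build_search_url(
    ["webometrics"], exact="web scraping", site="uc3m.es", filetype="pdf"
)
print(url)
```

Building the URL in code rather than by hand is what makes the last item of Part 1 possible: the same function can be called in a loop over many domains or file types to automate a batch of searches.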

PART 2 – Scraping Technique

  1. Introduction to parser programs and scraping technique
  2. Working schema of the scraping method
  3. Technologies involved in scraping
  4. First approach with LinkKlipper

PART 3 – Practices

  1. The first parser
  2. Using an XML parser
  3. Using an HTML parser
  4. Methods for downloading the HTML code of a web page
  5. Extracting data from a web page
  6. Extracting news from a digital newspaper
  7. Extracting Webometrics information resources
  8. Extracting results from a simple Google search
  9. Extracting results from an advanced Google search
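
To give an idea of what the HTML parser and data extraction practices involve, here is a minimal sketch that pulls links and their anchor text out of a page using Python's standard html.parser module. It is an assumption about the general technique, not the parser used in the workshop. The class LinkExtractor, the helper extract_links, and the SAMPLE_HTML fragment are all hypothetical, and the HTML is embedded as a string so that the example runs without network access.

```python
from html.parser import HTMLParser


class LinkExtractor(HTMLParser):
    """Collect (href, anchor text) pairs from <a href="..."> elements."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None   # href of the link currently being read, if any
        self._text = []     # text fragments seen inside that link

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self._href = href
                self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def extract_links(html):
    """Parse an HTML string and return the links it contains."""
    parser = LinkExtractor()
    parser.feed(html)
    parser.close()
    return parser.links


# Hypothetical page fragment standing in for a downloaded news page
SAMPLE_HTML = """
<html><body>
<h1>Headlines</h1>
<a href="/news/1">First <b>headline</b></a>
<a name="anchor">no link</a>
<a href="https://example.org/news/2">Second headline</a>
</body></html>
"""

for href, text in extract_links(SAMPLE_HTML):
    print(href, "|", text)
```

In a real scraping task the string would come from one of the download methods listed in the program, and the tag and attribute tests would be adapted to the markup of each source.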

Download Workshop Software

If you attend the workshop and have a VIP code, you can download the trial software I have prepared for the occasion. To do so, enter your code in the form below to access the manual download page.