This section lists the programs, utilities, and datasets that have been made openly available to the academic, scientific, and university communities. They were developed through the research activities of Prof. Dr. Blázquez-Ochando in the field of Documentation Sciences and offer practical applications for teaching, research, and innovation. Online experiments are also listed so that any researcher or user can consult and test them. Software that is considered strategic or is still under development, and whose release could compromise ongoing research or exploitation processes, is not distributed.

Academic and Scientific Software

  1. AMPdoc — Portable distribution of Apache + PHP + MySQL for teaching the automation of information units. Versions 1.0, 1.1, and 2.0. https://sourceforge.net/projects/ampdoc/
  2. AXYZ — Experimental big-data aggregator of RSS feeds with automatic classification and correlation analysis of news items. https://sourceforge.net/projects/axyznews/

GitHub Repositories

  1. LaIAbot — RAG conversational agent for bibliographic recommendation and reader assistance. Python. MIT License. https://github.com/manublaz/laiabot
  2. ScholarDownPython — Mass extraction of papers from Google Scholar using anti-detection techniques. Python. https://github.com/manublaz/ScholarDownPython
  3. ScholarDownPHP — Web scraper for Google Scholar. PHP. https://github.com/manublaz/ScholarDownPHP
  4. phpScrapingPARES — Analytics and big-data extraction on the Spanish Archives Portal (PARES). PHP. https://github.com/manublaz/phpSrapingPARES
  5. sentiManPHP — Sentiment analysis for Spanish. PHP. https://github.com/manublaz/sentiManPHP
  6. promptAI — AI prompts documented in the context of scientific publications. https://github.com/manublaz/promptAI
  7. Cumulus — Software for comprehensive management of information sources and documentary resources. PHP. https://github.com/manublaz/cumulus
  8. Datasets — Open datasets generated from scientific research. MIT License. https://github.com/manublaz/datasets

Datasets

  1. Teseo Database 2015-11-14 — Database of Spanish doctoral theses extracted from TESEO. Used in bibliometric research on doctoral production in Spain. https://sourceforge.net/projects/teseo-database/files/TESEO_2015-11-14/

Online Experiments

  1. Google2down — Web scraping experiment on Google and Google Scholar for structured extraction of search results. https://mblazquez.es/lab/google2down/
  2. Google Spoofing — Experiment on interface spoofing techniques applied to search engines. https://mblazquez.es/lab/googleSpoofing/
  3. NewsMedia — Web scraping experiment on news media for extraction and analysis of informational content. https://mblazquez.es/lab/newsMedia/
  4. Google Finance scraping — Test for extracting structured financial data for experimentation with information retrieval techniques. https://mblazquez.es/lab/googleFinance/