[Article updated and reviewed 2015-11-15]
The Teseo database, published by the Ministry of Education, enables retrieval of information on doctoral theses defended at Spanish universities since 1976. The information provided by this online resource has fostered the development of numerous bibliometric and scientometric studies on the state of research in Spain across various fields of knowledge and specialties.
https://www.educacion.gob.es/teseo
To gauge the impact Teseo has had on research, we conducted a search in Google Scholar using the query (intitle:teseo OR intext:teseo) AND («base de datos»), yielding 1,690 results that trace an upward trend, demonstrating increased direct and indirect citation of the database.
Fig.2. Evolution of direct and indirect references to the Teseo database in academic-scientific publications retrieved from Google Scholar [Consulted on 2015-11-04]
Given the importance of this resource for the development of future research aimed at determining the scientific output of doctoral theses in a specific specialty or field of knowledge, a method for retrieving doctoral thesis records from the Teseo database has been developed. The method employs web scraping techniques combined with the crawling engine Mbot.
Fig.3. Sample of doctoral thesis records collected in the Teseo database
Each permalink in the database has been meticulously analyzed to extract, using XPath and REGEXP pattern techniques, the key data from each registered doctoral thesis. Information such as the full title, author, originating university, defense date, thesis supervisors, tribunal members, descriptors, and abstract is automatically collected and prepared for export in SQL, CSV (Comma Separated Values), and CSV formats compatible with Microsoft Excel, available from the open-source software repository Sourceforge.net [https://sourceforge.net/projects/teseo-database/].
Downloads of Teseo v1.1 [2015-11-14]
- Download Teseo v1.1 CSV (Complete Data)
- Download Teseo v1.1 CSV MS Excel (Complete Data)
- Download Teseo v1.1 SQL (Structure and Data – Complete)
- Download Teseo v1.1 SQL (Structure Only)
- Download Teseo v1.1 SQL (Data Only – Complete)
- Download Teseo v1.1 SQL (Data Only) Part 01
- Download Teseo v1.1 SQL (Data Only) Part 02
- Download Teseo v1.1 SQL (Data Only) Part 03
- Download Teseo v1.1 SQL (Data Only) Part 04
- Download Teseo v1.1 SQL (Data Only) Part 05
- Download Teseo v1.1 SQL (Data Only) Part 06
- Download Teseo v1.1 SQL (Data Only) Part 07
- Download Teseo v1.1 SQL (Data Only) Part 08
- Download Teseo v1.1 SQL (Data Only) Part 09
- Download Teseo v1.1 SQL (Data Only) Part 10
- Download Teseo v1.1 SQL (Data Only) Part 11
- Download Teseo v1.1 SQL (Data Only) Part 12
- Download Teseo v1.1 SQL (Data Only) Part 13
- Download Teseo v1.1 SQL (Data Only) Part 14
- Download Teseo v1.1 SQL (Data Only) Part 15
- Download Teseo v1.1 SQL (Data Only) Part 16
- Download Teseo v1.1 SQL (Data Only) Part 17
- Download Teseo v1.1 SQL (Data Only) Part 18
- Download Teseo v1.1 SQL (Data Only) Part 19
- Download Teseo v1.1 SQL (Data Only) Part 20
Importing Teseo into AMP (Apache, MySQL, PHP)
The Teseo database can be imported into any Apache, MySQL, and PHP-based distribution, such as XAMPP, WAMP, EasyPHP, AMPdoc, provided they include a PHPMyAdmin database management tool to facilitate data migration. Additionally, a special configuration of PHP via the «php.ini» file is required to allow unlimited script execution, increased memory limits, and larger maximum file sizes for import. Below is the recommended configuration and the steps for importing via the PHPMyAdmin database manager.
PHP Configuration (php.ini file)
max_input_time= -1
memory_limit=4028M
post_max_size=500M
upload_max_filesize=500M
max_file_uploads=20
Steps for importing Teseo using phpMyAdmin
- Create an empty database named «teseo». The database will be created without tables, ready for the import of Teseo’s structure and data.
- Import methods:
- Structure and Data – Complete. From the «Import» option, select the file «catalogoteseo-estructuraydatos.sql», previously downloaded. Then click the «Go» button to start the import process. The process may take several minutes. Finally, the system completes loading all Teseo data, making it ready for use.
- Teseo SQL in Parts. From the «Import» option, select the file «catalogoteseo-part01.sql», previously downloaded. Then click the «Go» button to start the import process. This step automatically creates the table with the necessary field structure and loads the first of the 14 available data batches. Repeat the import process with the remaining parts until the migration is complete.
- Verification of the import. It is recommended to verify that a total of 132,378 records corresponding to Doctoral Theses have been imported.
Fig.4. phpMyAdmin import screen. Note that the file size limit is 500MB, enabling a successful Teseo import.
Relationship of Teseo Articles