Ingeniería en computación e informática
https://ridda2.utp.ac.pa/handle/123456789/1723
Last updated: 2024-03-28T17:11:26Z

Automating the transformation of publication formats in scientific journals with a Python script (original title: Automatización del proceso de transformación de formatos de publicaciones en revistas científicas a través de script en Python)
https://ridda2.utp.ac.pa/handle/123456789/18065
Last updated: 2023-08-23T12:41:21Z
Date issued: 2023-07-19T00:00:00Z
Murillo, Danny; López, Sucel
Scientific dissemination and outreach are how society and other scientists learn about research results and the generation of new knowledge. In recent years, scientific journals in digital format have become the most widely used medium for presenting these results. Whether we intend to publish in a journal or to consult one, it is necessary to analyze several aspects of it: its presentation, its form of distribution, the quality of its content, and its impact. Although all of these elements are of interest, the form of distribution is especially relevant because it is linked to the journal's visibility: a journal that cannot be found is neither read nor cited. Likewise, without diverse publication formats, the digital reach of this content among its users cannot be improved. According to data from the report by Scholastica, a paid web platform that includes more than 900 academic journal publishers, the most used formats are PDF and HTML. Studies carried out in Central America, specifically in Costa Rica and Panama, found that scientific journals use PDF, HTML, ePub, XML-JATS, audio, and Flipbook. Of the 185 journals evaluated, only 50% use two formats and barely 15% use more than three, the most common being HTML and PDF. The obstacle is not only the limitations of software such as MS Word for transforming PDF to HTML; according to the editors, they do not use other formats because they are unaware of the software used for this process. In the case of Panamanian journals, of the 30 evaluated, 100% used PDF, only six used HTML, and only four used more than three formats. This points to a deficiency in the number of formats offered and, probably, in the time the transformation process takes publishers.
The objective of this work is to create a Python script that automates the transformation of scientific articles in docx format into other formats such as PDF, HTML, ePub, TXT, and audio, minimizing the use of additional software and reducing the processing time for these documents. In tests of the script, character style formats had to be defined in the documents to achieve good results. With these in place, 24 articles from two Panamanian journals were transformed into the five formats, and the transformation took 15 minutes, compared with the 15 hours it took publishers to do the same work.
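The abstract does not say which libraries the authors' script relies on. As a minimal sketch of the batch step, assuming the pandoc command-line tool is installed (an assumption, not something the source states), the following plans one pandoc call per article and target format. The names `plan_conversions`, `run_plan`, and `TARGETS` are hypothetical. Audio output is left out because it would need a separate text-to-speech step.

```python
import shutil
import subprocess
from pathlib import Path

# Target formats named in the abstract, mapped to output file extensions.
# Audio is omitted: it would need a separate text-to-speech step.
TARGETS = {"pdf": ".pdf", "html": ".html", "epub": ".epub", "txt": ".txt"}


def plan_conversions(docx_files, out_dir):
    """Return one pandoc command (as an argument list) per file and format."""
    out_dir = Path(out_dir)
    commands = []
    for src in map(Path, docx_files):
        for fmt, ext in TARGETS.items():
            dest = out_dir / (src.stem + ext)
            cmd = ["pandoc", str(src), "-o", str(dest)]
            if fmt == "txt":
                # Select pandoc's plain-text writer explicitly.
                cmd += ["-t", "plain"]
            if fmt == "html":
                # "-s" asks pandoc for a standalone page, not a fragment.
                cmd += ["-s"]
            commands.append(cmd)
    return commands


def run_plan(commands):
    """Run each planned command. Assumes pandoc is on PATH; PDF output
    additionally needs a PDF engine that pandoc can call."""
    if shutil.which("pandoc") is None:
        raise RuntimeError("pandoc was not found on PATH")
    for cmd in commands:
        # cmd[3] is the destination path that follows "-o".
        Path(cmd[3]).parent.mkdir(parents=True, exist_ok=True)
        subprocess.run(cmd, check=True)
```

Calling `plan_conversions(["articles/paper1.docx"], "out")` yields four commands, which can be inspected before being passed to `run_plan`. Keeping planning separate from execution makes the batch easy to check without converting anything.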

Relation of the research indicators of the university rankings and the measurement of scientific production in Open Access
https://ridda2.utp.ac.pa/handle/123456789/18064
Last updated: 2023-08-17T13:42:37Z
Date issued: 2023-07-19T00:00:00Z
Murillo, Danny; López, Sucel
The evolution of university rankings over the years has been characterized by attempts to respond to the shortcomings of their measurement systems and to guide students and academics in their search for quality in teaching and research. The rankings appear to have outlined strategic lines aimed at meeting various objectives, including improving the accuracy of their indicators and expanding their measurements to other activities carried out by universities. However, few indicators promote or measure open access to these research resources. The objective of this work is to identify the importance of the university ranking indicators related to the measurement and visibility of open access publications, and to compare the coverage of the sources used in these indicators with that of other open access platforms used by academic and research institutions in Panama. The results indicate that, of the six rankings evaluated, only two use indicators that measure open access, and only one assigns a percentage, of barely 2%, to this indicator. In addition, a comparison of the coverage of the sources used by the university rankings with that of other open access platforms shows that the open access platforms cover more publications, in some cases almost doubling the number of included documents from Panamanian institutions, and in some cases show greater impact through citations.

Web application in Shiny for the extraction of data from profiles in Google Scholar
https://ridda2.utp.ac.pa/handle/123456789/18053
Last updated: 2023-04-25T13:31:18Z
Date issued: 2023-07-18T00:00:00Z
Murillo, Danny; Saavedra, Dalys; Zapata, Robinson
The need to measure the contribution of researchers through academic profiles is of great importance, which is why in 2018 we created an algorithm in the R language to dynamically extract data from individual and institutional public profiles in Google Scholar Citations. Although the algorithm has been very useful for the automatic extraction of data, allowing statistical reports and analyses to be produced from it, it can only be used by someone who knows R, because of the multiple functions integrated into the algorithm. In this work we present a web application that integrates the algorithm for extracting data from Google Scholar Citations while making these scripts easier to use through the R Shiny package, which integrates web components from RStudio while maintaining the programming characteristics of the language. Shiny converts scripts into interactive web applications without requiring any knowledge of HTML, CSS, or JavaScript, making them easy for users to use, manipulate, and view, and allowing future updates to improve functionality. The tests and tasks carried out in this work show that the extraction algorithm could be integrated into the Shiny web application without difficulty, improving extraction times, measured in seconds and minutes, because the user interacts with the web interface rather than with the R code. This allows users who are new to R and who analyze Google Scholar data to use it.
Keywords: Google Scholar Citations, Shiny, web application, web scraping, bibliometric indicators.

Visibility of Panamanian scientific journals in regional indexers and Google Scholar
https://ridda2.utp.ac.pa/handle/123456789/18052
Last updated: 2023-04-25T13:26:08Z
Date issued: 2022-07-18T00:00:00Z
Murillo, Danny; Fenández, Madelaine; Añino, Yostin; López, Omar
This work presents an analysis of scientific journals in Panama to determine their visibility, their reach on different platforms, and the impact of their publications through basic bibliometric indicators such as citations and the h-index, extracted from the Google Scholar Citations platform and the Publish or Perish tool. The document identifies 97 journals from 48 institutions in Panama that make their scientific production visible through journals in digital and print formats. These journals were searched for in 16 directories, databases, and indexers to verify their visibility. The results show 62 digital journals, of which barely 32 use Open Journal Systems; only 4 journals are found in 10 of the 16 platforms evaluated, none have open access policies, and only one journal has an h-index greater than 6. In comparison with Costa Rican journals, the average number of citations in Panamanian journals is only 0.38 citations per publication, which indicates little visibility and little reach, reflected in the little or no impact of the publications.
Keywords: bibliometrics, visibility, journals, Google Scholar, indicators.
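The two indicators named in the abstract, the h-index and the average number of citations per publication, can both be computed from a list of per-publication citation counts. The sketch below uses the standard h-index definition (the largest h such that h publications have at least h citations each). The function names and the sample counts are illustrative only and are not data from the study.

```python
def h_index(citations):
    """Largest h such that h publications have at least h citations each."""
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank
        else:
            # Counts are sorted, so no later publication can qualify.
            break
    return h


def citations_per_publication(citations):
    """Average citations per publication; 0.0 for an empty list."""
    return sum(citations) / len(citations) if citations else 0.0


# Illustrative citation counts for one hypothetical journal (not study data).
sample = [10, 8, 5, 4, 3, 0, 0, 0]
print(h_index(sample))                    # 4
print(citations_per_publication(sample))  # 3.75
```

A journal with many uncited publications can have a low average even when a few articles are well cited, which is why the two indicators are usually reported together.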