General information: 

This was my first assignment as a biomedical engineer for the UNED. I was asked to participate in the Kaggle Competition CORD-19 , which was organized by the Allen Institute of Artificial Intelligence in the context of the pandemic. It was developed between April 2020 and July 2020.

This project was launched as a machine learning competition by the Allen Insitute of Artificial Intelligence in Kaggle as an answer to the issues due to the COVID pandemics. 


I started working on this project in March 2020 and ended it in June 2020. 


I was a beginner and it helped me to understand a bit more NLP and machine learning project development. It also helped me to know how an Artificial Intelligence competition goes and I discovered the Kaggle tools.

Allen Institute of Technology

The Allen Institute for AI (abbreviated AI2) is a research institute founded by late Microsoft co-founder Paul Allen. The institute seeks to achieve scientific breakthroughs by constructing AI systems with reasoning, learning, and reading capabilities. Oren Etzioni was appointed by Paul Allen in September 2013 to direct the research at the institute. 

Link to the Allen Institute Website is here.


Kaggle is a subsidiary of Google LLC, is an online community of data scientists and machine learning practitioners. Kaggle allows users to find and publish data sets, explore and build models in a web-based data-science environment, work with other data scientists and machine learning engineers, and enter competitions to solve data science challenges. 

I had already talked about Kaggle and the link to the it and other programming helpful tool is here.

The project will be explained in detail in the blog section. Below I leave the links of interests. 

Links to my Kaggle:

Link to the project in Kaggle:

My solution (main code)

My complementary codes: