About
Hi! I am Jose. I earned my Ph.D. in Physics from the Universidad Autónoma de Madrid, Spain, in 2019, and I am an expert in simulating the electronic properties of materials. After finishing my PhD in Madrid, I worked as a Postdoctoral Research Associate in Bremen and Hamburg, and then as a Software Developer and Data Manager in the FAIRmat project at the Humboldt University of Berlin. Now, I hold a permanent position at the Bundesanstalt für Materialforschung und -prüfung (BAM), working as a Senior Data Manager.
I am passionate about Science and Software Development. You can find details about the latest software developments I have been involved with on my GitHub profile. Here are some projects I have been involved with that you might find interesting:
- nomad-coe/nomad: The main repository for the NOMAD data repository. From 2022 to 2024, I worked on developing schemas and parsers for NOMAD. The main outcome of my time there was the nomad-simulations schema, which is currently under very active development and use.
- BAMresearch/bam-masterdata: A set of tools and schema definitions for the Materials Science data model developed at BAM. I am currently working on this with experts from a large variety of topics, from atomistic simulations to structural health monitoring.
- JosePizarro3/RAGxiv: A set of Python classes and functions wrapping existing implementations of LLM functionalities to extract structured metadata from arXiv papers. This work resulted in a separate Python package, JosePizarro3/pyrxiv, which uses the arXiv API to store the queried arXiv papers into HDF5 files for fast data retrieval (a generic version of this storage pattern is sketched after this list).
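To give an idea of the HDF5 storage pattern behind pyrxiv, here is a minimal sketch using h5py. The function names and file layout are my own illustration, not pyrxiv's actual API:

```python
import h5py

def save_paper(h5_path: str, arxiv_id: str, title: str, abstract: str) -> None:
    """Store one paper's metadata as an HDF5 group keyed by its arXiv id.

    Hypothetical layout for illustration only; pyrxiv's real schema may differ.
    """
    with h5py.File(h5_path, "a") as f:
        group = f.require_group(arxiv_id)
        group.attrs["title"] = title
        if "abstract" not in group:  # avoid clobbering on repeated calls
            group.create_dataset("abstract", data=abstract, dtype=h5py.string_dtype())

def load_title(h5_path: str, arxiv_id: str) -> str:
    """Fast retrieval: open the file and read a single attribute."""
    with h5py.File(h5_path, "r") as f:
        return str(f[arxiv_id].attrs["title"])

save_paper("papers.h5", "1912.04141", "Electronic correlations in multiorbital systems", "...")
print(load_title("papers.h5", "1912.04141"))
```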
Research
During my career, I have focused on studying strong electronic correlation effects in real materials. These correlations are crucial for certain physical phenomena, like superconductivity and magnetism. They are also key to understanding how materials interact with external electromagnetic fields. This work relies on performing highly complex simulations.
As a Computational Materials Scientist, I am an expert in developing such methodologies, in analyzing these complex behaviors, and in using and writing highly optimized codes.
1. Research Data Management
Since 2022, I have been working as an expert in Research Data Management (RDM). RDM is a key aspect of Science and Good Scientific Practice that is often overlooked. In recent years, with the development of powerful Artificial Intelligence (AI) and Machine Learning (ML) methodologies, it has become obvious that a good treatment of data is key for the future of Science; see, e.g., the Nobel Prize in Chemistry 2024, where a key enabler was the existence of protein databases, which boosted the ML protein-structure predictions that gave rise to this Nobel Prize.
As an expert in RDM, my goal is to provide a service for the scientific community: I develop data models and ontologies, write parsers and mappers for the generated data, and work on the software infrastructure behind these databases. In doing so, I follow the FAIR principles, i.e., data has to be organized to be Findable, Accessible, Interoperable, and Reusable.
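As a minimal sketch of what "data model plus parser" means in practice, here is a toy Python schema and parser; the class, field names, and `key = value` log format are hypothetical and only illustrate the pattern, not any real NOMAD or BAM schema:

```python
from dataclasses import dataclass, asdict
import json

@dataclass
class Simulation:
    """Toy schema for a materials simulation entry.

    Findable: persistent `entry_id`; Accessible: serializable to JSON;
    Interoperable: `program_name` drawn from a controlled vocabulary;
    Reusable: provenance kept in `source_file`.
    """
    entry_id: str
    program_name: str
    chemical_formula: str
    total_energy_eV: float
    source_file: str

def parse_log(entry_id: str, path: str) -> Simulation:
    """Hypothetical parser: map raw `key = value` output lines onto the schema."""
    raw = {}
    with open(path) as f:
        for line in f:
            if "=" in line:
                key, value = line.split("=", 1)
                raw[key.strip()] = value.strip()
    return Simulation(
        entry_id=entry_id,
        program_name=raw["program"],
        chemical_formula=raw["formula"],
        total_energy_eV=float(raw["total_energy"]),
        source_file=path,
    )

# Serialize an entry for a searchable repository index:
# print(json.dumps(asdict(parse_log("entry-0001", "run.log")), indent=2))
```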

2. Strongly Correlated Materials
The main topic of my career has been to study and propose novel experiments for understanding the physics of strongly correlated materials. These systems show a plethora of effects, such as the Mott (metal-to-insulator) transition and Hund metallicity. The electronic states of these materials are known to be intimately related to high-Tc superconductivity and magnetism. Specifically, I studied iron-based superconductors, magic-angle twisted bilayer graphene, and 2D materials.
Thus, understanding these states and proposing new materials with such properties, using the capabilities of AI and ML, is key to solving the long-standing problem of high-Tc superconductivity and to exploiting its full potential in real-world applications.

3. Methods and Software Development
Studying strongly correlated materials and running ML codes is computationally very demanding. In the case of strongly correlated materials, the paradigmatic model used is the Hubbard model. This model has been known and extended for decades, but its exact solution is out of reach in most cases, so new approximations and methodologies, as well as optimized algorithms, are constantly being developed to reach meaningful solutions.
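For reference, in its simplest single-band form the Hubbard Hamiltonian reads:

```latex
H = -t \sum_{\langle i,j \rangle, \sigma} \left( c^{\dagger}_{i\sigma} c_{j\sigma} + \mathrm{h.c.} \right)
    + U \sum_{i} n_{i\uparrow} n_{i\downarrow},
\qquad n_{i\sigma} = c^{\dagger}_{i\sigma} c_{i\sigma}
```

The competition between the kinetic term (hopping amplitude t) and the local Coulomb repulsion U is what drives effects such as the Mott transition.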
Furthermore, this implies using several different tools to obtain solutions of the Hubbard model, and managing the corresponding workflows. A typical workflow includes: choosing the material and relaxing its crystal structure, performing Density Functional Theory (DFT) simulations for the ground-state properties, projecting and downfolding into a smaller sub-space (this step is normally done to make the original material problem more tractable), and finally solving the Hubbard model, i.e., the interacting many-body quantum problem; a sketch of such a pipeline is shown below.
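The following Python sketch shows how these steps chain together into one pipeline. Every function name here is illustrative, not a real package API; the stubbed bodies stand in for calls to the actual simulation codes:

```python
def relax_structure(material: str) -> dict:
    """Step 1: relax the crystal structure (e.g., with a DFT code)."""
    ...

def run_dft(structure: dict) -> dict:
    """Step 2: ground-state DFT calculation (bands, density of states)."""
    ...

def downfold(dft_result: dict, band_window: tuple[int, int]) -> dict:
    """Step 3: project onto a small sub-space of correlated bands, yielding
    the hoppings t and interactions U of an effective Hubbard model."""
    ...

def solve_hubbard(model: dict) -> dict:
    """Step 4: solve the interacting many-body problem (e.g., with DMFT or SSMF)."""
    ...

def workflow(material: str, band_window: tuple[int, int]) -> dict:
    """Chain the four steps: structure -> DFT -> downfolding -> many-body solver."""
    structure = relax_structure(material)
    dft_result = run_dft(structure)
    model = downfold(dft_result, band_window)
    return solve_hubbard(model)
```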

I worked on implementing new approximations, such as the Slave-Spin Mean-Field (SSMF) method, and on optimizing the code by defining new algorithms; see J.M. Pizarro, PhD Thesis: Electronic correlations in multiorbital systems, arXiv:1912.04141.
4. High-Throughput and Code Automation
An important aspect of studying strongly correlated materials is defining a smaller sub-space of bands and solving the Hubbard model for this subset. However, this projection or downfolding is still a very human-involved problem, i.e., it requires a scientist deciding which bands to project onto.
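A toy version of the decision a scientist usually makes by hand is to keep the bands that come close to the Fermi level. The heuristic and the energy window below are my own illustrative assumptions, far simpler than what real downfolding requires:

```python
import numpy as np

def bands_crossing_fermi(eigenvalues: np.ndarray, e_fermi: float = 0.0,
                         window: float = 0.5) -> list[int]:
    """Return the band indices whose energies come within `window` (eV)
    of the Fermi level anywhere in the Brillouin zone.

    eigenvalues: array of shape (n_kpoints, n_bands), in eV.
    """
    distance = np.abs(eigenvalues - e_fermi).min(axis=0)  # per-band minimum
    return [int(i) for i in np.where(distance < window)[0]]

# Example: 3 bands sampled on 4 k-points; only band 1 approaches E_F = 0.
eigs = np.array([[-3.0, -0.2, 2.5],
                 [-2.8,  0.1, 2.9],
                 [-3.1, -0.4, 2.7],
                 [-2.9,  0.3, 3.0]])
print(bands_crossing_fermi(eigs))  # -> [1]
```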
In recent years, there has been a huge push for the automation of both simulations and experiments in Materials Science. However, the problem described above means that an expert in strong correlations is still needed to understand what is happening in these materials.
I am currently working on improving this procedure to make the study of strongly correlated materials more automated, or at least to require minimal human intervention.
