Fully automated processing and optimization of single particle and filamentous transmission electron cryomicroscopy samples
Single particle cryo-EM is getting increasingly accessible to researchers from diverse research areas. Furthermore, recent innovations in cryo-EM hardware development and data acquisition strategies have led to an increase in data quality as well as quantity. However, the overall quality of a data set does not directly correlate with the quality of a single image or the number of images collected, but also strongly depends on the sample itself. Nevertheless, with great data comes great responsibility and just having a large data set is not necessarily advantageous. To get the most out of a data collection, the researcher needs to carefully monitor and curate all data, ideally while its still being collected. Therefore, automated data processing and the analysis of the collected data live during acquisition becomes increasingly important. To get the most information in the shortest amount of time, ideally, all major pre-processing steps would be executed while the data are still collected. In this way, the researcher gets direct feedback about the sample quality and has the chance to make necessary adjustments to the data collection. While there are several tools available to execute the processing pipeline, they all use a static set of input settings for each individual task, limiting their applicability. In this thesis, I present TranSPHIRE, a tool for fully automated on-the-fly data processing. It executes all the important pre-processing steps required for the processing of single particle projects, as well as filamentous samples, in a parallel manner. Additionally, the important metrics of each processing step are presented via the TranSPHIRE GUI, allowing a fast evaluation of all parameters and the data collection itself. TranSPHIRE also includes a machine learning based feedback loop, which enables the optimization of particle picking for any given sample. Specifically, the loop performs iterations of particle picking, 2D classification, and 2D class selection, followed by training of a new model for particle picking. The curated particles can subsequently be subjected to a 3D refinement within TranSPHIRE. For further analysis, the output particles and volumes can eventually be transferred to other software packages, such as SPHIRE. I demonstrate the capabilities of the TranSPHIRE pipeline based on three different scenarios: A previously unknown data set; a data set consisting of two sub-populations, where only one is targeted for particle picking; and a filamentous sample. All three scenarios lead to a high-resolution 3D reconstruction of the target protein in a fully automated manner. Therefore, fully automated data processing and ptimization could pave the way for high-throughput screenings of unknown samples without user intervention. Despite decades of research, processing of filamentous cryo-EM samples remains challenging. One of the reasons is that most 3D refinement approaches require prior knowledge about the helical symmetry parameters. However, for most samples the helical symmetry differs locally, leading to model bias, low-resolution results, or even incorrectly reconstructed structures. Here, I present sp_meridien_alpha.py, a modification of the single particle 3D refinement program sp_meridien.py, to allow filamentous processing in the SPHIRE package. My refinement approach utilizes filamentous constraints to help convergence and does not require previous knowledge about the helical symmetry. Based on two examples, a tobacco mosaic virus and an actomyosin data set, I show that the final resolution and overall map quality achieved by sp_meridien_alpha.py surpasses the one achieved by sp_meridien.py. In summary, the software tool TranSPHIRE and the filamentous 3D refinement program sp_meridien_alpha.py combined simplify the cryo- EM data collection and processing and thereby present a valuable contribution to the field.
Search & People Search
CalendarTo event list
Location & approach
The campus of the Technical University of Dortmund is located near the freeway junction Dortmund West, where the Sauerland line A45 crosses the Ruhr expressway B1/A40. The Dortmund-Eichlinghofen exit on the A45 leads to the South Campus, the Dortmund-Dorstfeld exit on the A40 leads to the North Campus. The university is signposted at both exits.
The "Dortmund Universität" S-Bahn station is located directly on the North Campus. From there, the S-Bahn line S1 runs every 20 or 30 minutes to Dortmund main station and in the opposite direction to Düsseldorf main station via Bochum, Essen and Duisburg. In addition, the university can be reached by bus lines 445, 447 and 462. Timetable information can be found on the homepage of the Rhine-Ruhr transport association, and DSW21 also offer an interactive route network map.
One of the landmarks of the TU Dortmund is the H-Bahn. Line 1 runs every 10 minutes between Dortmund Eichlinghofen and the Technology Center via Campus South and Dortmund University S, while Line 2 commutes every 5 minutes between Campus North and Campus South. It covers this distance in two minutes.
From Dortmund Airport, it takes just over 20 minutes to get to Dortmund Central Station by AirportExpress and from there to the university by S-Bahn. A wider range of international flight connections is offered by Düsseldorf Airport, about 60 kilometers away, which can be reached directly by S-Bahn from the university's train station.
The facilities of TU Dortmund University are spread over two campuses, the larger Campus North and the smaller Campus South. Additionally, some areas of the university are located in the adjacent "Technologiepark".Zum Lageplan
Open weather forecast