SCK.CEN (Studiecentrum voor kernenergie)
Inleiding
-
making an inventory of all Arthrospira genes related to carbon fixation, carbonate transport, carotenoid synthesis, photosynthesis (including light-harvesting complexes, photosystems I and II, thylakoid structures, cytochromes, and electron transport mechanisms), membrane bioenergetics, nitrogen metabolism, respiration, glycogen synthesis, and fermentation. This will entail comparative analysis based on all-to-all BLAST operations, the use of the GenBank and CyanoClust databases and the MaGe platform (http://www.cns.fr/agc/microscope/).
-
The apprentice may also focus on certain loci, preferably in the frame of ongoing experimental research (e.g. genes involved in oxidative stress or DNA repair, R-M systems, CRISPR elements, and phylogenetic studies of psbA paralogs and -orthologs encoding different D1 protein isoforms).
The microbiology group at SCKCEN ran a first of its kind microbial growth experiment in the international space station. Four specially designed bioreactors were send to the ISS to compare the growth of cyanobacterium Arthrospira sp PCC 8005 in space with the growth on ground in the lab. Next to proteomics, DNA methylation and several biochemical analyses several growth parameters and data about bioreactor performance were collected.
Four consecutive batches were run on each reactor spanning in total five weeks of runtime. To compare different batches and different reactors and to integrate them with, for example, the proteomics data, a tool for clear visualization and options to select different batches and plotting options is needed. The data from more than 35 growth- and bioreactor-performance-related variables consist of 3 million data points for each reactor, and is stored in multiple CSV files. These files are too large to visualize and edit with graphical interfaces such as Microsoft Excel.
Furthermore, the data is recorded every second and stored in a log-file, regardless of the state of the reactor (active or idle). For example, optical density measurements are done every 12 hours, so only once per 12 hours, the recorded value changes. For proper and informative data use, the exact time of measurement with the corresponding value should be used. Extracting these points is challenging as the logging does not indicate where the recorded values change.
Another difficulty is that the experiment did not ran completely as planned, i.e., the running schedule of the experiment had to be adapted during the experiment. This also needed to be taken into account whilst organizing and extracting the data.
In addition, for the flight reactors, the provided data is not divided in the different batches. Therefore, the batch exchanges points need to be searched in the log-files. Although the batches were planned on fixed dates, the exact timing relied on the astronaut timing and adaption of the running scheme made based on observations during runtime.
R scripts were used to extract, organize and visualize the data. First, an R script was written to combine the reactor-specific data, search for the batch exchange points and extract the relevant time points. Next, a script was written to adapt the data-format for easier use in a Shiny dashboard. Finally, the R package Shiny dashboard was used to create a user friendly dashboard to visualize different bioreactor parameters and choose which reactor(s) and batch(es) are plotted and allows the user to download a table with selected data points.
The Shiny dashboard allows the user to visualize six parameters separately. It is possible to select data from one or more batches and one or more reactors. The generated plots allow zooming, saving, setting a grid to clearly separate batches and toggle between different x-axis input. It is also possible to lay the different batches over each other and a clear difference in marker shape and color is set for each batch and reactor. The selected data can also be viewed and downloaded as a table.
There is still ongoing work on the dashboard to add some additional parameters that need more calculation, make it possible to visualize text written errors or running scheme adaptions on the plot at the time of occurrence. A CSS-styling with the company colors could also still be added.
The developed dashboard will also be used for the follow up experiment where real-time follow up of the bioreactors is needed. Discussions are ongoing on how the data coming from the ISS will be made available for real time plotting and will require a script to fetch the data automatically
Address
Boeretang 200
2400 Mol
32 14 33 21 11 Belgium |
Contacts
Traineeship supervisor
Paul Janssen
pjanssen@sckcen.be |
Traineeship supervisor
Felice Mastroleo
|
Traineeship supervisor
Gleb Goussarov
0477136097 gleb.goussarov@sckcen.be |
Traineeship supervisor
Mohamed Ahmed
014332836 mohamed.mysara@sckcen.be |
Traineeship supervisor
Jürgen Claesen
jurgen.claesen@sckcen.be |