Hello! I'm Franklin 👋


Aspiring AI & Bioinformatics Researcher

| Machine Learning | Data analysis

| Deep Learning | Cheminformatics | Omics

Hire Me

About Me

Kiran Franklin

Highly passionate and credible AI and Bioinformatics Researcher with more than 2+ years of experience. Holds a bachelor's degree in Biotechnology engineering and certifications in full stack data science and GCP data engineering. Proficient in Python programming for data science and web development using flask. Expertise in machine learning, data science, GCP, and Nextflow for pipeline management.

Experience

Internship Experience

AI Master Class, Pantech Solutions Pvt Ltd (Pantech ProLabs India) · Apprenticeship

Oct 2020 - Nov 2020

A 30-day internship where I gained experience in applying AI to real-world applications. During the internship, I was taught a variety of AI techniques, including OpenCV for face recognition, emotion recognition, object detection and tracking, as well as NLP for speech recognition and emotion recognition. I also gained experience in deep learning techniques such as plant disease classification and drowsiness detection, as well as game development using AI techniques in a project such as AI Snake Game. Additionally, I learned about cloud-based AI and AI in hardware. Over the course of the internship, I successfully deployed more than 25 real-time codes/applications and gained hands-on experience with the latest AI technologies.

Skills :
    TensorFlow · TensorBoard · Data Science · Machine Learning · Matplotlib · NumPy · Seaborn
Research Intern, Bayes Labs

Jun 2020 - Oct 2020

I have worked across multiple areas in drug discovery and machine learning, with a particular focus on collecting data sources in chemistry and genomics, leveraging this data to generate new molecules with desired properties, and building machine learning-based libraries to accelerate all phases of drug discovery and development. Additionally, I have worked on understanding the backend processes of advanced protein structure prediction algorithms and implementing them.

Through my work, I have gained extensive experience in data collection, machine learning, and drug discovery. I am passionate about leveraging the latest technologies to drive innovation in the field of drug discovery and am constantly seeking new opportunities to deepen my knowledge and skills.

Skills :
    Data Science · Machine Learning · Matplotlib · NumPy · Bioinformatics · Python (Programming Language)
Student Intern, Internship Studio

Jul 2020 - Aug 2020

Excel automation with the title "Retail Sales record compilation

Skills :
    MS Office · MS Excel
Intern, shodhaka life sciences

Jun 2019 - Aug 2019

Compilation and comparative studies on gene expression and regulation in spermatogenesis and male infertility

Skills :
    Python (Programming Language) · NGS data analysis

Work Experience

Senior Research Associate Excelra

Jun 2022 - Present

  • Working closely with DevOps teams to promote advanced omics pipelines using Nextflow pipelines to a GCP SaaS architecture, develop pipeline specific features and handling the troubleshooting process of Nextflow pipelines.
  • Versioned the pipelines on GitHub using a submodule strategy and maintained the associated Docker images.
  • Created testing environments for pipeline testing using PyTest/Unit test and assisted in the development of CI/CD pipelines for automated deployment.
  • Designed user-friendly GUIs for non-coding users to develop pipelines using React and Flask, hosted on Cloud Run, and connected to various services such as BigQuery and Cloud SQL to add dynamic features.
  • Acted as a subject matter expert, designing wireframes for software development for Impurity Management and toxicology report visualization and interpretation.

Skills :
    PostgreSQL · SQL · GitHub · Docker · Data Engineering · PyTest · Google Cloud Platform (GCP) · Nextflow · Python (Programming Language) · Flask · React.js
Bioinformatics Engineer Boltzmann labs

Mar, 2021 - Jun, 2022

  • Designed and implemented automated data collection and curation pipelines, as well as created small molecule libraries and Python packages for use in-house.
  • Developed end-to-end pipelines for predicting chemical properties, including docking, site prediction, and pharmacophore modeling, utilizing uncertainty estimators like quantile and standard deviation.
  • Created APIs using FASTAPI for various built-in features and integrated them with frontend, leading to improved user interface and feature validation.
  • Utilized clustering analysis techniques to perform ML and chemo informatics-based analyses, leveraging tools like RDKit.
  • Led a team of 6 in conducting internal and external case studies, involving hypothesis development, filtering criteria, and docking, to design novel drugs.

Skills :
    Scikit-Learn · Microsoft Visual Studio Code · Pycaret · Team Management · Data Science · Machine Learning · Matplotlib · NumPy · Bioinformatics · Python (Programming Language)

Projects I Worked On

During My Bachelor's

Insilco study on human delta sarcoglycan protein involved in Limb Girdle Muscular Dystrophy.
Read More..

  • Conducted modeling of Delta Sarcoglycan protein using I-Tasser and validated the results using SAVES and Rampage. I then improved the structures using SPDBV and What-if software.
  • Acquired a comprehensive understanding of the structure of proteins, including primary, secondary, and tertiary structures, and their impact on protein function.
  • Investigated the interactions of proteins with other proteins, comparing wild-type and mutant forms using Haadock and PDBsum software.

Functional impact on G108R mutations in PPT1 causing infantile neuronal ceroid lipofuscinosis: A molecular dynamics Study
Read More..

  • Protein structure modeling and validation, leveraging various servers to achieve accurate results.
  • Studied the impact of various mutations on the PPT protein using SNP servers, gaining insights into the effects of genetic variation on protein function.
  • Induced mutations in the wild-type PPT protein to study the resulting effects on stability and other key protein properties. I then performed dynamic studies using GROMACS, a widely-used molecular dynamics simulation software.

Enzymatic and antibiotic degradation of Bacterial biofilm in Ventilator Associated pneumonia.
Read More..

  • Studied the formation of biofilms on endotracheal tubes of patients with Ventilator-Associated Pneumonia (VAP), deepening my understanding of the microbial ecology of this disease.
  • Conducted extensive research into the development of various combinations of enzymes and antibiotics to effectively degrade these biofilms, exploring innovative approaches to treating and preventing VAP in affected patients.

During My Tenure in Boltzmann Labs

Automated cheminformatics data-scraper-curator and QSAR modelling.
Read More..

  • Developed robust web scraping scripts capable of efficiently extracting data from a variety of public resources, using unique IDs as input, and curating the resulting data for use in Quantitative Structure-Activity Relationship (QSAR) modeling.
  • Utilized both automated machine learning (AutoML) and graph-based deep learning models to develop highly accurate QSAR models capable of predicting complex chemical interactions. I have also incorporated interpretability and uncertainty metrics into these models, improving their overall reliability and usefulness.
  • Deployed the resulting models using FastAPI, creating a user-friendly and highly accessible interface for scientists and other stakeholders in the chemical and pharmaceutical industries.

Automated virtual Screening and Validation of Small molecules: ADMET and Molecular docking
Read More..

  • Developed an in-house package for calculating rule-based properties, enabling efficient and accurate analysis of complex chemical structures.
  • Utilized advanced AI/ML/DL frameworks to develop predictive models for ADME and toxicity properties, providing critical insights into the safety and efficacy of potential drug compounds.
  • Developed a powerful re-scoring model for predicting the affinity of drugs against proteins, using cutting-edge RESENT architectures to achieve superior accuracy and reliability.
  • Created a sophisticated docking pipeline that can perform docking calculations based on user-supplied proteins and smiles. The pipeline returns detailed information about the resulting interactions, pose, efficiency, and ideal pKi values, enabling rapid analysis and optimization of potential drug candidates.

Chemical space exploration and surrogate modelling.
Read More..

  • Developed advanced strategies for exploring the chemical space in order to optimize QSAR modeling. By leveraging techniques such as active learning and uncertainty sampling, I am able to efficiently identify key areas of the chemical space that are most likely to yield high-quality data and accurate modeling results.
  • Implemented a sophisticated surrogate docking pipeline that is capable of predicting the affinity of drug compounds without actually performing docking calculations. This approach enables faster and more efficient analysis of potential drug candidates, while still providing accurate and reliable insights into their properties.

Current Work

SaaS for HTOmics.
Read More..

  • Worked closely with DevOps teams to promote dvanced omics pipelines using Nextflow pipelines to a GCP SaaS architecture, and have been instrumental in creating automated deployment and testing environments using PyTest and Unit.
  • Assisted in troubleshooting these pipelines, enabling efficient and accurate analysis of complex biological data.
  • Maintained rigorous version control of pipelines on Github using a submodule strategy, and have taken steps to ensure the reliability and reproducibility of analyses by maintaining docker image versions.
  • Designed a dynamic and user-friendly GUI for biologists to access and utilize all available pipelines with just a few clicks, using Flask. I have also hosted this interface on Cloud Run, and have integrated it with BQ and CloudSQL for enhanced functionality and flexibility.

Designing SaaS platform for Impurity management and toxicology reports.
Read More..

  • As a subject matter expert (SME), I have played a key role in designing wireframes and user flows for user interfaces, ensuring that they meet the needs of both clients and end users.
  • I have also been responsible for developing and implementing data parsing strategies that leverage automation where possible, ensuring that data is processed quickly and accurately.
  • In addition to my technical responsibilities, I have provided other technical support to programmers, helping them understand client requirements and guiding them as they develop software solutions.
  • I have also tracked and documented progress and processes throughout the development lifecycle, ensuring that all stakeholders are informed and engaged and that projects stay on track.

Technical Competency

Developer Tools

Programming Languages

Database,Testing & API-webframework

Data Science

Cloud Platform

Bioinformatics

Experienced in using and building AI based drug designing tools like playmolecule, BoltChem. Also, have begineer level skills in: RNA-seq, scRNA-seq

Education

B.E Biotechnology

2017 - 2021
Dayananda Sagar College of Engineering
Visvesvaraya Technological University
DSCE-biotech Address: Shavige Malleshwara Hills, 91st Main Rd, 1st Stage, Kumaraswamy Layout, Bengaluru, Karnataka 560078

My Work

Open Souce contibution

MolPro

MolPro is a comprehensive python package for small molecule generation using protein active site or/and similar molecules using 3D information of molecules with in-silico validation of molecules by docking , pharmacophore hypothesis. Also off target prediction based on the binding site similarities.
MolPro


Others

My interest is in integrating Omics and Cheminformatics data to enhance drug discovery. To pursue this goal, I recognize the importance of upskilling myself in Data Science and Bio/Cheminformatics skills. As part of my efforts to upskill, I have worked on a few mini projects in the field of Data Science and have shared the project code and details on my Github repository

Acheviments.

Participated and won 1st prize for paper presentation in international conference” HORIZON 2019.

Outstanding Employee Award, March, 2023.

Training.

User-end bioinformatics for molecular biology (includes NGS and Microarray data analysis), Shodhaka Life Science Pvt. Ltd.

Professional Courses.

  • Application of Proteomics and Interactomics, NPTEL.
  • Introduction to Genomic Data science and bio python, Cousera.
  • Machine Learning: A-Z hands on Python and R in Data Science, Udemy
  • Full stack Data Science, ineuron
  • GCP Data Engineering, Percipio.

Language Proficiency.

IELTS : English

British Council Dec 2022
Overall Band Score: 7.0
Listening: 6.5 Reading: 8.5 Writing: 6.0 Speaking: 6.0

Others:

Kannada,Tamil,Telugu,Hindi,Germany

+91 9743324727
kiranfranklinkiran@gmail.com
Bengaluru, Karnataka, India