Arbetsbeskrivning
Life Science startup company looking for a computer scientist with skills in python!
Do you want to work in an innovative Life Science startup in an exciting research field? Are you interested in learning and developing how machine learning can be put into practice in medical advancements? If yes, then you are welcome to apply to us! You will belong to a competent and dedicated multi-disciplinary team that are in the transition from university research to the private market.
Job description:
You will be responsible for developing and implementing a Python-based pipeline to preprocess chemical data. The role includes the application of imputation and normalization techniques to prepare the data for further analysis. Additionally, you will employ various machine learning algorithms, including but not limited to Logistic Regression, Random Forest, XGBoost, and SVC, to build models that are capable of clinical prediction.
Responsibilities:
Develop and maintain a Python pipeline for preprocessing chemical data.
Manage and optimize PostgreSQL databases, ensuring data integrity, performance, and efficient query handling for data processing needs.
Implement data imputation and normalization techniques to enhance dataset quality and readiness for machine learning applications.
Design and train machine learning models using Python libraries (e.g., scikit-learn, XGBoost) to support clinical predictions.
Evaluate model performance using appropriate statistical metrics and validation techniques.
Build and maintain backend systems with APIs to connect machine learning models to applications easily.
Manage server setups to keep data processing and model deployment reliable, secure, and able to handle growth.
Set up, monitor, and maintain Docker containers for consistent and efficient application deployment.
Collaborate closely with the team to integrate the machine learning models into a broader clinical prediction framework.
Qualifications:
Holds a degree in Computer Science, Bioinformatics, Statistics, or a related field.
Strong programming skills in Python, including experience with libraries like Pandas, NumPy, SciPy, and Scikit-learn.
Proficiency in working with relational databases, especially PostgreSQL, including writing and optimizing queries
Familiarity with machine learning concepts and models, particularly those mentioned in the job description.
Previous experience with data preprocessing and analysis is not mandatory but can be a plus.
Understanding of server environments, including experience with basic server setup, monitoring, and security.
Hands-on experience with Docker for containerizing applications.
Basic experience in building backend systems with RESTful APIs.
Excellent problem-solving skills and ability to work independently as well as part of a team.
Effective communication skills, both written and verbal (English).
What we offer:
Opportunity to work in a Life Science startup company and get real experience in the transition from university research to private sector.
Opportunity to work in a dynamic, interdisciplinary research environment.
Hands-on experience with real-world data and the chance to contribute to impactful medical research.
Mentorship from leading researchers in the fields of evolutionary biology, cancer research, Chemistry and Chemical Engineering, molecular medicine, and biomarker discovery.
About the position
Full time position
For questions about the position
Kazi Uddin, E-mail: kazi@capillonanalytics.com
Henrik Lindblom e-mail: henrik@capillonanalytics.com