Artificial intelligence, Data pipelines, Data science techniques, ETL frameworks, FastAPI, Flask, Generative AI, Jupyter Notebook, LangChain, Large Language Models (LLMs), Machine learning techniques, Natural Language Processing (NLP), Python Programming, Scikit-learn, TensorFlow
As a Data Scientist at IBM, you will help transform our clients’ data into tangible business value by analyzing information, communicating outcomes and collaborating on product development. Work with best-in-class open source and visual tools, along with the most flexible and scalable deployment options. Whether it’s investigating patient trends or weather patterns, you will work to solve real-world problems for the industries transforming how we live.
Your Role and Responsibilities
Develop ETL pipelines for data preprocessing and cleansing; integrate data pipelines with generative AI frameworks; work closely with data scientists to understand requirements and translate them into design and implementation; understand, evaluate and use various large language models for generative AI use cases; use various data generation techniques to generate training data for large language models and machine learning models; perform manual and automated evaluation of generative AI models; apply prompt engineering techniques as required by the use case.
Required Technical and Professional Expertise
Skills:
Preferred Technical and Professional Expertise
Skills:
IBM is a leading cloud platform and cognitive solutions company. Restlessly reinventing since 1911, we are the largest technology and consulting employer in the world, with more than 290,000 employees serving clients in 177 countries. IBM Research provides unparalleled insight into business, industry and society by leveraging advanced computing architectures and methodologies to solve some of the world’s most pressing challenges.
Cluj-Napoca, Romania
2-4 years