Aartificial intelligence, Data Lake, Data management, Employee life cycle, Foundation Models, Lakehouse, Large Language Models - LLMs, Optimization
IBM Research Scientists are charting the future of Artificial Intelligence, creating breakthroughs in quantum computing, discovering how blockchain will reshape the enterprise, and much more. Join a team that is dedicated to applying science to some of today’s most complex challenges, whether it’s discovering a new way for doctors to help patients, teaming with environmentalists to clean up our waterways or enabling retailers to personalize customer service.
Your Role and Responsibilities
This is for a 2024 summer internship with the following start dates: May – August or June – September for quarter system schools.
We are broadly interested in making foundation models (FMs) effective for a range of data management tasks, particularly those related to the management of structured data in enterprise data lakes and lakehouses.
Topics of interest include research on effective and efficient tuning techniques, knowledge-driven reasoning, and causality-driven alignment for better control and run-time performance of FMs and their use in enterprise data tasks. Tasks of interest include semantic enrichment of structured data, semantic data management with metadata and knowledge graphs, code generation for data retrieval with transformations, and various data wangling tasks in the end-to-end data lifecycle in data lakes.
Tuning-related research spans full-space and parameter-efficient tuning techniques with supervised as well as reinforcement learning with reward functions that capture end-use performance. Grounding the generation of tuned models in domain-specific vocabulary, efficient techniques for human-in-the-loop adaptation at inference time, and retrieval augmentation techniques for data management tasks will be of interest.
For knowledge-driven reasoning, formulations and benchmarks that treat the database query-answering process as a knowledge-extraction task will be useful for experimenting with reasoning over database tables at different levels of complexity to improve and expand the reasoning skills of FMs.
For causal alignment, we’re interested in formulations that study and show the causal relationships behind the effectiveness of different prompt optimization methods, where a small set of prompt augmentation tokens improves FMs for issues like delusions, alignment, and transfer.
Required Technical and Professional Expertise
Preferred Technical and Professional Expertise
Candidates should have basic knowledge in one or more of the following skills:
IBM is a leading cloud platform and cognitive solutions company. Restlessly reinventing since 1911, we are the largest technology and consulting employer in the world, with more than 290,000 employees serving clients in 177 countries. IBM Research provides unparalleled insight into business, industry and society by leveraging advanced computing architectures and methodologies to solve some of the world’s most pressing challenges.
Research Triangle Park, Durham, NC, USA
2-4 year
Research Triangle Park, Durham, NC, USA
2-4 year
Cambridge, MA, USA
2-4 year
San Jose, CA, USA
2-4 year
Mountain View, CA, USA
2-4 year
Rio de Janeiro, Brazil
0-2 year
Hortolândia, State of São Paulo, Brazil
0-2 year
Rio de Janeiro, Brazil
0-2 year
Hortolândia, State of São Paulo, Brazil
0-2 year
Yorktown Heights, NY, USA
0-2 year
Yorktown Heights, NY, USA
0-2 year
Toronto, ON, Canada
0-2 year
Calgary, AB, Canada
0-2 year
Montreal, QC, Canada
0-2 year
Givatayim, Israel
0-2 year
Cluj-Napoca, Romania
2-4 year
Research Triangle Park, Durham, NC, USA
4-6 year
Givatayim, Israel
0-2 year
Givatayim, Israel
0-2 year
Givatayim, Israel
0-2 year
Givatayim, Israel
0-2 year
São Paulo, State of São Paulo, Brazil
2-4 year
Rio de Janeiro, State of Rio de Janeiro, Brazil
2-4 year
São Paulo, State of São Paulo, Brazil
2-4 year
Rio de Janeiro, State of Rio de Janeiro, Brazil
2-4 year
Rio de Janeiro, State of Rio de Janeiro, Brazil
0-2 year
Frankfurt, Germany
4-6 year
Ehningen, Germany
4-6 year
Austin, TX, USA; Chicago, IL, USA; Dallas, TX, USA; Houston, TX, USA; Los Angeles, CA, USA; New York, NY, USA; Philadelphia, PA, USA; Phoenix, AZ, USA; San Antonio, TX, USA; San Diego, CA, USA; San Jose, CA, USA
2-4 year
Austin, TX, USA; Chicago, IL, USA; Dallas, TX, USA; Houston, TX, USA; Los Angeles, CA, USA; New York, NY, USA; Philadelphia, PA, USA; Phoenix, AZ, USA; San Antonio, TX, USA; San Diego, CA, USA; San Jose, CA, USA
2-4 year
Austin, TX, USA; Chicago, IL, USA; Dallas, TX, USA; Houston, TX, USA; Los Angeles, CA, USA; New York, NY, USA; Philadelphia, PA, USA; Phoenix, AZ, USA; San Antonio, TX, USA; San Diego, CA, USA; San Jose, CA, USA
2-4 year
Chicago, IL, USA; Columbus, OH, USA; Detroit, MI, USA; Indianapolis, IN, USA; Kansas City, MO, USA; Milwaukee, WI, USA; Minneapolis, MN, USA; Omaha, NE, USA
2-4 year