Career Center | Student Life | Tufts University

IPwe


Information Science / Data Science – Chemical Formulas and Compounds Internship

Posted on: October 12, 2020
Expires: February 1, 2021
Career Community: Government, Law & International Affairs

Project Description:

a. Project Name: Information Science / Data Science – Chemical Formulas and Compounds Internship

b. Scope: Find information on how to extract genomic sequences and biochemical information from source documents. The intern will work on the backend to build a parser for chemical formulas and compounds to feed into the system.

c. Project Description: Our AI is working on mass-data ingestion, and we would like to explore new routes to identify genomic sequences in our data sources and extract them without blurring or altering the data source.

i. Some initial orientation about machine learning and genomics can be found here: https://codete.com/blog/machine-learning-genomics/

ii. We typically extract data for our AI from scientific publications and patent documents. These will be the target data sources.

iii. Just running OCR over the documents (if they are PDFs) will destroy the sequences or change their meaning/content. We need to find a way to persist the extracted data in a database/library/collection which our algorithms can then query.

d. Form of Delivery: Periodic updates by email and a final report on your findings in Word format
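
For orientation only, the kind of backend work described in items b and c.iii might be sketched as follows. This is a minimal illustration under stated assumptions, not IPwe's actual system: the names parse_formula, store_sequences, the sequences table, and the 12-character minimum for a nucleotide run are all hypothetical, and the formula parser handles only flat formulas without parentheses, hydrates, or charges.

```python
import re
import sqlite3
from collections import Counter

# An element symbol (e.g. "C", "Na") followed by an optional count.
TOKEN = re.compile(r"([A-Z][a-z]?)(\d*)")

# Assumption: a run of 12 or more A/C/G/T characters is a nucleotide sequence.
SEQUENCE = re.compile(r"\b[ACGT]{12,}\b")


def parse_formula(formula: str) -> dict[str, int]:
    """Parse a flat formula such as 'C6H12O6' into element counts."""
    counts = Counter()
    pos = 0
    while pos < len(formula):
        match = TOKEN.match(formula, pos)
        if match is None:
            raise ValueError(f"unexpected character at {pos}: {formula!r}")
        symbol, digits = match.groups()
        counts[symbol] += int(digits) if digits else 1
        pos = match.end()
    return dict(counts)


def store_sequences(conn: sqlite3.Connection, doc_id: str, text: str) -> int:
    """Store each sequence found in text, unmodified, with its offset."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sequences "
        "(doc_id TEXT, start INTEGER, seq TEXT)"
    )
    rows = [(doc_id, m.start(), m.group()) for m in SEQUENCE.finditer(text)]
    conn.executemany("INSERT INTO sequences VALUES (?, ?, ?)", rows)
    conn.commit()
    return len(rows)


if __name__ == "__main__":
    print(parse_formula("C6H12O6"))
    conn = sqlite3.connect(":memory:")
    store_sequences(conn, "doc1", "The primer ACGTACGTACGTAA binds upstream.")
    print(conn.execute("SELECT doc_id, start, seq FROM sequences").fetchall())
```

The sketch works on text that is already machine-readable; it does not address the OCR problem raised in item c.iii. Storing the matched string verbatim alongside its document ID and character offset keeps the source untouched while still letting downstream code query the extracted sequences.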

Business Purpose: We need this information to improve the precision of our AI.

Duration: Part-time, minimum 7 hours per week

Work Hours: Flexible (intern can work at any time, including at night or on weekends)

Location: Remote (Intern can live anywhere in the world.)

Primary Work Premise: Home

Compensation: $15 (US Dollar) per hour

Consideration for full-time employment: Yes

Training Provided: Yes

Travel Required: No

Submission Requirements:

Please upload your resume (in English)

Interview format: A video interview via Zoom or Microsoft Teams will be arranged upon selection notification

Market Skills and Requirements

1. Requirement – Chemistry, chemical engineering or biochemistry (major or minor)

AND

2. Requirement – Library and Information Sciences, Information Sciences, Information Studies, Information Systems, bioinformatics, computer science or equivalent (major or minor)
