Data Scientist II

Posted 3 Days Ago
Maryland
1-200 Annually
Senior level
Artificial Intelligence
The Role
The Data Scientist will develop machine learning algorithms, analyze datasets, create visualizations, and build models to extract insights. Responsibilities include collaborating with subject matter experts, programming quantitative analyses, and integrating analytics into production frameworks while ensuring performance evaluation and validation of models.

Figure Eight Federal (F8F): Leading the Future of AI Training Data


Figure Eight Federal (F8F) is the FOCI-mitigated arm of Appen (APXYY), focused on the U.S. defense and intelligence sector. As a global leader in the development and optimization of AI training data, F8F sets the standard for innovation in quality, efficiency, and scalability across AI initiatives.


Our technology is used by more than 6.8 million data specialists skilled in more than 235 languages, who deliver unparalleled expertise in capturing domain-specific knowledge in formats that models can train on. The F8F platform is currently used by over 80% of leading LLM developers.


As the world’s largest provider of geospatial human domain monitoring data, F8F processes and analyzes petabits’ worth of geospatial data to provide enhanced situational awareness around the globe. Through our Hydra AI platform, we handle more than 500 billion events per month across 200 countries and territories, serving over 1 billion active users.

At its core, F8F is dedicated to empowering defense and intelligence agencies with cutting-edge technology that advances the Nation’s strategic AI advantage and delivers mission impact. With a proven track record of delivering secure, reliable, and scalable data solutions, F8F remains at the forefront of innovation in AI training data.

 

Learn more about how F8F is shaping the future of AI at F8Federal.com.


The data scientist will develop machine learning, data mining, statistical, and graph-based algorithms to analyze and make sense of datasets; prototype or consider several algorithms and decide upon a final model based on suitable performance metrics; build models or develop experiments to generate data when training or example datasets are unavailable; generate reports and visualizations that summarize datasets and provide data-driven insights to customers; partner with subject matter experts to translate manual data analysis into automated analytics; and implement prototype algorithms within production frameworks for integration into analyst workflows.

Responsibilities:

  • Produce data visualizations that provide insight into dataset structure and meaning
  • Collaborate with subject matter experts (SMEs) to identify important information in raw data and develop scripts that extract this information from a variety of data formats (e.g., SQL tables, structured metadata, network logs)
  • Incorporate SME input into feature vectors suitable for analytic development and testing
  • Translate customers' qualitative analysis processes and goals into quantitative formulations that are coded into software prototypes
  • Develop and implement statistical, machine learning, and heuristic techniques to create descriptive, predictive, and prescriptive analytics
  • Develop statistical tests to make data-driven recommendations and decisions
  • Develop experiments to collect data, or models to simulate data, when required data are unavailable
  • Develop feature vectors for input into machine learning algorithms
  • Identify the most appropriate algorithm for a given dataset and tune input and model parameters
  • Evaluate and validate the performance of analytics using standard techniques and metrics (e.g., cross-validation, ROC curves, confusion matrices)
  • Evaluate individual analytic efforts and make recommendations in the analytic development process
  • Recommend solutions that can scale to large datasets
  • Collaborate with software engineers, cloud developers, and appropriate stakeholders to develop production analytics
  • Develop and train machine learning systems based on statistical analysis of data characteristics to support mission automation

Required Skills:

  • Requires a bachelor's degree in a relevant discipline (e.g., statistics, mathematics, operations research, engineering, or computer science) from an accredited college or university
  • Eight (8) years of relevant experience analyzing datasets and developing analytics
  • Five (5) years of relevant experience programming with data analysis software such as R, Python, SAS, or MATLAB
  • A master's degree in a relevant discipline may be substituted for two (2) years of relevant experience, reducing the requirement to six (6) years of relevant experience analyzing datasets and developing analytics and three (3) years of relevant experience programming with data analysis software such as R, Python, SAS, or MATLAB
  • A PhD in a relevant discipline may be substituted for four (4) years of relevant experience, reducing the requirement to four (4) years of relevant experience analyzing datasets and developing analytics and one (1) year of experience programming with data analysis software such as R, Python, SAS, or MATLAB
  • In lieu of a bachelor's degree, an additional four (4) years of relevant experience may be substituted, for a total of twelve (12) years of relevant experience analyzing datasets and developing analytics and nine (9) years of experience programming with data analysis software such as R, Python, SAS, or MATLAB
  • ACTIVE, CURRENT SECURITY CLEARANCE

We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Top Skills

MATLAB
Python
R
SAS
The Company
Arlington, VA
13 Employees
On-site Workplace
Year Founded: 2007

What We Do

Figure Eight Federal is critical in the creation of the highest quality Decision-Grade AI for leaders engaged in advancing America's security and competitive position. With deep expertise and the ability to train on all data types, Figure Eight Federal works with sectors including defense, health, finance, agriculture, and science to fuel and accelerate Federal AI initiatives and extract the most value from data.
