Data Scientist (Contract – Hybrid – Addison, TX or Charlotte, NC)
Our client is seeking a talented Data Scientist for an 18-month contract position. The role offers the flexibility of a hybrid work model, allowing you to be based in either Addison, Texas, or Charlotte, North Carolina. As a Data Scientist, you’ll be responsible for designing, building, and maintaining document capture applications. The ideal candidate will have a solid background in software engineering, hands-on experience building Machine Learning (ML) and NLP models, and familiarity with Generative AI models.
Location: Hybrid – Addison, TX, or Charlotte, NC
Term: 18-Month Contract
Employment Type: Contract
Pay Range: Not Specified (Competitive)
Industry: Computer and Mathematical
Position Summary: Crafting Intelligent Document Capture Solutions
We’re looking for a Data Scientist who will be instrumental in the creation and maintenance of advanced document capture applications. This role blends software engineering and machine learning, focusing on how data is extracted and understood from various documents. The ideal candidate brings more than modeling skills: a solid background in building robust software, coupled with practical experience developing Machine Learning (ML) and Natural Language Processing (NLP) models. Familiarity with the rapidly evolving landscape of Generative AI (Gen AI) models will also be highly valued.
What’s the Job? Driving AI/ML Innovation in Document Processing
As a Data Scientist, you’ll be at the forefront of designing and building intelligent applications that can understand and process document data. Your responsibilities will span the entire machine learning lifecycle, from data preparation and model development to deployment and optimization, all with a focus on real-world application.
- Design, Build, and Maintain Document Capture Applications: You’ll be responsible for the full lifecycle of designing, building, and maintaining document capture applications. This involves creating software solutions that can ingest, process, and extract information from various document types (e.g., invoices, forms, contracts). Your work will enhance efficiency and accuracy in data acquisition processes, fundamentally transforming how organizations handle information.
- Set Up Supervised and Unsupervised Learning ML/NLP Models: You’ll set up both supervised and unsupervised ML/NLP models. This includes data cleaning (preprocessing raw data for accuracy), data analytics (exploring data to find patterns and insights), feature creation (engineering relevant variables for models), model selection (choosing appropriate algorithms), ensemble methods (combining multiple models for improved performance), and defining, tracking, and visualizing performance metrics. This end-to-end approach ensures robust and effective models.
- Develop ML/NLP Pipelines for Large Data Sets: You’ll be instrumental in developing ML/NLP pipelines for large data sets, encompassing both structured and unstructured data. This involves designing automated workflows for data ingestion, preprocessing, model training, validation, and deployment that can handle large volumes of information efficiently. Your expertise in pipeline development is crucial for scalable ML operations.
- Design and Develop Enterprise-Scale ML/NLP Solutions: You’ll apply your expertise in designing and developing enterprise-scale ML/NLP solutions across one or more specialized areas. This includes:
  - Named Entity Recognition (NER): Identifying and classifying entities (e.g., names, dates, organizations) in text.
  - Document Classification: Categorizing documents based on content.
  - Document Summarization: Generating concise summaries of longer texts.
  - Topic Modelling: Discovering abstract “topics” that occur in a collection of documents.
  - Dialog Systems: Developing conversational AI interfaces.
  - Sentiment Analysis: Determining the emotional tone of text.
  - OCR Text Processing: Extracting text from images (Optical Character Recognition) and further processing it.
  Your contributions will bring intelligent automation to complex document workflows.
- Utilize Optical Character Recognition (OCR) Products: You’ll apply working knowledge and hands-on experience with OCR products. This is vital for extracting text from images or scanned documents, providing the raw data input for your NLP models and document capture applications.
Required Skills / Experience: Your Foundation in Data Science and Software Engineering
To excel as a Data Scientist in this role, you’ll need extensive experience in data science, strong programming skills, and a solid understanding of machine learning and generative AI concepts.
- Extensive Data Scientist or Related Experience: You must have 7+ years as a Data Scientist or in related roles. This extensive background demonstrates a seasoned professional capable of tackling complex data challenges and delivering impactful solutions.
- Bachelor’s Degree in Computer Science or Related Field: You must possess a Bachelor’s degree in Computer Science or a related technical field (e.g., Data Science, Engineering, Mathematics). This academic foundation provides the essential theoretical knowledge for advanced data science and software development.
- Deep Understanding and Exposure to Generative AI: You are required to have a deep understanding of, and some exposure to, new open-source Generative AI (Gen AI) models. This includes familiarity with the architectures, capabilities, and applications of large language models (LLMs) and other generative AI technologies. Your ability to integrate and leverage these models will be crucial.
- Software Development and Agile Experience (5+ years): You must have at least 5 years of programming experience in software development. This indicates a strong foundation in building robust software applications. Coupled with this, direct experience working within an Agile process is required, demonstrating your ability to thrive in iterative development cycles.
- Python Programming for ML/NLP (5+ years): You must have at least 5 years of Python (or equivalent) programming experience specifically for working with ML/NLP models. This signifies strong proficiency in Python’s data science ecosystem (e.g., scikit-learn, TensorFlow, PyTorch, NLTK, spaCy) for model development, training, and deployment.
Desired Skills / Experience: Enhancing Your Impact
While the above are essential, the following skills and attributes would significantly enhance your application:
- Master’s Degree: A Master’s degree in Computer Science, Data Science, or a related technical field is highly desired, indicating advanced academic specialization.
- Collaboration with Diverse Partners: Experience collaborating with a diverse set of partners and stakeholders from various Lines of Business. This highlights your ability to bridge technical and business domains effectively.
- Highly Motivated and Self-Starter: You are highly motivated, proactive, and a self-starter, demonstrating a strong sense of ownership and the ability to create and execute plans without daily oversight.
- Critical Thinker and Problem Solver: You are a critical thinker with the proven ability to analyze problems, identify underlying issues, and provide effective, data-driven solutions.
- Ability to Navigate Enterprise Data Assets: The ability to navigate enterprise data assets across multiple functions is valuable, indicating familiarity with complex data landscapes and data governance.
- Highly Organized with Strong Prioritization Skills: You are highly organized, effectively prioritizing and balancing multiple efforts in a fast-paced environment.
- Excellent Communication and Presentation Skills: You possess excellent communication and presentation skills, both verbal and written, for articulating complex findings clearly to diverse audiences.
- Strong Analytical Abilities: You have strong analytical abilities and are a great problem solver.
- Client Focused: You are client focused, consistently striving to deliver solutions that meet and exceed customer expectations.
What’s in it for me? Growth, Innovation, and a Dynamic Team
This 18-month contract Data Scientist role offers a compelling environment for professional growth and significant impact within a leading technology firm.
- Opportunity to Work on Innovative Projects: You’ll have the invaluable opportunity to work on innovative projects that are pushing the boundaries of AI in document capture and data processing.
- Collaborative and Dynamic Team: You will be part of a dynamic team that fosters collaboration and mutual support, allowing you to learn from and contribute to a group of talented professionals.
- Professional Growth and Development Opportunities: The organization is deeply committed to your professional growth and development, providing opportunities to enhance your skills in Machine Learning, NLP, and Generative AI.
- Engagement with Cutting-Edge Technology: You’ll gain hands-on experience and continuous exposure to cutting-edge technology in the AI/ML domain, ensuring your expertise remains current and competitive.
- Meaningful Impact on Data Processing: Your work will have a meaningful impact on how organizations capture and process documents, contributing to significant advancements in data efficiency and insights.
- Flexible Hybrid Work Model: The role offers a hybrid work model, providing flexibility to balance onsite collaboration in Addison or Charlotte with remote work, supporting work-life balance.
Upon completion of the waiting period, consultants are typically eligible for a comprehensive suite of benefits designed to support their well-being and financial security. These include:
- Medical and Prescription Drug Plans: Comprehensive healthcare coverage for medical services and necessary prescription medications.
- Dental Plan: Benefits covering routine dental care and essential treatments.
- Vision Plan: Coverage for eye examinations, prescription glasses, and contact lenses.
- Health Savings Account (HSA): A tax-advantaged savings account to help pay for qualified medical expenses.
- Health Flexible Spending Account (HFSA): Allows pre-tax contributions for eligible healthcare costs.
- Dependent Care Flexible Spending Account (DCFSA): Provides tax advantages for dependent care expenses.
- Supplemental Life Insurance: Options for additional life insurance coverage for enhanced financial protection.
- Short Term and Long Term Disability Insurance: Income replacement benefits during periods of temporary or prolonged incapacitation due to illness or injury.
- Business Travel Insurance: Coverage for unforeseen events or emergencies that may occur during authorized business travel.
- 401(k) Plus Match: An opportunity to save for retirement with the added benefit of employer matching contributions, enhancing your long-term financial growth.
- Weekly Pay: Consistent and regular compensation provided on a weekly basis, ensuring stable financial flow throughout your contract engagement.
If this Data Scientist role, based in Addison, TX, or Charlotte, NC, aligns with your expertise in building ML/NLP models, your familiarity with Generative AI, and your passion for document capture applications, we encourage you to learn more about this exciting hybrid contract opportunity. This is a fantastic chance to contribute to cutting-edge AI initiatives within a leading technology firm.
Ready to transform data processing with advanced AI?
Job Features
Job Category: AI, Artificial Intelligence, Data