
Data Architect
Posted Jun 20

This is a fully remote position, open to applicants in the United States.
• Design and create scalable enterprise data architectures utilizing AWS and technologies from the Apache ecosystem (e.g., Spark, Iceberg).
• Construct contemporary AI-enabled data platforms that support machine learning, LLM integration, and retrieval-augmented generation (RAG) patterns.
• Develop and sustain conceptual, logical, and physical data models, including Entity Relationship Diagrams (ERDs).
• Engineer modern data lakehouse and data warehouse solutions utilizing Apache Iceberg and cloud-native services.
• Establish and uphold standards for data integration, data quality, and data lifecycle management.
• Create and implement Knowledge Graph architectures that incorporate both structured and unstructured data sources.
• Design and build Knowledge Graphs and semantic data layers employing ontologies, taxonomies, and linked data principles.
• Utilize GraphRAG architectures to augment LLM-based applications with context-aware, explainable data retrieval.
• Develop and oversee ontologies and semantic models to facilitate interoperability, data discovery, and advanced analytics.
• Incorporate AI/ML and generative AI functionalities into enterprise data ecosystems, including vector databases and embedding pipelines.
• Ensure that data architecture aligns with AI governance, focusing on model transparency, traceability, and responsible AI practices.
• Create and manage enterprise metadata frameworks, including data dictionaries, business glossaries, and technical metadata repositories.
• Lead or assist in stakeholder listening initiatives to collect feedback from executives, data leaders, and practitioners across the organization.
• Collaborate with stakeholders to identify data challenges, AI use cases, and opportunities for advanced analytics and automation.
• Assist in the analysis of alternatives (AoA) for data and AI tools/platforms, offering recommendations based on cost, capability, and mission alignment.
• Monitor and report on the progress of the data strategy, improvements in maturity, and program outcomes.
• Bachelor’s degree in Computer Science, Information Systems, Data Science, or a related field, or equivalent experience.
• Over 8 years of experience in data architecture, data engineering, or enterprise data management.
• Proven track record of integrating AI/ML or generative AI capabilities into data platforms.
• Practical experience with AWS data services (e.g., S3, Glue, Redshift, Lake Formation), Apache technologies (e.g., Spark, Iceberg, Hive), and relational databases.
• Strong proficiency in data modeling and the development of ERDs.
• Experience in designing or implementing Knowledge Graphs, ontologies, or semantic data models.
• Familiarity with graph-based retrieval methodologies (e.g., GraphRAG or similar patterns).
• Experience in implementing metadata management, data cataloging, and data governance solutions.
• Demonstrated experience supporting federal data strategy initiatives or an Office of the Chief Data Officer (OCDO).
• In-depth knowledge of data quality, lineage, observability, and AI data readiness frameworks.
• Proficiency with AI-assisted tools and workflows (e.g., LLM copilots, automated code generation, data augmentation tools).
• Ability to convey complex technical concepts to non-technical stakeholders.
• U.S. Citizenship is required; must be able to obtain and maintain a federal clearance.
• Flexible work arrangements
• Professional development opportunities