Quality and Analytics SpecialistAbout UsWe do things differently. We build a solution for enterprises to make sense of all of their information. We know how important it is for companies to understand their customers, so we provide our technology to solve their biggest challenges. We believe in open and transparent communication, not strict rules and hierarchies. We are a team of hardworking, talented people who aim to build software that makes sense of data. We’ve got some huge challenges ahead of us, and we need smart, driven wordsmiths to help us tackle them. If you think you’ve got what it takes—join us.Role SummaryWe are seeking a QA to ensure the quality, accuracy, and reliability of data workflows executed through Notebook/JupyterLab and data platforms. This role focuses on validating processing logic, workflows, and analytical outputs built using Python, Spark, and modern data libraries.Key Responsibilities1. Data & Notebook Quality AssuranceValidate JupyterLab notebooks for logic accuracy, data transformation, and analytical correctness using libraries like Pandas, NumPy, PySpark, Apache Sedona, and GeoMesa.Ensure seamless integration of notebooks within the Syntasa platform, confirming all required libraries, kernels, and dependencies function reliably for users.Verify error-free execution, correct geospatial operations, and a stable, consistent user experience across user roles and environments..2. Data Validation & ReconciliationPerform backend data validation using complex SQL queries and Python-based comparison scripts.Validate aggregation logic, statistical calculations, and transformations implemented through Pandas and Spark frameworks.Identify anomalies, duplicates, and data inconsistencies across datasets.3. Pipeline & Platform TestingTest data pipelines developed using Spark/PySpark and associated libraries.Validate ingestion workflows from multiple sources including APIs, files, and cloud storage.Test performance and scalability of pipelines handling high-volume datasets.4. Test Planning & ReportingDesign detailed test cases covering notebook execution, data integrity, and analytical accuracy.Maintain test artifacts and data validation reports.Log and track defects using Jira or similar tools.5. Collaboration & Continuous ImprovementWork closely with data engineers and data scientists to validate logic developed using Notebook and Python-based analytics frameworks.Suggest improvements to data quality checks and automation strategies.Required SkillsMust HaveStrong Python scripting for data validation and automationExperience testing JupyterLab or notebook-based workflowsHands-on validation of data logic using Pandas, NumPy, PySpark, Apache Sedona (GeoSpark), and/or GeoMesaStrong SQL expertise for reconciliation and analysisExperience testing Spark-based data pipelinesGood to HaveFamiliarity with cloud platforms (GCP / AWS)Experience with Airflow or similar schedulersUnderstanding of geospatial data concepts and large-scale dataset processing
Job Title
Quality and Analytics Specialist