Location: Jacksonville Florida United States
POSITION SUMMARY
Qualifications
- Bachelor’s degree in Computer Science, Information Technology, or a related field
- 5-8 years of data engineer experience in ETL Development, with at least 2 years specifically focused on the DataStage-Netezza integration
- Deep understanding of Netezza’s MPP architecture, S-Blades, and FPGA processing
- Proven ability to debug complex data issues and optimize long-running ETL workflows
- Advanced SQL (Netezza-specific SQL), Unix/Linux Shell Scripting
- Knowledge of Star/Snowflake schemas, Dim & Fact tables, and SCD (Type 1/2)
- Control-M or AutoSys (Scheduling), Git/SVN (Version Control), Aginity (Netezza Workbench)
- Nice to have - some experience with Snowflake or Databricks will be preferred.
Responsibilities
- Design and develop complex DataStage jobs (Parallel and Sequence) to extract data from various sources (Flat Files, DB2, SQL Server, APIs) and load them into Netezza
- Optimize DataStage jobs using partitioning, sorting, and buffer tuning
- Perform SQL tuning specifically for Netezza (e.g., managing Distribution Keys, Clustered Base Tables, and minimizing data skew)
- Utilize the Netezza Connector and Netezza Performance Server features within DataStage to implement high-speed data loading (nzload) and external table frameworks
- Write advanced Netezza SQL, including stored procedures, complex joins, and analytical functions to handle data transformation directly within the warehouse (ELT approach)
- Develop Unix/Linux shell scripts to automate job scheduling, file management, and error handling
- Implement data validation and cleansing rules to ensure the integrity and accuracy of the data warehouse
- Provide production support, troubleshoot job failures, and perform root cause analysis for performance bottlenecks
Details
