Skip to main content
TemplateFREE⏱️ 35 min

Feature Engineering Specification Template

A feature engineering specification template for ML models covering feature definitions, data sources, transformation logic, validation criteria, and...

Updated 2026-03-04
Feature Specification
#1
#2
#3
#4
#5

Edit the values above to try it with your own data. Your changes are saved locally.

Get this template

Choose your preferred format. Google Sheets and Notion are free, no account needed.

Frequently Asked Questions

What is the difference between feature engineering and feature selection?+
Feature engineering creates new features from raw data. Feature selection chooses which features to include in a model from the available set. This template covers engineering. After building features, use techniques like correlation analysis, mutual information, or model-based importance scores to select which features to keep.
How many features should a model have?+
There is no universal answer. Start with 10-20 well-reasoned features and add more only if evaluation metrics improve. More features increase training time, serving latency, and maintenance burden. Each feature you add is a dependency you must maintain. The [AI PM Handbook](/ai-guide) covers feature selection trade-offs in its model development chapters.
How do I prevent data leakage in feature engineering?+
Use point-in-time joins. When computing features for a training example dated January 15, only use data available before January 15. Never include the target variable or its proxies as features. Document the temporal boundary for each feature in the Aggregation Window column. The [AI Eval Scorecard](/tools/ai-eval-scorecard) includes leakage checks in its evaluation criteria.
Should I normalize or standardize features?+
It depends on the model. Tree-based models (XGBoost, Random Forest) do not require normalization. Neural networks and linear models benefit from standardization (zero mean, unit variance). Document the expected distribution in the Validation Criteria section so the normalization strategy is explicit.
How often should feature definitions be reviewed?+
Review quarterly or whenever model performance degrades. Features based on business logic (like plan_tier categories) must be updated when the business changes. Features based on behavioral data should be monitored for [model drift](/glossary/model-drift). If a feature's distribution shifts significantly, investigate whether the underlying user behavior changed or the data pipeline broke. ---

Explore More Templates

Browse our full library of PM templates, or generate a custom version with AI.