Databricks
Recommendations for Job Compute Clusters and Data Access
Pages
2
Time to read
1 min
Publication
Language
English
Pages
2
Time to read
1 min
Publication
Language
English
This guide presents a series of recommendations aimed at optimizing orchestration, retrieval, structured data access, inference, agent evaluation, caching, cost tracking, and governance in data processing environments. It outlines the use of job compute clusters instead of all-purpose clusters and suggests enabling auto-termination to maintain efficiency. For retrieval and retrieval-augmented generation (RAG), it recommends utilizing models with a higher context size and filtering data prior to vector searches. The guide emphasizes structured data access by querying only necessary columns and applying tight WHERE clauses. It also discusses inference and model serving, advising on autoscaling and monitoring idle endpoints. Additionally, it covers agent evaluation strategies, caching techniques for static queries, and the importance of tagging jobs for cost tracking. Finally, it suggests profiling cost and performance before scaling operations to ensure efficiency and effectiveness in data handling.