In the past decade, several studies have estimated the human per-generation germline mutation rate using large pedigrees. More recently, estimates for various non-human species have been published. However, methodological differences among studies in detecting germline mutations and estimating mutation rates make direct comparisons difficult. Here, we describe the many different steps involved in estimating pedigree-based mutation rates, including sampling, sequencing, mapping, variant calling, filtering, and how to appropriately account for false-positive and false-negative rates. For each step, we review the different methods and parameter choices that have been used in the recent literature. Additionally, we present the results from a “Mutationathon”, a competition organized among five research labs to compare germline mutation rate estimates for a single pedigree of rhesus macaques. We report almost a two-fold variation in the final estimated rate among groups using different post-alignment processing, calling, and filtering criteria and provide details into the sources of variation across studies. Though the difference among estimates is not statistically significant, this discrepancy emphasizes the need for standardized methods in mutation rate estimations and the difficulty in comparing rates from different studies. Finally, this work aims to provide guidelines for computational and statistical benchmarks for future studies interested in identifying germline mutations from pedigrees.