Anaplasma phagocytophilum is a tick-transmitted bacterial pathogen of humans and other animals, and is an obligate intracellular parasite. Throughout the course of infection, hosts acquire temporary resistance to granulocytic anaplasmosis as they develop immunity specific for the major antigen, major surface protein 2 (Msp2). However, the bacterium then utilizes a novel recombination mechanism shuffling functional pseudogenes sequentially into an expression cassette with conserved 5' and 3' ends, bypassing host immunity. Approximately 100 pseudogenes are present in the only fully sequenced human-origin HZ genome, representing the possibility for almost unlimited antigenic diversity. In the present study, we identified a select group of 20% of the A. phagocytophilum HZ msp2 pseudogenes that have matched preferentially to human, canine, and equine expression cassettes. Pseudogenes cluster predominantly in one spatial run limited to a single genomic island in less than 50% of the genome but phylogenetically related pseudogenes are neither necessarily located in close proximity on the genome nor share similar percent identity with expression cassettes. Pseudogenes near the expression cassette (and the origin) are more likely to be expressed than those farther away. Taken together, these findings suggest that there may be natural selection pressure to retain pseudogenes in one cluster near the putative origin of replication, even though global recombination shuffles pseudogenes around the genome, separating pseudogenes that share genetic origins as well as those with similar identities.