ENGINEERED CENTRAL NERVOUS SYSTEM COMPOSITIONS (2025)

This application claims the benefit of and priority to U.S. Provisional Patent Application No. 63/242,014, filed on Sep. 8, 2021, and U.S. Provisional Patent Application No. 63/322,191, filed on Mar. 21, 2022, the contents of which are incorporated by reference herein in their entireties.

This application contains a sequence listing filed in electronic form as an xml file entitled BROD-5465WP_ST26.xml, created on Sep. 8, 2022, and having a size of 10,872,431 bytes. The content of the sequence listing is incorporated herein in its entirety.

The subject matter disclosed herein is generally directed to engineered central nervous system targeting compositions including, but not limited to, recombinant adeno-associated virus (AAV) vectors, and systems, compositions, and uses thereof.

Recombinant AAVs (rAAVs) are the most commonly used delivery vehicles for gene therapy and gene editing. Nonetheless, rAAVs that contain natural capsid variants have limited cell tropism. Indeed, rAAVs used today mainly infect the liver after systemic delivery. Further, the transduction efficiency of conventional rAAVs in other cell-types, tissues, and organs by these conventional rAAVs with natural capsid variants is limited. Therefore, AAV-mediated polynucleotide delivery for diseased that affect cells, tissues, and organs other than the liver, such as the central nervous system) typically requires an injection of a large dose of virus (typically about 2×1014 vg/kg), which often results in liver toxicity. Furthermore, because large doses are required when using conventional rAAVs, manufacturing sufficient amounts of a therapeutic rAAV needed to dose adult patients is extremely challenging. Additionally, due to differences in gene expression and physiology, mouse and primate models respond differently to viral capsids. Transduction efficiency of different virus particles varies between different species, and as a result, preclinical studies in mice often do not accurately reflect results in primates, including humans. As such there exists a need for improved rAAVs for use in the treatment of various genetic diseases.

Citation or identification of any document in this application is not an admission that such a document is available as prior art to the present invention.

Described in certain example embodiments herein are compositions comprising a targeting moiety effective to target a central nervous system (CNS) cell, wherein the targeting moiety comprises an n-mer insert optionally comprising or consisting of a P-motif or a double valine motif, or both, wherein the P-motif comprises or consists of the amino acid sequence XmPX1X2GTX3RXn (SEQ ID NO: 8579), wherein X1, X2, X3, Xm, and Xn, are each independently selected from any amino acid, wherein m is 0, 1, 2, or 3, and wherein n is 0, 1, 2, 3, 4, 5, 6, or 7, wherein the double valine motif comprises or consists of the amino acid sequence XmX1X2VX3X4VX5Xn, wherein X1, X2, X3, Xm, and Xn, are each independently selected from any amino acid, wherein m is 0, 1, 2, or 3, and wherein n is 0, 1, 2, 3, 4, 5, 6, or 7; and optionally a cargo, wherein the cargo is coupled to or is otherwise associated with the targeting moiety.

In certain example embodiments, X2 of the P motif is Q, P, E, or H. In certain example embodiments, X1 of the P motif is a polar amino acid, optionally a polar uncharged amino acid. In certain example embodiments, X3 of the P motif is a nonpolar amino acid. In certain example embodiments, X1 of the double valine motif is R, K, V, or W. In certain example embodiments, X2 of the double valine motif is T, S, V, Y or R.

In certain example embodiments, X3 of the double valine motif is G, P, or S. In certain example embodiments, X4 of the double valine motif is S, D, or T. In certain example embodiments, X5 of the double valine motif is Y, G, S, or L.

In certain example embodiments, the targeting moiety comprises two or more n-mer inserts, optionally wherein each n-mer insert comprises or consists of a P-motif, wherein at least one of the P-motifs comprise or consists of the amino acid sequence XmPX1X2GTX3RXn (SEQ ID NO: 8579), wherein X1, X2, X3, Xm, and Xn, are each independently selected from any amino acid, wherein m is 0, 1, 2, or 3, and wherein n is 0, 1, 2, 3, 4, 5, 6, or 7, optionally wherein X2 of the P motif is Q, P, E, or H, optionally wherein the X1 of the P motif is a polar amino acid, optionally a polar uncharged amino acid, and optionally wherein X3 of the P motif is a nonpolar amino acid.

In certain example embodiments, the n-mer insert(s) and/or at least one P-motif and/or double valine motif is selected from any one n-mer insert and/or is encoded by a polynucleotide as set forth in one or more of SEQ ID NOs: 332-582 (Table 7), SEQ ID NOs: 583-8578 (Table 8), SEQ ID NOs: 3-819, 21-22, 24, 200, 202, 204, 212, 218, 224, 226, 228, 286, 234, 258, 260, 647, 649, 923, 1069, 1077, 1265, 2439, 2529, 2759, 3283, 3553, 3923, 4005, 4173, 4537, 4593, 4599, 4601, 4605, 4619, 4665, 4751, 4759, 4825, 4909, 4933, 5013, 5091, 5107, 5127, 5131, 5165, 5177, 5181, 5187, 5189, 5191, 5277, 5287, 5401, 5433, 5631, 5633, 5731, 5741, 5937, 6019, 6045, 6139, 6169, 6497, 7335, 8033, 8269, 8596-8613, (FIGS. 15A, 15B, 17A, 16A, 16B, 16C, and 19A-19C).

In certain example embodiments, the n-mer insert is 3-25 or 3-15 amino acids in length.

In certain example embodiments, X1 of the P motif is S, T, N, Q, C, Y or A, X2 of the P motif is Q, P, E, or H, X3 is G, A, M, W, L, V, F, or I, or any combination thereof.

In certain example embodiments, the targeting moiety comprises a polypeptide, a polynucleotide, a lipid, a polymer, a sugar, or any combination thereof, wherein the polypeptide, the polynucleotide, the lipid, the polymer, the sugar, or any combination thereof is operably coupled to the n-mer insert(s).

In certain example embodiments, the targeting moiety comprises a viral polypeptide.

In certain example embodiments, the viral polypeptide is a capsid polypeptide.

In certain example embodiments, the n-mer insert(s) is/are incorporated into the viral polypeptide such that at least the n-mer insert is located between two amino acids of the viral polypeptide such that at least the n-mer insert is external to a viral capsid.

In certain example embodiments, the viral polypeptide is an adeno associated virus (AAV) polypeptide.

In certain example embodiments, the AAV polypeptide is an AAV capsid polypeptide.

In certain example embodiments, one or more of the n-mer insert(s) are each incorporated into the AAV polypeptide such that the n-mer insert, optionally the P motif(s) and/or double valine motif(s), is/are inserted between any two contiguous amino acids independently selected from amino acids 262-269, 327-332, 382-386, 452-460, 488-505, 527-539, 545-558, 581-593, 598-599, 704-714, or any combination thereof in an AAV9 capsid polypeptide or in an analogous position in an AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV rh.74, AAV rh.10, AAV12, AAV.DJ, AAV.ie, AAV1.9-3, AAV.Anc80, AAV.Anc80L65, AAV2.7m8, or AAV8BP2 capsid polypeptide.

In certain example embodiments, at least one n-mer insert is incorporated into the AAV polypeptide such that at least the P motif and/or double valine motif is inserted between amino acids 588 and 589 in an AAV9 capsid polypeptide or in an analogous position in an AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV rh.74, AAV rh.10, AAV12, AAV.DJ, AAV.ie, AAV1.9-3, AAV.Anc80, AAV.Anc80L65, AAV2.7m8, or AAV8BP2 capsid polypeptide.

In certain example embodiments, the AAV capsid polypeptide is an engineered AAV capsid polypeptide having reduced or eliminated uptake in a non-CNS cell as compared to a corresponding wild-type AAV capsid polypeptide.

In certain example embodiments, the non-CNS cell is a liver cell or a dorsal root ganglion (DRG) neuron.

In certain example embodiments, the wild-type capsid polypeptide is an AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV rh.74, AAV rh.10, AAV12, AAV.DJ, AAV.ie, AAV1.9-3, AAV.Anc80, AAV.Anc80L65, AAV2.7m8, or AAV8BP2 capsid polypeptide.

In certain example embodiments, the engineered AAV capsid polypeptide comprises one or more mutations that result in reduced or eliminated uptake in a non-CNS cell. In certain example embodiments, the one or more mutations are in position 267, in position 269, in position 272, in position 504, in position 505, in position 585, in position 590, or any combination thereof in the AAV9 capsid polypeptide (SEQ ID NO: 1) or in one or more positions corresponding thereto in a non-AAV9 capsid polypeptide.

In certain example embodiments, the non-AAV9 capsid polypeptide is an AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV rh.74, AAV rh.10, AAV12, AAV.DJ, AAV.ie, AAV1.9-3, AAV.Anc80, AAV.Anc80L65, AAV2.7m8, or AAV8BP2 capsid polypeptide.

In certain example embodiments, the mutation in position 267 in the AAV9 capsid polypeptide (SEQ ID NO: 1) or position corresponding thereto in a non-AAV9 capsid polypeptide is a G or X mutation to A, wherein X is any amino acid.

In certain example embodiments, the mutation in position 269 in the AAV9 capsid polypeptide (SEQ ID NO: 1) or position corresponding thereto in a non-AAV9 capsid polypeptide is an S or X to T mutation, wherein X is any amino acid.

In certain example embodiments, the mutation in position 272 in the AAV9 capsid polypeptide (SEQ ID NO: 1) or position corresponding thereto in a non-AAV9 capsid polypeptide is an N or to A mutation, wherein X is any amino acid.

In certain example embodiments, the mutation in position 504 in the AAV9 capsid polypeptide (SEQ ID NO: 1) or position corresponding thereto in a non-AAV9 capsid polypeptide is a G or X to A mutation, wherein X is any amino acid.

In certain example embodiments, the mutation in position 505 in the AAV9 capsid polypeptide (SEQ ID NO: 1) or position corresponding thereto in a non-AAV9 capsid polypeptide is a P or X to A mutation, wherein X is any amino acid.

In certain example embodiments, the mutation in position 585 in the AAV9 capsid polypeptide (SEQ ID NO: 1) or position corresponding thereto in a non-AAV9 capsid polypeptide is an R or X to Q mutation, wherein X is any amino acid.

In certain example embodiments, the mutation in position 590 in the AAV9 capsid polypeptide (SEQ ID NO: 1) or position corresponding thereto in a non-AAV9 capsid polypeptide is a Q or X to A mutation, wherein X is any amino acid.

In certain example embodiments, the engineered AAV capsid polypeptide is an engineered AAV9 capsid polypeptide comprising a mutation at position 267, position 269 or both of a wild-type AAV9 capsid polypeptide (SEQ ID NO: 1), wherein the mutation at position 267 is a G to A mutation and wherein the mutation at position 269 is an S to T mutation.

In certain example embodiments, the engineered AAV capsid polypeptide is an engineered AAV9 capsid polypeptide comprising a mutation at position 590 of a wild-type AAV9 capsid polypeptide (SEQ ID NO: 1), wherein the mutation at position 509 is a Q to A mutation.

In certain example embodiments, the engineered AAV capsid polypeptide is an engineered AAV9 capsid polypeptide comprising a mutation at position 504, position 505, or both of a wild-type AAV9 capsid protein (SEQ ID NO: 1), wherein the mutation at position 504 is a G to A mutation and wherein the mutation at position 505 is a P to A mutation.

In certain example embodiments, the composition is an engineered viral particle.

In certain example embodiments, the engineered viral particle is an engineered AAV viral particle. In certain example embodiments, the AAV viral particle is an engineered AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV rh.74, AAV rh.10, AAV12, AAV.DJ, AAV.ie, AAV1.9-3, AAV.Anc80, AAV.Anc80L65, AAV2.7m8, or AAV8BP2 viral particle.

In certain example embodiments, the optional cargo is capable of treating or preventing a CNS, an eye, or inner ear disease or disorder. In certain example embodiments, the optional cargo is also detargeted in a non-target cell, optionally a CNS cell.

In certain example embodiments, the optional cargo comprises one or more specific RNAi molecule binding sequences specific for an RNAi molecule endogenous to a non-target cell, wherein expression of the RNAi molecule(s) is/are enriched in the non-target cell as compared to a CNS cell and/or specific for synthetic RNAi molecule(s). In certain example embodiments, the RNAi molecule is not expressed in a CNS cell. In certain example embodiments, the non-target cell is a liver cell or a dorsal root ganglion neuron. In certain example embodiments, the RNAi molecule is miR183, miR-182, miR122, miR122a, miR99a, miR-26a, miR199a, miRNA-143, miR101a, miR-30c, or any combination thereof.

Described in certain example embodiments herein are vector systems comprising one or more polynucleotides, wherein at least one of the one or more polynucleotides encodes all or part of a targeting moiety effective to target a central nervous system (CNS) cell, wherein the targeting moiety comprises an n-mer insert optionally comprising or consisting of a P-motif or a double valine motif, or both, wherein the P-motif comprises or consists of the amino acid sequence XmPX1X2GTX3RXn (SEQ ID NO: 8579), wherein X1, X2, X3, Xm, and Xn, are each independently selected from any amino acid, wherein m is 0, 1, 2, or 3, and wherein n is 0, 1, 2, 3, 4, 5, 6, or 7, wherein the double valine motif comprises or consists of the amino acid sequence XmX1X2VX3X4VX5Xn, wherein X1, X2, X3, Xm, and Xn, are each independently selected from any amino acid, wherein m is 0, 1, 2, or 3, and wherein n is 0, 1, 2, 3, 4, 5, 6, or 7; and optionally, a regulatory element operatively coupled to one or more of the one or more polynucleotides.

In certain example embodiments, X2 of the P motif is Q, P, E, or H. In certain example embodiments, X1 of the P motif is a polar amino acid, optionally a polar uncharged amino acid. In certain example embodiments, X3 of the P motif is a nonpolar amino acid.

In certain example embodiments, X1 of the double valine motif is R, K, V, or W. In certain example embodiments, X2 of the double valine motif is T, S, V, Y or R. In certain example embodiments, X3 of the double valine motif is G, P, or S. In certain example embodiments, X4 of the double valine motif is S, D, or T. In certain example embodiments, X5 of the double valine motif is Y, G, S, or L.

In certain example embodiments, the targeting moiety comprises two or more n-mer inserts, optionally wherein each n-mer insert comprises or consists of a P-motif, wherein at least one of the P-motifs comprise or consists of the amino acid sequence XmPX1X2GTX3RXn (SEQ ID NO: 8579), wherein X1, X2, X3, Xm, and Xn, are each independently selected from any amino acid, wherein m is 0, 1, 2, or 3, and wherein n is 0, 1, 2, 3, 4, 5, 6, or 7, optionally wherein X2 of the P motif is Q, P, E, or H, optionally wherein the X1 of the P motif is a polar amino acid, optionally a polar uncharged amino acid, and optionally wherein X3 of the P motif is a nonpolar amino acid.

In certain example embodiments, the n-mer insert(s) and/or at least one P-motif and/or double valine motif is selected from any one n-mer insert and/or is encoded by a polynucleotide as set forth in one or more of SEQ ID NOs: 332-582 (Table 7), SEQ ID NOs: 583-8578 (Table 8), SEQ ID NOs: 3-819, 21-22, 24, 200, 202, 204, 212, 218, 224, 226, 228, 286, 234, 258, 260, 647, 649, 923, 1069, 1077, 1265, 2439, 2529, 2759, 3283, 3553, 3923, 4005, 4173, 4537, 4593, 4599, 4601, 4605, 4619, 4665, 4751, 4759, 4825, 4909, 4933, 5013, 5091, 5107, 5127, 5131, 5165, 5177, 5181, 5187, 5189, 5191, 5277, 5287, 5401, 5433, 5631, 5633, 5731, 5741, 5937, 6019, 6045, 6139, 6169, 6497, 7335, 8033, 8269, 8596-8613, (FIGS. 15A, 15B, 17A, 16A, 16B, 16C, and 19A-19C).

In certain example embodiments, the n-mer insert(s) are each 3-25 or 3-15 amino acids in length.

In certain example embodiments, X1 of the P motif is S, T, N, Q, C, Y or A, X2 of the P motif is Q, P, E, or H, X3 is G, A, M, W, L, V, F, or I, or any combination thereof.

In certain example embodiments, the vector system further comprises a cargo.

In certain example embodiments, the cargo is a cargo polynucleotide and is optionally operatively coupled to one or more of the one or more polynucleotides encoding the targeting moiety.

In certain example embodiments, the vector system is a viral vector system and is capable of producing virus particles, virus particles that contain the cargo, or both.

In certain example embodiments, the vector system is capable of producing a polypeptide comprising one or more of the targeting moieties.

In certain example embodiments, the polypeptide is a viral polypeptide.

In certain example embodiments, the viral polypeptide is a capsid polypeptide.

In certain example embodiments, the capsid polypeptide is an adeno associated virus (AAV) capsid polypeptide. In certain example embodiments, the virus particles are AAV virus particles. In certain example embodiments, the AAV virus particles or AAV capsid polypeptide are engineered AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV rh.74, AAV rh.10, AAV12, AAV.DJ, AAV.ie, AAV1.9-3, AAV.Anc80, AAV.Anc80L65, AAV2.7m8, or AAV8BP2 viral particles or polypeptides.

In certain example embodiments, the n-mer insert(s) is/are incorporated into the viral polypeptide such that at least the n-mer insert is located between two amino acids of the viral polypeptide such that at least the n-mer insert is/are external to a viral capsid.

In certain example embodiments, the n-mer insert(s), optionally the P-motif(s) and/or double valine motif(s), are each inserted between any two contiguous amino acids independently selected from amino acids 262-269, 327-332, 382-386, 452-460, 488-505, 527-539, 545-558, 581-593, 598-599, 704-714, or any combination thereof in an AAV9 capsid polypeptide or in an analogous position in an AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV rh.74, AAV rh.10, AAV12, AAV.DJ, AAV.ie, AAV1.9-3, AAV.Anc80, AAV.Anc80L65, AAV2.7m8, or AAV8BP2 capsid polypeptide.

In certain example embodiments, the at least one polynucleotide that encodes all or part of a targeting moiety is inserted between the codons corresponding to amino acid 588 and 589 in the AAV9 capsid polynucleotide or in an analogous position in an AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV rh.74, AAV rh.10, AAV12, AAV.DJ, AAV.ie, AAV1.9-3, AAV.Anc80, AAV.Anc80L65, AAV2.7m8, or AAV8BP2 capsid polypeptide.

In certain example embodiments, the AAV capsid polypeptide is an engineered AAV capsid polypeptide having reduced or eliminated uptake in a non-CNS cell as compared to a corresponding wild-type AAV capsid polypeptide. In certain example embodiments, the non-CNS cell is a liver cell or a dorsal root ganglion (DRG) neuron. In certain example embodiments, the wild-type capsid polypeptide is an AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV rh.74, AAV rh.10, AAV12, AAV.DJ, AAV.ie, AAV1.9-3, AAV.Anc80, AAV.Anc80L65, AAV2.7m8, or AAV8BP2 capsid polypeptide.

In certain example embodiments, the engineered AAV capsid polypeptide comprises one or more mutations that result in reduced or eliminated uptake in a non-CNS cell. In certain example embodiments, the one or more mutations are in position 267, in position 269,in position 272,in position 504, in position 505, in position 585, in position 590, or any combination thereof in the AAV9 capsid polypeptide (SEQ ID NO: 1) or in one or more positions corresponding thereto in a non-AAV9 capsid polypeptide. In certain example embodiments, the non-AAV9 capsid polypeptide is an AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV rh.74, AAV rh.10, AAV12, AAV.DJ, AAV.ie, AAV1.9-3, AAV.Anc80, AAV.Anc80L65, AAV2.7m8, or AAV8BP2 capsid polypeptide.

In certain example embodiments, the mutation in position 267 in the AAV9 capsid polypeptide (SEQ ID NO: 1) or position corresponding thereto in a non-AAV9 capsid polypeptide is a G or X mutation to A, wherein X is any amino acid.

In certain example embodiments, the mutation in position 269 in the AAV9 capsid polypeptide (SEQ ID NO: 1) or position corresponding thereto in a non-AAV9 capsid polypeptide is an S or X to T mutation, wherein X is any amino acid.

In certain example embodiments, the mutation in position 272 in the AAV9 capsid polypeptide (SEQ ID NO: 1) or position corresponding thereto in a non-AAV9 capsid polypeptide is an N or to A mutation, wherein X is any amino acid.

In certain example embodiments, the mutation in position 504 in the AAV9 capsid polypeptide (SEQ ID NO: 1) or position corresponding thereto in a non-AAV9 capsid polypeptide is a G or X to A mutation, wherein X is any amino acid.

In certain example embodiments, the mutation in position 505 in the AAV9 capsid polypeptide (SEQ ID NO: 1) or position corresponding thereto in a non-AAV9 capsid polypeptide is a P or X to A mutation, wherein X is any amino acid.

In certain example embodiments, the mutation in position 585 in the AAV9 capsid polypeptide (SEQ ID NO: 1) or position corresponding thereto in a non-AAV9 capsid polypeptide is an R or X to Q mutation, wherein X is any amino acid.

In certain example embodiments, the mutation in position 590 in the AAV9 capsid polypeptide (SEQ ID NO: 1) or position corresponding thereto in a non-AAV9 capsid polypeptide is a Q or X to A mutation, wherein X is any amino acid.

In certain example embodiments, the engineered AAV capsid polypeptide is an engineered AAV9 capsid polypeptide comprising a mutation at position 267, position 269, or both of a wild-type AAV9 capsid polypeptide (SEQ ID NO: 1), wherein the mutation at position 267 is a G to A mutation and wherein the mutation at position 269 is an S to T mutation.

In certain example embodiments, the engineered AAV capsid polypeptide is an engineered AAV9 capsid polypeptide comprising a mutation at position 590 of a wild-type AAV9 capsid polypeptide (SEQ ID NO: 1), wherein the mutation at position 509 is a Q to A mutation.

In certain example embodiments, engineered AAV capsid protein is an engineered AAV9 capsid polypeptide comprising a mutation at position 504, position 505, or both of a wild-type AAV9 capsid polypeptide (SEQ ID NO: 1), wherein the mutation at position 504 is a G to A mutation and wherein the mutation at position 505 is a P to A mutation.

In certain example embodiments, the cargo comprises one or more specific RNAi molecule binding sequences specific for an RNAi molecule endogenous to a non-target cell, wherein expression of the RNAi molecule(s) is/are enriched in the non-target cell as compared to a CNS cell and/or specific for synthetic RNAi molecule(s). In certain example embodiments, the RNAi molecule is not expressed in a CNS cell. In certain example embodiments, the non-target cell is a liver cell or a dorsal root ganglion neuron. In certain example embodiments, the RNAi molecule is miR183, miR-182, miR122, miR122a, miR99a, miR-26a, miR199a, miRNA-143, miR101a, miR-30c, or any combination thereof.

In some embodiments, the viral polypeptide is optionally a capsid polypeptide, wherein the composition is modified to include one or more azides, have a reduced number of one or more oxidation susceptible residues, wherein the oxidation susceptible residues are optionally Met, Tyr, Trp, His, Cys or any combination thereof; is PEGylated, or is otherwise functionalized for PEGylation; comprises one or more oligonucleotides tethered via click chemistry to the composition, optionally viral polypeptide; or any combination thereof.

In certain example embodiments, the viral vector and/or cargo is engineered to include one or more cis-acting elements or modifications, optionally a reduced number of CpG islands; one or more TLR9i oligonucleotides, optionally in one or both of the inverted terminal repeats of the vector system; one or more regulatory elements to modify cargo expression; a reduced number of ITR mimicking harpin or other structures; or any combination thereof.

In certain example embodiments, the vector comprising the one or more polynucleotides does not comprise splice regulatory elements.

In certain example embodiments, the vector system further comprises a polynucleotide that encodes a viral rep protein. In certain example embodiments, the viral rep polypeptide is an AAV rep protein. In certain example embodiments, the polynucleotide that encodes the viral rep polypeptide is on the same vector or a different vector as the one or more polynucleotides encoding the targeting moiety or portion thereof. In certain example embodiments, the polynucleotide that encodes the viral rep protein is operatively coupled to a regulatory element.

In certain example embodiments, the vector system is capable of producing a composition or portion thereof as described in any one of the preceding paragraphs or elsewhere herein.

Described in certain example embodiments herein are polynucleotides that encode a composition or portion thereof as described in any one of the preceding paragraphs or elsewhere herein.

Described in certain example embodiments herein are polypeptides encoded by, produced by, or both by a vector system as described in any one of the preceding paragraphs or elsewhere herein or a polynucleotide as described in any one of the preceding paragraphs or elsewhere herein.

In certain example embodiments, the polypeptide is a viral polypeptide. In certain example embodiments, the viral polypeptide is an AAV polypeptide. In certain example embodiments, the polypeptide is coupled to or otherwise associated with a cargo.

In certain example embodiments, the cargo comprises one or more specific RNAi molecule binding sequences specific for an RNAi molecule endogenous to a non-target cell, wherein expression of the RNAi molecule(s) is/are enriched in the non-target cell as compared to a CNS cell and/or specific for synthetic RNAi molecule(s). In certain example embodiments, the RNAi molecule is not expressed in a CNS cell. In certain example embodiments, the non-target cell is a liver cell or a dorsal root ganglion neuron. In certain example embodiments, the RNAi molecule is miR183, miR-182, miR122, miR122a, miR99a, miR-26a, miR199a, miRNA-143, miR101a, miR-30c, or any combination thereof.

In certain example embodiments, the polypeptide includes one or more azides; has a reduced number of one or more oxidation susceptible residues, wherein the oxidation susceptible residues are optionally Met, Tyr, Trp, His, Cys or any combination thereof; is PEGylated, or is otherwise functionalized for PEGylation; comprises one or more oligonucleotides tethered via click chemistry to the composition, optionally viral polypeptide; or any combination thereof.

Described in certain example embodiments herein are particles produced by a vector system as described in any one of the preceding paragraphs or elsewhere herein, optionally including a polypeptide s described in any one of the preceding paragraphs or elsewhere herein. In certain example embodiments, the particle is a viral particle. In certain example embodiments, the viral particle is an adeno-associated virus (AAV) particle, lentiviral particle, or a retroviral particle. In certain example embodiments, the particle comprises a cargo. In certain example embodiments, the viral particle has a central nervous system (CNS) tropism.

In certain example embodiments, the cargo comprises one or more specific RNAi molecule binding sequences specific for an RNAi molecule endogenous to a non-target cell, wherein expression of the RNAi molecule(s) is/are enriched in the non-target cell as compared to a CNS cell and/or specific for synthetic RNAi molecule(s). In certain example embodiments, the RNAi molecule is not expressed in a CNS cell. In certain example embodiments, non-target cell is a liver cell or a dorsal root ganglion neuron. In certain example embodiments, the RNAi molecule is miR183, miR-182, miR122, miR122a, miR99a, miR-26a, miR199a, miRNA-143, miR101a, miR-30c, or any combination thereof.

In certain example embodiments, the polypeptide includes one or more azides; has a reduced number of one or more oxidation susceptible residues, wherein the oxidation susceptible residues are optionally Met, Tyr, Trp, His, Cys or any combination thereof; is PEGylated, or is otherwise functionalized for PEGylation; comprises one or more oligonucleotides tethered via click chemistry to the composition, optionally viral polypeptide; or any combination thereof.

In certain example embodiments of the vector system, polynucleotide, polypeptide or any combination thereof, the cargo is capable of treating or preventing a CNS, an eye, or an inner ear disease or disorder. In certain example embodiments, the cargo is also detargeted in a non-target cell, optionally a CNS cell.

Described in certain example embodiments herein are cell(s) comprising a composition as described in any one of the preceding paragraphs or elsewhere herein; a vector system as described in any one of the preceding paragraphs or elsewhere herein; a polynucleotide as described in any one of the preceding paragraphs or elsewhere herein; a polypeptide as described in any one of the preceding paragraphs or elsewhere herein; a particle as described in any one of the preceding paragraphs or elsewhere herein; or any combination thereof. In certain example embodiments, the cell(s) is/are prokaryotic. In certain example embodiments, the cell(s) is/are eukaryotic.

Described in certain example embodiments herein are pharmaceutical formulation(s) comprising a composition as described in any one of the preceding paragraphs or elsewhere herein; a vector system as described in any one of the preceding paragraphs or elsewhere herein; a polynucleotide as described in any one of the preceding paragraphs or elsewhere herein; a polypeptide as described in any one of the preceding paragraphs or elsewhere herein; a particle as described in any one of the preceding paragraphs or elsewhere herein; a cell as described in any one of the preceding paragraphs or elsewhere herein; or any combination thereof; and a pharmaceutically acceptable carrier.

Described in certain example embodiments herein are methods of treating or preventing a central nervous system, an eye, or an inner ear disease, disorder, or a symptom thereof comprising administering, to the subject in need thereof, a composition as described in any one of the preceding paragraphs or elsewhere herein; a vector system as described in any one of the preceding paragraphs or elsewhere herein; a polynucleotide as described in any one of the preceding paragraphs or elsewhere herein; a polypeptide as described in any one of the preceding paragraphs or elsewhere herein; a particle as described in any one of the preceding paragraphs or elsewhere herein; a cell as described in any one of the preceding paragraphs or elsewhere herein; a pharmaceutical formulation as described in any one of the preceding paragraphs or elsewhere herein; or any combination thereof.

In certain example embodiments, the central nervous system disease or disorder comprises a secondary muscle disease, disorder, or symptom thereof.

In certain example embodiments, the central nervous system disease or disorder is Friedreich's Ataxia, Dravet Syndrome, Spinocerebellar Ataxia Type 3, Niemann Pick Type C, Huntington's Disease, Pompe Disease, Myotonic Dystrophy Type 1, Glut1 Deficiency Syndrome (De Vivo Syndrome), Tay-Sachs, Spinal Muscular Atrophy, Alzheimer's disease, Amyotrophic lateral sclerosis (ALS), Danon disease, Rett Syndrome, Angleman Syndrome, infantile neuronal dystorpy, Gaucher's disease, Krabbe disease, metachromatic leukodystrophy, Salla disease, Farber disease or Spinal Musular Atrophy with progressive myoclonic Epilepsy (also reffered to as Jankovic-Rivera syndrome, Unverricht-Lundborg disease, AADC deficiency, Parkinson's disease, Batten disease, a neuronal ceroid lipofuscinosis disease, giant axonal neuropathy, a mucopolysaccharidosis disease (e.g., Hurler syndrome, MPS III A-D), neurofibromatosis, a spinocerebellar ataxia disease, Sandoff disease, GM2 gangliosidosis, Canavan disease, Cockayne syndrome, or any combination thereof

In certain example embodiments, the eye disease or disorder is Stargardt disease, a Leber's congenital amaurosis (LCA) (e.g., Leber's congenital amaurosis type 2, LEBER CONGENITALAMAUROSIS (LCA) ANDEARLY-ONSET SEVERE RETINALDYSTROPHY (EOSRD)), Choroideremia, a macular degeneration, diabetic retinopathy, a retinopathy, vitelliform macular dystrophy, a macular dystrophy, Sorsby's fundus dystrophy, cataracts, glaucoma, optic neuropathies, Marfan syndrome, myopia, polypoidal choroidal vasculopathies, retinitis pigmentosa, uveal melanoma, X-linked retinoschisis, pattern dystrophy, achromatopsia, Blue cone monochromatism, Bornholm eye disease, ADGUCA1A-associated COD/CORD, autosomal dominant PRPH2 associated CORD, X-linkedRPGR-associatedCOD/CORD, fundus albipunctatus, Enhanced S-conesyndrome, Bietti crystalline comeoretinaldystorphy, or any combination thereof.

In certain example embodiments, the inner ear disease or disorder is GJB-2 deafness, Jeryell and Lange-Nielsen syndrome, Usher syndrome, Alport syndrome, Branchio-oto-renal syndrome, Waardenburg syndrome, Pendred syndrome, Stickler syndrome, Treacher Collins syndrome, CHARGE syndrome, Norrie disease, Perrault syndrome, Autosomal dominant Nonsyndromic hearing loss, utosomal Recessive Nonsyndromic Hearing Loss, X-linked nonsyndromic hearing loss, an auditory neuropathy, a congenital hearing loss, or any combination thereof.

These and other aspects, objects, features, and advantages of the example embodiments will become apparent to those having ordinary skill in the art upon consideration of the following detailed description of example embodiments.

An understanding of the features and advantages of the present invention will be obtained by reference to the following detailed description that sets forth illustrative embodiments, in which the principles of the invention may be utilized, and the accompanying drawings of which:

FIG. 1 shows the adeno-associated virus (AAV) transduction mechanism, which results in production of mRNA from the transgene.

FIG. 2 shows a graph that can demonstrate that mRNA-based selection of AAV variants can be more stringent than DNA-based selection. The virus library was expressed under the control of a CMV promoter.

FIGS. 3A-3B show graphs that can demonstrate a correlation between the virus library and vector genome DNA (FIG. 3A) and mRNA (FIG. 3B) in the liver.

FIGS. 4A-4F show graphs that can demonstrate capsid variants present at the DNA level, and expressed at the mRNA level identified in different tissues. For this experiment, the virus library was expressed under the control of a CMV promoter.

FIGS. 5A-5C show graphs that can demonstrate capsid mRNA expression in different tissues under the control of cell-type specific promoters (as noted on x-axis). CMV was included as an exemplary constitutive promoter. CK8 is a muscle-specific promoter. MHCK7 is a muscle-specific promoter. hSyn is a neuron specific promoter. Expression levels from the cell type-specific promoters have been normalized based on expression levels from the constitutive CMV promoter in each tissue.

FIGS. 6A-6B show (FIG. 6A) a schematic demonstrating embodiments of a method of producing and selecting capsid variants for tissue-specific gene delivery across species and (FIG. 6B) a schematic demonstrating benchmarking of the top selected capsids.

FIG. 7 shows a schematic demonstrating embodiments of generating an AAV capsid variant library, particularly insertion of a random n-mer (n=3-15 amino acids) into a wild-type AAV, e.g., AAV9.

FIG. 8 shows a schematic demonstrating embodiments of generating an AAV capsid variant library, particularly variant AAV particle production. Each capsid variant encapsulates its own coding sequence as the vector genome.

FIG. 9 shows schematic vector maps of representative AAV capsid plasmid library vectors (see e.g., FIG. 8) that can be used in an AAV vector system to generate an AAV capsid variant library.

FIG. 10 shows a graph that can demonstrate the viral titer (calculated as AAV9 vector genome/15 cm dish) produced by constructs containing different constitutive and cell-type specific mammalian promoters.

FIGS. 11A-11P show results from benchmarking the top selected capsids from the first and second round of selection.

FIGS. 12A-12C show a comparison of transduction between the EVGPTQGTVR (SEQ ID NO: 332) capsid insert variant with AAV9 in NHP tissues.

FIGS. 13A-13C show a comparison of the vector genome biodistribution between the EVGPTQGTVR (SEQ ID NO: 332) capsid insert variant with AAV9 in NHP tissues.

FIG. 14A-14B—The DELIVER strategy selects for AAV capsid variants with an enhanced ability to transcribe transgene mRNA in the tissue of interest. (FIG. 14A) (SEQ ID NO: 8614) Map of self-packaging capsid library construct for DELIVER. (FIG. 14B) Schematic of selection using DELIVER.

FIG. 15A-15F—Selection with DELIVER yields potent CNS-tropic capsid variants in multiple mouse strains. (FIGS. 15A and 15B) Amino acid sequence and logo of the 7-mer insert in the 10 most enriched capsid variants with the (FIG. 15A) (SEQ ID NO: 3-8) AQ or (FIG. 15B) (SEQ ID NO: 19, 21-22, 24, 647, 649) DG prefix in the brain of 8 week old C57BL6J and BALB/cJ mice injected with 1E+12 vectorgenomes (vg) virus library following two rounds of selection with DELIVER. Sequences with the same color in each table are encoded by synonymous DNA codons. (FIG. 15C) Predicted structure of the VR-VIII surface loops of AAV9, MDV1A, and MDV1B. (FIG. 15D) Fold difference in eGFP mRNA expression from MDV1A compared to AAV9 in the brain and spinal cord of male and female 8 week old C57BL/6J and BALB/cJ mice injected with 1E+12 vg of MDV1A- or AAV9-CMV-eGFP. Dashed red line represents AAV9-CMV-eGFP expression normalized to 1. Data are represented as mean±SD (n=3-4); *p<0.05, **p<0.01 (Welch's t-test between MDV1A- and AAV9-injected mice with Holm-Šidák MCT). (FIG. 15E) Quantification of transgene delivery efficiency, expressed as vector genomes per diploid genome, of MDV1A- and AAV9-CMV-eGFP in the brain and spinal cord of 8 week old C57BL/6J and BALB/cJ mice injected with 1E+12 vg of MDV1A- or AAV9-CMV-eGFP. Data are represented as mean±SD (n=3-4); *p<0.05, **p<0.01 (Welch's t-test between MDV1A- and AAV9-injected mice with Holm-Šidák MCT). (FIG. 15F) Representative images of mouse brain sagittal sections immunostained for eGFP, from 8 week old C57BL/6J and BALB/cJ mice injected with 5E+11 vg of MDV1A- or AAV9-CMV-eGFP. Blue insets show magnified features in the cortex. Scale bar: 1 mm.

FIG. 16A-16D—The Proline Arginine Loop (PAL) family of neurotropic capsid variants in cynomolgus macaques emerges after selection with DELIVER (see also FIG. 19A-19C). (FIG. 16A) (SEQ ID NO: 200, 202, 204, 212, 218, 224, 228, 234) DNA sequence and corresponding peptide sequence logo of the 7-mer insert in the 10 most enriched DNA sequences of capsid variants in the central nervous system of cynomolgus macaques injected with 3E+13 vg/kg virus library following two rounds of selection with DELIVER. Sequences with the same color are encoded by synonymous DNA codons. (FIG. 16B) (SEQ ID NO: 200, 204, 286, 4005, 4357, 4593, 4599, 4601) Amino acid sequence and logo of the 7-mer insert in the 10 most enriched amino acid-level capsid variants in the macaque CNS. The rank of each variant corresponds to the sum of the ranks of two synonymous DNA sequences. (FIG. 16C—SEQ ID NO: 200, 204, 226, 234, 258, 260, 923, 1265, 2759, 3923, 4593, 4599, 4713, 5277, 5433, 5741, 5937, 6019)

FIG. 17A-17E. Macaque-derived variants outperform AAV9 and mouse- and marmoset-derived variants in transduction of the macaque but not the mouse central nervous system (see also FIG. 20A-20B). (FIG. 17A)(SEQ ID NO: 8596-8613) Pool of capsid variants injected for characterization of the top mouse- and macaque-derived neurotropic variants. (FIG. 17B) Schematic of the barcoded human frataxin transgene and strategy for assessing the performance of top variants in cynomolgus macaques and C57BL/6J and BALB/cJ mice. (FIG. 17C-17E) Fold difference in within-individual hFXN mRNA expression from different variants normalized to AAV9 in various tissues of (FIG. 17C) C57BL/6J mice, (D) BALB/cJ mice, and (E) cynomolgus macaques. Dashed red line represents AAV9-CBh-hFXN expression normalized to 1. Data are represented as mean±SD (n=3 macaques, n=4-7 mice); *p<0.05, **p<0.01 (one-way ANOVA with Dunnett's MCT and AAV9 as the control).

FIG. 18A-18H—Second-generation capsid variant PAL2 transduces the central nervous system of one macaque in a head-to-head experiment with AAV9. (FIG. 18A) Heatmap of PAL2 transgene mRNA expression and vector genome abundance normalized to AAV9. Data are log2-transformed. (FIG. 18B) Immunostaining a coronal section of macaque brain hemisphere for the hFXN-HA transgene delivered by PAL2 suggests widespread and uniform transduction. Scale bar: 1 cm. (FIG. 18C-18E) Localization of hFXN-HA expression with respect to NeuN+ neurons in the macaque (FIG. 18C) parietal cortex, (FIG. 18D) hippocampus, and (FIG. 18E) spinal cord. Scale bars: 100 pm. (FIG. 18F) Localization of hFXN-HA expression with respect to rhodopsin+ photoreceptors in the macaque retina. Scale bars: 100 μm. (FIG. 18G) Representative spinal cord sections with pathology WNL (within normal limits). Scale bar: 200 μm. (FIG. 18H) Representative DRG sections with pathology WNL (within normal limits). Scale bar: 100 μm.

FIG. 19A-19C—Selection for capsid variants with neurotropic properties in cynomolgus macaques yields diverse families of motifs. (FIGS. 19A and 19B) Amino acid sequence and logo of the 7-mer insert in the 10 most enriched capsid variants in the (FIG. 19A) (SEQ ID NO: 260, 1069, 4665, 4751, 4909, 5013, 5107, 5191, 5287, 5401) cerebellum and (FIG. 19B) (SEQ ID NO: 224, 4759, 4971, 5091, 5127, 5165, 5177, 5181, 5187, 5189) spinal cord of cynomolgus macaques injected with 3E+13 vg/kg virus library following two rounds of selection with DELIVER. The rank of each variant corresponds to the sum of the ranks of two synonymous DNA sequences. (FIG. 19C) (SEQ ID NO: 971, 1077, 2439, 2529, 3103, 3283, 3553, 4605, 4619, 4629, 4825, 4933, 5131, 5209, 5233, 5341, 5367, 5461, 5547, 5631, 5633, 5731, 5959, 6001, 6045, 6139, 6169, 6497, 7335, 8033, 8269) Selected clusters of enriched variants in the macaque CNS with conserved sequence properties. Individual residues are color-coded according to their functional properties to highlight conserved aspects of the sequence motif. The rank of each variant corresponds to the sum of the ranks of two synonymous DNA sequences.

FIG. 20A-20B—PAL family capsid variants and other macaque-derived variants outperform AAV9 and mouse- and marmoset-derived variants in transduction of a variety of macaque brain regions. (FIG. 20A) Fold difference in within-individual hFXN mRNA expression from different variants normalized to AAV9 in various central nervous system tissues of cynomolgus macaques. Dashed red line represents AAV9-CBh-hFXN expression normalized to 1. Data are represented as mean±SD (n=3); *p<0.05, **p<0.01 (one-way ANOVA with Dunnett's MCT and AAV9 as the control). (FIG. 20B) Quantification of transgene delivery efficiency, expressed as vector genomes per diploid genome, of different variants in the macaque liver. Data are represented as mean±SD (n=3); *p<0.05, **p<0.01 (one-way ANOVA with Dunnett's MCT and AAV9 as the control).

The figures herein are for illustrative purposes only and are not necessarily drawn to scale.

Unless defined otherwise, technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure pertains. Definitions of common terms and techniques in molecular biology may be found in Molecular Cloning: A Laboratory Manual, 2nd edition (1989) (Sambrook, Fritsch, and Maniatis); Molecular Cloning: A Laboratory Manual, 4th edition (2012) (Green and Sambrook); Current Protocols in Molecular Biology (1987) (F. M. Ausubel et al. eds.); the series Methods in Enzymology (Academic Press, Inc.): PCR 2: A Practical Approach (1995) (M. J. MacPherson, B. D. Hames, and G. R. Taylor eds.): Antibodies, A Laboratory Manual (1988) (Harlow and Lane, eds.): Antibodies A Laboratory Manual, 2nd edition 2013 (E. A. Greenfield ed.); Animal Cell Culture (1987) (R.I. Freshney, ed.); Benjamin Lewin, Genes IX, published by Jones and Bartlet, 2008 (ISBN 0763752223); Kendrew et al. (eds.), The Encyclopedia of Molecular Biology, published by Blackwell Science Ltd., 1994 (ISBN 0632021829); Robert A. Meyers (ed.), Molecular Biology and Biotechnology: a Comprehensive Desk Reference, published by VCH Publishers, Inc., 1995 (ISBN 9780471185710); Singleton et al., Dictionary of Microbiology and Molecular Biology 2nd ed., J. Wiley & Sons (New York, N.Y. 1994), March, Advanced Organic Chemistry Reactions, Mechanisms and Structure 4th ed., John Wiley & Sons (New York, N.Y. 1992); and Marten H. Hofker and Jan van Deursen, Transgenic Mouse Methods and Protocols, 2nd edition (2011).

As used herein, the singular forms “a”, “an”, and “the” include both singular and plural referents unless the context clearly dictates otherwise.

As used herein, “administering” refers to any suitable administration for the agent(s) being delivered and/or subject receiving said agent(s) and can be oral, topical, intravenous, subcutaneous, transcutaneous, transdermal, intramuscular, intra-joint, parenteral, intra-arteriole, intra-arterial, intrathecal, lumbar, subdural, intracisternal, subpial, subretinal, subconjunctival, intravitreal, intratympanic, intracochlear, intradermal, intraventricular, intraosseous, intraocular, intracranial, intraperitoneal, intralesional, intranasal, intracardiac, intraarticular, intracavemous, intrathecal, intravireal, intracerebral, and intracerebroventricular, intratympanic, intracochlear, rectal, vaginal, by inhalation, by catheters, stents or via an implanted reservoir or other device that administers, either actively or passively (e.g. by diffusion) a composition the perivascular space and adventitia. For example, a medical device such as a stent can contain a composition or formulation disposed on its surface, which can then dissolve or be otherwise distributed to the surrounding tissue and cells. The term “parenteral” can include subcutaneous, intravenous, intramuscular, intra-articular, intra-synovial, intrasternal, intrathecal, intrahepatic, intralesional, and intracranial injections or infusion techniques. Administration routes can be, for instance, auricular (otic), buccal, conjunctival, cutaneous, dental, electro-osmosis, endocervical, endosinusial, endotracheal, enteral, epidural, extra-amniotic, extracorporeal, hemodialysis, infiltration, interstitial, intra abdominal, intra-amniotic, intra-arterial, intra-articular, intrabiliary, intrabronchial, intrabursal, intracardiac, intracartilaginous, intracaudal, intracavemous, intracavitary, intracerebral, intracisternal, intracorneal, intracoronal (dental), intracoronary, intracorporus cavernosum, intradermal, intradiscal, intraductal, intraduodenal, intradural, intraepidermal, intraesophageal, intragastric, intragingival, intraileal, intralesional, intraluminal, intralymphatic, intramedullary, intrameningeal, intramuscular, intraocular, intraovarian, intrapericardial, intraperitoneal, intrapleural, intraprostatic, intrapulmonary, intrasinal, intraspinal, intrasynovial, intratendinous, intratesticular, intrathecal, intrathoracic, intratubular, intratumor, intratym panic, intrauterine, intravascular, intravenous, intravenous bolus, intravenous drip, intraventricular, intravesical, intravitreal, iontophoresis, irrigation, laryngeal, nasal, nasogastric, occlusive dressing technique, ophthalmic, oral, oropharyngeal, other, parenteral, percutaneous, periarticular, peridural, perineural, periodontal, rectal, respiratory (inhalation), retrobulbar, soft tissue, subarachnoid, subconjunctival, subcutaneous, sublingual, submucosal, topical, transdermal, transmucosal, transplacental, transtracheal, transtympanic, ureteral, urethral, and/or vaginal administration, and/or any combination of the above administration routes, which typically depends on the disease to be treated, subject being treated, and/or agent(s) being administered.

The term “optional” or “optionally” means that the subsequent described event, circumstance or substituent may or may not occur, and that the description includes instances where the event or circumstance occurs and instances where it does not.

The recitation of numerical ranges by endpoints includes all numbers and fractions subsumed within the respective ranges, as well as the recited endpoints.

The terms “about” or “approximately” as used herein when referring to a measurable value such as a parameter, an amount, a temporal duration, and the like, are meant to encompass variations of and from the specified value, such as variations of +/−10% or less, +/−5% or less, +/−1% or less, and +/−0.1% or less of and from the specified value, insofar such variations are appropriate to perform in the disclosed invention. It is to be understood that the value to which the modifier “about” or “approximately” refers is itself also specifically, and preferably, disclosed.

As used herein, a “biological sample” may contain whole cells and/or live cells and/or cell debris. The biological sample may contain (or be derived from) a “bodily fluid”. The present invention encompasses embodiments wherein the bodily fluid is selected from amniotic fluid, aqueous humour, vitreous humour, bile, blood serum, breast milk, cerebrospinal fluid, cerumen (earwax), chyle, chyme, endolymph, perilymph, exudates, feces, female ejaculate, gastric acid, gastric juice, lymph, mucus (including nasal drainage and phlegm), pericardial fluid, peritoneal fluid, pleural fluid, pus, rheum, saliva, sebum (skin oil), semen, sputum, synovial fluid, sweat, tears, urine, vaginal secretion, vomit and mixtures of one or more thereof. Biological samples include cell cultures, bodily fluids, cell cultures from bodily fluids. Bodily fluids may be obtained from a mammal organism, for example by puncture, or other collecting or sampling procedures.

The terms “subject,” “individual,” and “patient” are used interchangeably herein to refer to a vertebrate, preferably a mammal, more preferably a human. Mammals include, but are not limited to, murines, simians, humans, farm animals, sport animals, and pets. Tissues, cells and their progeny of a biological entity obtained in vivo or cultured in vitro are also encompassed.

Various embodiments are described hereinafter. It should be noted that the specific embodiments are not intended as an exhaustive description or as a limitation to the broader aspects discussed herein. One aspect described in conjunction with a particular embodiment is not necessarily limited to that embodiment and can be practiced with any other embodiment(s). Reference throughout this specification to “one embodiment”, “an embodiment,” “an example embodiment,” means that a particular feature, structure or characteristic described in connection with the embodiment is included in at least one embodiment of the present invention. Thus, appearances of the phrases “in one embodiment,” “in an embodiment,” or “an example embodiment” in various places throughout this specification are not necessarily all referring to the same embodiment, but may. Furthermore, the particular features, structures or characteristics may be combined in any suitable manner, as would be apparent to a person skilled in the art from this disclosure, in one or more embodiments. Furthermore, while some embodiments described herein include some but not other features included in other embodiments, combinations of features of different embodiments are meant to be within the scope of the invention. For example, in the appended claims, any of the claimed embodiments can be used in any combination.

All publications, published patent documents, and patent applications cited herein are hereby incorporated by reference to the same extent as though each individual publication, published patent document, or patent application was specifically and individually indicated as being incorporated by reference.

Embodiments disclosed herein provide central nervous system (CNS)-specific targeting moieties that can be coupled to or otherwise associated with a cargo and/or delivery vehicle or system. Embodiments disclosed herein provide polypeptides (used interchangeably herein with the term “proteins”) and particles that can incorporate one or more of the CNS-specific targeting moieties. The polypeptides and/or particles can be coupled to, attached to, encapsulate, or otherwise incorporate a cargo, thereby associating the cargo with the targeting moiety(ies). Embodiments disclosed herein provide CNS-specific targeting moieties that contain one or more n-mer insert as further described herein. The targeting moieties may be used to provide engineered adeno-associated virus (AAV) capsids with a reprogrammed cell-specific and/or species-specific tropism, such as CNS specific tropism, to an engineered AAV particle.

In one example embodiment, the n-mer insert(s) is or contains a P-motif. In one example embodiment, the P-motif comprises the amino acid sequence XmPX1QGTX2RXn (SEQ ID NO: 8580), wherein X1, X2, Xm, and Xn, are each independently selected from any amino acid, wherein m is 0, 1, 2, or 3, and wherein n is 0, 1, 2, 3, 4, 5, 6, or 7, and optionally a cargo, wherein the cargo is coupled to or is otherwise associated with the targeting moiety. In one example embodiment, the P-motif contains or is the amino acid sequence PX1QGTX2RXn (SEQ ID NO: 2), where X1, X2, Xn, are each selected from any amino acid and where n is 0, 1, 2, 3, 4, 5, 6, or 7.

In other example embodiments, the n-mer insert and/or P-motif is selected from the group consisting of SEQ ID NOs: 332-582 (Table 7).

In certain example embodiments, the targeting moiety comprises one or more n-mer inserts each comprising or consisting of a P-motif, wherein at least one of the P-motifs comprise the amino acid sequence XmPX1QGTX2RXn (SEQ ID NO: 8580), wherein X1, X2, Xm, and Xn, are each independently selected from any amino acid, wherein m is 0, 1, 2, or 3, and wherein n is 0, 1, 2, 3, 4, 5, 6, or 7.

Embodiments disclosed herein also provide methods of generating recombinant AAVs (rAAVs) having engineered capsids that can involve systematically directing the generation of diverse libraries of variants of modified surface structures, such as variant capsid polypeptides. Embodiments of the method of generating rAAVs having engineered capsids can also include stringent selection of capsid variants capable of targeting CNS cells. As used in this context herein, “targeting” refers to the ability to, in a target specific manner, recognize, bind, associate with, transduce or infect, or otherwise interact with a target molecule or moiety such that recognition, binding, association, affinity, avidity, transduction or infection, and/or other interaction with the target molecule or moiety by the targeting moiety is greater, more efficient, or otherwise more selective for the target molecule or moiety as compared with its recognition, binding, association, affinity, avidity, transduction or infection, and/or other interaction with a non-target molecule or moiety. For example, a CNS-specific targeting moiety can have increased and/or more efficient or selective recognition, binding, association, affinity, avidity, transduction or infection, and/or other interaction of or with CNS cells as compared to non-CNS cells. In one example embodiment the n-mer may result in increased transduction of neurons of the CNS. Embodiments of the method of generating rAAVs having engineered capsids can include stringent selection of capsid variants capable of efficient and/or homogenous transduction in at least two or more species.

Embodiments disclosed herein provide vectors and systems thereof capable of producing an engineered AAV described herein.

Embodiments disclosed herein provide cells that can be capable of producing the engineered AAV particles described herein. In some embodiments, the cells include one or more vectors or system thereof described herein.

Embodiments disclosed herein provide engineered AAVs that can include an engineered capsid described herein. In some embodiments, the engineered AAV can include a cargo polynucleotide to be delivered to a cell. In some embodiments, the engineered AAV may be used to deliver gene therapies including encoding gene editing systems. In other embodiments, the engineered AAV may be used to deliver vaccines, such as DNA or mRNA vaccines.

Embodiments disclosed herein provide formulations that can contain an engineered AAV vector or system thereof, an engineered AAV capsid, engineered AAV particles including an engineered AAV capsid described herein, and/or an engineered cell described herein that contains an engineered AAV capsid, and/or an engineered AAV vector or system thereof. In some embodiments, the formulation can also include a pharmaceutically acceptable carrier. The formulations described herein can be delivered to a subject in need thereof or a cell.

Embodiments disclosed herein also provide kits that contain one or more of the one or more of the polypeptides, polynucleotides, vectors, engineered AAV capsids, engineered AAV particles, cells, or other components described herein and combinations thereof and pharmaceutical formulations described herein. In embodiments, one or more of the polypeptides, polynucleotides, vectors, engineered AAV capsids, engineered AAV particles cells, and combinations thereof described herein can be presented as a combination kit.

Embodiments disclosed herein provide methods of using the engineered AAVs having a cell-specific tropism described herein to deliver, for example, a therapeutic polynucleotide to a cell. In this way, the engineered AAVs described herein can be used to treat and/or prevent a disease in a subject in need thereof. Embodiments disclosed herein also provide methods of delivering the engineered AAV capsids, engineered AAV virus particles, engineered AAV vectors or systems thereof and/or formulations thereof to a cell. Also provided herein are methods of treating a subject in need thereof by delivering an engineered AAV particle, engineered AAV capsid, engineered AAV capsid vector or system thereof, an engineered cell, and/or formulation thereof to the subject.

Additional features and advantages of the embodiments engineered AAVs and methods of making and using the engineered AAVs are further described herein.

Generally, described herein are compositions containing one or more CNS-specific targeting moieties that can effectively target CNS cells. In some embodiments, the CNS-specific targeting moieties can be specific to one or more types of CNS cells. CNS cells include any cell within the brain, brain stem, spinal cord, inner ear, and eyes. In some embodiments, one or more CNS-specific targeting moieties can be incorporated into a delivery vehicle, agent, or system thereof so as to provide CNS specific targeting capability to the delivery vehicle, agent, or system thereof. Exemplary delivery vehicles include, without limitation, viral particles, (e.g., AAV viral particles), micelles, liposomes, exosomes, and the like. Exemplary delivery vehicles in which the CNS targeting-moieties can be incorporated are described in greater detail elsewhere herein. The CNS-targeting moieties may also be indirectly or directly coupled to a cargo and thus provide CNS specificity to the coupled cargo. In some embodiments, the composition can be specific for a CNS-cell (e.g., as conferred by the CNS-Specific targeting moieties described herein) and have reduced specificity for a non-CNS cell (including but not limited to a liver cell). In some embodiments, the CNS targeting moiety can specifically interact with or otherwise associate with one or more AAV receptors on CNS cells, thus providing CNS specificity (or tropism). Methods of generating and identifying CNS-specific targeting moieties are described in greater detail elsewhere herein.

Described herein are targeting moieties capable of specifically targeting, binding, associating with, or otherwise interacting specifically with a CNS cell. In some embodiments, the targeting moiety effective to transduce, such as specifically transduce, a central nervous system (CNS) cell, comprises an n-mer insert optionally comprising or consisting of a P-motif, double valine motif, or both, and optionally a cargo, wherein the cargo is coupled to or is otherwise associated with the targeting moiety. Generally, n-mer inserts are short (e.g., about 3 to about 15, 20, or 25) amino acid sequences where each amino acid of the n-mer insert can be selected from any amino acid. In some embodiments, the n-mer insert is 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or 25 amino acids in length.

In certain example embodiments, where the targeting moiety comprises one or more n-mer inserts comprising or consisting of a P-motif, at least one of the P-motifs comprises or consists of the amino acid sequence XmPX1X2GTX3RXn (SEQ ID NO: 8579), wherein X1, X2, X3, Xm, and Xn, are each independently selected from any amino acid, wherein m is 0, 1, 2, or 3, and wherein n is 0, 1, 2, 3, 4, 5, 6, or 7.

The term “P-motif” as used herein refers to an n-mer inserts that contains or is the amino acid sequence XmPX1X2GTX3RXn (SEQ ID NO: 8579), wherein X1, X2, X3, Xm, and Xn, are each independently selected from any amino acid, wherein m is 0, 1, 2, or 3, and wherein n is 0, 1, 2, 3, 4, 5, 6, or 7. In some embodiments Xm is 2 and is AQ or DG. In some embodiments, the P-motif contains or is the amino acid sequence XmPX1QGTX3RXn (SEQ ID NO: 8581), where X1, X3, Xn, are each selected from any amino acid, where m is 0, 1, 2, or 3, and where n is 0, 1, 2, 3, 4, 5, 6, or 7. In some embodiments, the P-motif contains or is the amino acid sequence PX1QGTX3RXn (SEQ ID NO: 2), where X1, X3, Xn, are each selected from any amino acid and where n is 0, 1, 2, 3, 4, 5, 6, or 7. n-mer inserts are described in greater detail elsewhere herein.

In certain example embodiments, the n-mer insert is or includes a double valine motif. As used herein the term “double valine motif” refers to an n-mer insert motif that has the amino acid sequence XmX1X2VX3X4VX5Xn, wherein X1, X2, X3, Xm, and Xn, are each independently selected from any amino acid, wherein m is 0, 1, 2, or 3, and wherein n is 0, 1, 2, 3, 4, 5, 6, or 7.

In some embodiments, where an n-mer insert is or includes a P motif having the sequence amino acid sequence XmPX1X2GTX3RXn (SEQ ID NO: 8579) or XmPX1QGTX3RXn (SEQ ID NO: 8581) or a double valine motif having the sequence XmX1X2VX3X4VX5Xn, and Xm in the P motif or double valine motif is not 0 (i.e., m=1, 2 or 3) the amino acids of Xm residues of the motif can replace up to 1, 2, or 3, respectively amino acids of the polypeptide into which the n-mer insert is being incorporated, such as a targeting moiety (e.g., a polypeptide, viral polypeptide, viral capsid polypeptide, and/or the like). Incorporation of an n-mer insert in this manner can position a P motif or double valine motif as an “insertion” between any two desired contiguous amino acids of the recipient polypeptide.

In some embodiments, the two amino acid residues immediately preceding the n-mer insert are AQ or DG in a targeting moiety or a composition that is a polypeptide. In some embodiments, where Xm is 0, the two amino acid residues in the targeting moiety immediately preceding the P-motif or double valine motif are AQ or DG.

In some embodiments, Xn of the P-motif or double valine motif is 0. In some embodiments, Xn of the P-motif or double valine motif is 1. In some embodiments, Xn of the P-motif or double valine motif is 2. In some embodiments, Xn of the P-motif or double valine motif is 3. In some embodiments, Xn of the P-motif or double valine motif is 4. In some embodiments, Xn of the P-motif or double valine motif is 5. In some embodiments, Xn of the P-motif or double valine motif is 6. In some embodiments, Xn of the P-motif or double valine motif is 7. In some embodiments, Xm of the P-motif or double valine motif is 0. In some embodiments, Xm of the P motif or double valine motif is 3. In some embodiments, Xm of the P motif or double valine motif is 2. In some embodiments, Xm of the P motif or double valine motif is 1.

In certain example embodiments, X2 of the P motif is Q, P, E, or H. In certain example embodiments, X1 of the P motif is a polar amino acid, optionally a polar uncharged amino acid. In certain example embodiments, X3 of the P motif is a nonpolar amino acid. In certain example embodiments, X1 of the P motif is S, T, N, Q, C, Y or A, X2 of the P motif is Q, P, E, or H, X3 is G, A, M, W, L, V, F, or I, or any combination thereof.

In certain example embodiments, X1 of the double valine motif is R, K, V, or W. In certain example embodiments, X2 of the double valine motif is T, S, V, Y or R. In certain example embodiments, X3 of the double valine motif is G, P, or S. In certain example embodiments, X4 of the double valine motif is S, D, or T. In certain example embodiments, X5 of the double valine motif is Y, G, S, or L.

In some embodiments, Xn of the n-mer insert is 0. In some embodiments, the CNS-specific n-ner motif is as in any of Tables 1-3. In some embodiments, the CNS-specific n-mer insert is any one of the n-mer inserts in Table 6 (SEQ ID NOs.: 321-329). In some embodiments the CNS-specific n-mer insert is any one or more of the n-mer inserts selected from the group of SEQ ID NOs.: 322-324. In some embodiments the CNS-specific n-mer insert is any one or more of the n-mer inserts selected from the group of SEQ ID NOs.: 322-325. In some embodiments the CNS-specific n-mer insert is any one or more of the n-mer inserts selected from the group of SEQ ID NOs.: 322-327. In some embodiments the CNS-specific n-mer insert is any one or more of the n-mer inserts selected from the group of SEQ ID NOs.: 322-324 and 329. In some embodiments the CNS-specific n-mer insert and/or P-motif is any one or more of the n-mer inserts selected from the group of SEQ ID NOs.: 322-324. In some embodiments the CNS-specific n-mer insert any one or more of the n-mer inserts selected from the group of SEQ ID NOs.: 322-324 and 326-327. In some embodiments the CNS-specific n-mer insert is any one or more of the n-mer inserts selected from the group of SEQ ID NOs.: 322-324 and 326-328. In some embodiments the CNS-specific n-mer insert and is any one or more of the n-mer inserts selected from the group of SEQ ID NOs.: 322-324 and 328.

In certain example embodiments, at least one P-motif is selected from any one of SEQ ID NOs: 332-582 (Table 7).

In some embodiments, the n-mer insert(s) and/or at least one P-motif and/or double valine motif is selected from any one n-mer insert and/or is encoded by a polynucleotide as set forth in Table 8 (SEQ ID NOs: 583-8578). In some embodiments, the n-mer insert(s) and/or at least one P-motif and/or double valine motif is selected from any one n-mer insert and/or is encoded by a polynucleotide having a sequence according to any one of SEQ ID NOs: 583-2582. In some embodiments, the n-mer insert(s) and/or at least one P-motif and/or double valine motif is selected from any one n-mer insert and/or is encoded by a polynucleotide having a sequence according to any one of SEQ ID NOs: 2583-4582. In some embodiments, the n-mer insert(s) and/or at least one P-motif and/or double valine motif is selected from any one n-mer insert and/or is encoded by a polynucleotide having a sequence according to any one of SEQ ID NOs: 4583-6578. In some embodiments, the n-mer insert(s) and/or at least one P-motif and/or double valine motif is selected from any one n-mer insert and/or is encoded by a polynucleotide having a sequence according to any one of SEQ ID NOs: 6579-8578.

In certain example embodiments, the n-mer insert(s) and/or at least one P-motif and/or double valine motif is selected from any one n-mer insert and/or is encoded by a polynucleotide as set forth in one or more of SEQ ID NOs: 332-582 (Table 7), SEQ ID NOs: 583-8578 (Table 8), SEQ ID NOs: 3-819, 21-22, 24, 200, 202, 204, 212, 218, 224, 226, 228, 286, 234, 258, 260, 647, 649, 923, 1069, 1077, 1265, 2439, 2529, 2759, 3283, 3553, 3923, 4005, 4173, 4537, 4593, 4599, 4601, 4605, 4619, 4665, 4751, 4759, 4825, 4909, 4933, 5013, 5091, 5107, 5127, 5131, 5165, 5177, 5181, 5187, 5189, 5191, 5277, 5287, 5401, 5433, 5631, 5633, 5731, 5741, 5937, 6019, 6045, 6139, 6169, 6497, 7335, 8033, 8269, 8596-8613, (FIGS. 15A, 15B, 17A, 16A, 16B, 16C, and 19A-19C).

In some embodiments, the CNS-specific n-mer motif is and/or is encoded by a polynucleotide in FIG. 15A (SEQ ID NOs. 3-8).

In some embodiments, the CNS-specific n-mer motif is and/or is encoded by a polynucleotide in FIG. 15B (SEQ ID NOs. 19, 21-22, 24, 647, 649).

In some embodiments, the CNS-specific n-mer motif is and/or is encoded by a polynucleotide in FIG. 16A (SEQ ID NOs. 200, 202, 204, 212, 218, 224, 228, 234).

In some embodiments, the CNS-specific n-mer motif is and/or is encoded by a polynucleotide in FIG. 16B (SEQ ID NOs. 200, 204, 286, 4005, 4537, 4593, 4599, 4601).

In some embodiments, the CNS-specific n-mer motif is and/or is encoded by a polynucleotide in FIG. 16C (SEQ ID NOs. 200, 204, 226, 234, 258, 260, 923, 1265, 2759, 3923, 4173, 4593, 4599, 5277, 5433, 5741, 5937, 6019).

In some embodiments, the CNS-specific n-mer motif is and/or is encoded by a polynucleotide in FIG. 17A (SEQ ID NOs. 8596-8613).

In some embodiments, the CNS-specific n-mer motif is and/or is encoded by a polynucleotide in FIG. 19A (SEQ ID NOs. 260, 1069, 4665, 4751, 4909, 5013, 5107, 5191, 5287, 5401).

In some embodiments, the CNS-specific n-mer motif is and/or is encoded by a polynucleotide in FIG. 19B (SEQ ID NOs. 224, 4759, 4971, 5091, 5127, 5165, 5177, 5181, 5187, 5189).

In some embodiments, the CNS-specific n-mer motif is and/or is encoded by a polynucleotide in FIG. 19C (SEQ ID NOs: 2439, 2529, 3103, 3283, 3553, 4605, 4619, 4825, 4933, 5131, 5631, 5731, 6001, 971, 4629, 5209, 5233, 5341, 5367, 5461, 5547, 5959, 6045, 6139, 1077, 7335, 8033, 8269, 5633, 6169, 6497). In some embodiments, the CNS-specific n-mer motif is and/or is encoded by a polynucleotide having a sequence according to any one of SEQ ID NOs: 2439, 2529, 3103, 3283, 3553, 4605, 4619, 4825, 4933, 5131, 5631, 5731, 6001. In some embodiments, the CNS-specific n-mer motif is and/or is encoded by a polynucleotide having a sequence according to any one of SEQ ID NOs: 971, 4629, 5209, 5233, 5341, 5367, 5461, 5547, 5959, 6045, 6139. In some embodiments, the CNS-specific n-mer motif is and/or is encoded by a polynucleotide having a sequence according to any one of SEQ ID NOs: 1077, 7335, 8033, 8269, 5633, 6169, 6497.

In some embodiments, the CNS-specific n-mer insert is species specific. In other words, in some embodiments, the CNS-specific n-mer insert can facilitate CNS targeting in one species better than another species. In some embodiments the CNS-specific n-mer insert is specific for primates. In some embodiments, the CNS-specific n-mer insert is specific for human and/or non-human primates.

In some embodiments, the CNS-specific n-mer insert is capable of targeting one or more cell and/or tissue types over others within the CNS. In some embodiments, the CNS-specific insert is not effective or is less effective at targeting the dorsal root ganglion cells than one or more other cells and/or tissue types of the CNS.

In some embodiments, the CNS-specific n-mer insert is capable of targeting a specific CNS tissue type or cell type. In some embodiments, the CNS-specific n-mer insert is capable of targeting one or more specific regions of the CNS as set forth in Table 9. n some embodiments, the CNS-specific n-mer insert is capable of targeting the frontal lobe, the temporal lobe or specific region thereof (e.g., the posterior or anterior temporal lobe), the parietal lobe or specific region thereof (e.g., the posterior or anterior parietal lobe), the occipital lobe the thalamus, the corpus callosum, the cerebellum, neuroretina, RPE, brain stem, the spinal cord or a region therein (e.g., the cervical spinal cord, the thoracic spinal cord, the lumbar spinal cord), cauda equina, DRGs or subset thereof (e.g., cervical DRG, thoracic DRG, lumbar DRG), or any combination thereof.

In some embodiments, the targeting moiety can include more than one n-mer inserts, such as a CNS-specific n-mer insert described herein. In some embodiments, the targeting moiety can include 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more n-mer inserts. In some embodiments, all the n-motifs included in the targeting moiety can be the same. In some embodiments where more than one n-mer insert is included, at least two of the n-mer inserts are different from each other. In some embodiments where more than one n-mer insert is included, all the n-mer inserts are different from each other.

In one example embodiment, the targeting moiety, e.g., the CNS-specific targeting moiety, can be coupled to or otherwise associated with a cargo. In some embodiments, one or more CNS-specific targeting moieties described herein is directly attached to the cargo. In some embodiments, one or more CNS-specific targeting moieties described herein is indirectly coupled to the cargo, such as via a linker molecule.

In another example embodiment, one or more CNS-specific targeting moieties described herein is coupled to associated with a particle that is coupled to, attached to, encapsulates, and/or contains a cargo. Exemplary particles include, without limitation, viral particles (e.g., viral capsids, which is inclusive of bacteriophage capsids), polysomes, liposomes, nanoparticles, microparticles, exosomes, micelles, and the like. The term “nanoparticle” as used herein includes a nanoscale deposit of a homogenous or heterogeneous material. Nanoparticles may be regular or irregular in shape and may be formed from a plurality of co-deposited particles that form a composite nanoscale particle. Nanoparticles may be generally spherical in shape or have a composite shape formed from a plurality of co-deposited generally spherical particles. Exemplary shapes for the nanoparticles include, but are not limited to, spherical, rod, elliptical, cylindrical, disc, and the like. In some embodiments, the nanoparticles have a substantially spherical shape.

As used herein, the term “specific” when used in relation to described an interaction between two moieties, refers to non-covalent physical association of a first and a second moiety wherein the association between the first and second moieties is at least 2 times as strong, at least 5 times as strong as, at least 10 times as strong as, at least 50 times as strong as, at least 100 times as strong as, or stronger than the association of either moiety with most or all other moieties present in the environment in which binding occurs. Binding of two or more entities may be considered specific if the equilibrium dissociation constant, Kd, is 10−3 M or less, 10−4 M or less, 10−5 M or less, 10−6 M or less, 10−7 M or less, 10−8 M or less, 10−9 M or less, 10−10 M or less, 10−11 M or less, or 10−12 M or less under the conditions employed, e.g., under physiological conditions such as those inside a cell or consistent with cell survival. In some embodiments, specific binding can be accomplished by a plurality of weaker interactions (e.g., a plurality of individual interactions, wherein each individual interaction is characterized by a Kd of greater than 10−3 M). In some embodiments, specific binding, which can be referred to as “molecular recognition,” is a saturable binding interaction between two entities that is dependent on complementary orientation of functional groups on each entity. Examples of specific interactions include primer-polynucleotide interaction, aptamer-aptamer target interactions, antibody-antigen interactions, avidin-biotin interactions, ligand-receptor interactions, metal-chelate interactions, hybridization between complementary nucleic acids, etc.

In some embodiments, in addition to the n-mer insert(s) the targeting moiety can include a polypeptide, a polynucleotide, a lipid, a polymer, a sugar, or a combination thereof.

In some embodiments, the targeting moiety is incorporated into a viral polypeptide, such as a capsid polypeptide, including but not limited to lentiviral, adenoviral, AAV, bacteriophage, and retroviral polypeptides. In some embodiments, the n-mer insert is inserted between two amino acids of the viral polypeptide such that the n-mer insert is external (i.e., is presented on the surface of) to a viral capsid.

In some embodiments, the composition containing one or more of the CNS-specific targeting moieties described herein has increased muscle cell potency, muscle cell specificity, reduced immunogenicity, or any combination thereof.

Cargos can include any molecule that is capable of being coupled to or associated with the CNS-specific targeting moieties described herein. Cargos can include, without limitation, nucleotides, oligonucleotides, polynucleotides, amino acids, peptides, polypeptides, riboproteins, lipids, sugars, pharmaceutically active agents (e.g., drugs, imaging and other diagnostic agents, and the like), chemical compounds, and combinations thereof. In some embodiments, the cargo is DNA, RNA, amino acids, peptides, polypeptides, antibodies, aptamers, ribozymes, guide sequences for ribozymes that inhibit translation or transcription of essential tumor proteins and genes, hormones, immunomodulators, antipyretics, anxiolytics, antipsychotics, analgesics, antispasmodics, anti-inflammatories, anti-histamines, anti-infectives, radiation sensitizers, chemotherapeutics, radioactive compounds, imaging agents, and combinations thereof.

The CNS-specific targeting moieties can be encoded in whole or in part by a polynucleotide. The encoding polynucleotides can be included in one or more vectors (or vector systems) that can be used to generate targeting moieties and compositions thereof that include the CNS-specific n-mer insert(s) Exemplary encoding polynucleotides, vectors, vector systems, and recombinant engineering techniques are described in greater detail herein and/or are generally known in the art and can be adapted for use with the targeting moieties and compositions thereof described herein.

In some embodiments, the cargo is capable of treating or preventing a CNS disease or disorder. Exemplary CNS diseases and disorders are described elsewhere herein.

Representative cargo molecules that may be delivered using the compositions disclosed herein include, but are not limited to, nucleic acids, polynucleotides, proteins, polypeptides, polynucleotide/polypeptide complexes, small molecules, sugars, or a combination thereof. Cargos that can be delivered in accordance with the systems and methods described herein include, but are not necessarily limited to, biologically active agents, including, but not limited to, therapeutic agents, imaging agents, and monitoring agents. A cargo may be an exogenous material or an endogenous material. In some embodiments, the cargo can be a “gene of interest”.

In some embodiments the cargos, in addition to the cargo of interest that is to be delivered to a CNS cell, the cargo contains one or more binding sites specific for one or more RNAi molecules that are endogenous to one or more non-target (such as non-CNS cells). In this context herein “non-target cells” refers to cells to which delivery or activity of a cargo is not desired. In other words, “non-target cells” are cells in which the targeting moiety, such as the CNS specific targeting moiety, and compositions thereof do not specifically target. When a cargo having one more specific binding sites for one or more RNAi molecules that are endogenous to one or more non-target cells is delivered to non-target cells, the endogenous RNAi molecule of the non-target cell degrades the cargo molecule via the endogenous RNAi pathway. In this way off-target toxicity or other deleterious off-target events can be reduced. This can also be referred to as a mechanism of detargeting the composition to non-target cells.

In some embodiments, the detargeting component of a cargo molecule is one or more specific binding sites for one or more RNAi molecules that are endogenous to one or more non-target cells. In some embodiments, the RNAi molecules that are endogenous to one or more non-target cells are specifically expressed in those non-target cell(s). In some embodiments, the RNAi molecules that are endogenous to one or more non-target cells are enriched or have greater expression in non-target cell(s) as compared to target cells, such as CNS cells. In some embodiments, the more RNAi molecules that are endogenous to one or more non-target cells are not expressed in a target cell, such as a CNS cell. Exemplary RNAi molecule types are described elsewhere herein. In some embodiments, the one or more RNAi molecules that are endogenous to one or more non-target cells are microRNAs. In some embodiments, the non-target cell(s) are liver cell(s) and/or dorsal root ganglion neuron(s). In some embodiments, the RNAi molecules are miR183, miR-182, miR122, miR122a, miR99a, miR-26a, miR199a, miRNA-143, miR101a, miR-30c, or any combination thereof.

Other exemplary detargeting RNAi molecules are described in e.g., International Patent Application Pub. WO2021231579A1 and WO2020132455A1, https://www-hebertpub-com.ezp-prod1.hul.harvard.edu/doi/pdf/10.1089%2Fnat.2015.0543.

In some embodiments, the cargo is a cargo polynucleotide. As used herein, “nucleic acid,” “nucleotide sequence,” and “polynucleotide” can be used interchangeably herein and can generally refer to a string of at least two base-sugar-phosphate combinations and refers to, among others, single- and double-stranded DNA, DNA that is a mixture of single- and double-stranded regions, single- and double-stranded RNA, and RNA that is mixture of single- and double-stranded regions, hybrid molecules comprising DNA and RNA that may be single-stranded or, more typically, double-stranded or a mixture of single- and double-stranded regions. In addition, polynucleotide as used herein can refer to triple-stranded regions comprising RNA or DNA or both RNA and DNA. The strands in such regions can be from the same molecule or from different molecules. The regions may include all of one or more of the molecules, but more typically involve only a region of some of the molecules. One of the molecules of a triple-helical region often is an oligonucleotide. “Polynucleotide” and “nucleic acids” also encompasses such chemically, enzymatically, or metabolically modified forms of polynucleotides, as well as the chemical forms of DNA and RNA characteristic of viruses and cells, including simple and complex cells, inter alia. For instance, the term polynucleotide as used herein can include DNAs or RNAs as described herein that contain one or more modified bases. Thus, DNAs or RNAs including unusual bases, such as inosine, or modified bases, such as tritylated bases, to name just two examples, are polynucleotides as the term is used herein. “Polynucleotide”, “nucleotide sequences” and “nucleic acids” also includes PNAs (peptide nucleic acids), phosphorothioates, and other variants of the phosphate backbone of native nucleic acids. Natural nucleic acids have a phosphate backbone, artificial nucleic acids can contain other types of backbones, but contain the same bases. Thus, DNAs or RNAs with backbones modified for stability or for other reasons are “nucleic acids” or “polynucleotides” as that term is intended herein. As used herein, “nucleic acid sequence” and “oligonucleotide” also encompasses a nucleic acid and polynucleotide as defined elsewhere herein.

As used herein, “deoxyribonucleic acid (DNA)” and “ribonucleic acid (RNA)” can generally refer to any polyribonucleotide or polydeoxyribonucleotide, which may be unmodified RNA or DNA or modified RNA or DNA. RNA can be in the form of non-coding RNA, including but not limited to, tRNA (transfer RNA), snRNA (small nuclear RNA), rRNA (ribosomal RNA), anti-sense RNA, RNAi (RNA interference construct), siRNA (short interfering RNA), microRNA (miRNA), or ribozymes, aptamers, guide RNA (gRNA), or coding mRNA (messenger RNA).

In some embodiments, the cargo polynucleotide is DNA. In some embodiments, the cargo polynucleotide is RNA. In some embodiments, the cargo polynucleotide is a polynucleotide (a DNA or an RNA) that encodes an RNA and/or a polypeptide. As used herein with reference to the relationship between DNA, cDNA, cRNA, RNA, protein/peptides, and the like “corresponding to” or “encoding” (used interchangeably herein) refers to the underlying biological relationship between these different molecules. As such, one of skill in the art would understand that operatively “corresponding to” can direct them to determine the possible underlying and/or resulting sequences of other molecules given the sequence of any other molecule which has a similar biological relationship with these molecules. For example, from a DNA sequence an RNA sequence can be determined and from an RNA sequence a cDNA sequence can be determined.

In some embodiments, the systems described herein comprise a polynucleotide encoding a gene of interest. As used herein, the term “gene of interest” refers to the gene selected for a particular purpose and being desired of delivery by a system or vesicle of the present invention. A gene of interest inserted into one or more regions a vector, such as an expression vector (including one or more of the engineered delivery vesicle generation system vectors) such that when expressed in a target cell or recipient cell it can be expressed and produce a desired gene product and/or be packaged as cargo in an engineered delivery vesicle of the present invention. It will be appreciated that other cargos specifically identified can also be genes of interest. For example, a polynucleotide encoding a Cas effector can be a gene of interest in this context where it is desired to deliver a Cas effector to a cell, for example.

In one embodiment, the gene of interest encodes a gene that provides a therapeutic function for the treatment of a disease. In some embodiments, the gene of interest can also be a vaccinating gene, that is to say a gene encoding an antigenic peptide that is capable of generating an immune response in humans or animals. This may include, but is not necessarily limited to, peptide antigens specific for viral and bacterial infections, or may be tumor-specific. In some embodiments, a gene of interest is a gene which confers a desired phenotype. As the embodiments described herein focus on improved methods for packaging and delivery of a gene of interest, the particular gene of interest is not limiting and the technology can generally be used to deliver any gene of interest generally recognized by one of ordinary skill in the art as deliverable using a lentiviral system. One skilled in the art can design a construct containing any gene that they are interested in. Designing a construct containing a known gene of interest can be performed without undue experimentation. One of ordinary skill in the art routinely selects genes of interest. For example, the GenBank public database has existed since 1982 and is routinely used by persons of ordinary skill in the art relevant to the presently claimed method. As of June 2019, GenBank contains 2013,383,758 loci, 329,835,282,370 bases, from 213,383,758 reported sequences. The nucleotide sequences are from more than 300,000 organisms with supporting bibliographic and biological annotation. GenBank is only example, as there are many other known repositories of sequence information.

In some embodiments, the gene of interest may be, for example, a synthetic RNA/DNA sequence, a codon optimized RNA/DNA sequence, a recombinant RNA/DNA sequence (i.e., prepared by use of recombinant DNA techniques), a cDNA sequence or a partial genomic DNA sequence, including combinations thereof. Preferably, this is in the sense orientation. Preferably, the sequence is, comprises, or is transcribed from cDNA. The gene(s) of interest may also be referred to herein as “heterologous sequence(s)” “heterologous gene(s)” or “transgene(s)”.

In some embodiments, the gene of interest may confer some therapeutic benefit. The terms “therapeutic agent”, “therapeutic capable agent” or “treatment agent” are used interchangeably and refer to a molecule or compound that confers some beneficial effect upon administration to a subject. The beneficial effect includes enablement of diagnostic determinations; amelioration of a disease, symptom, disorder, or pathological condition; reducing or preventing the onset of a disease, symptom, disorder, or condition; and generally counteracting a disease, symptom, disorder or pathological condition.

Preferably, the therapeutic agent may be administered in a therapeutically effective amount of the active components. The term “therapeutically effective amount” refers to an amount which can elicit a biological or medicinal response in a tissue, system, animal, or human that is being sought by a researcher, veterinarian, medical doctor or other clinician, and in particular can prevent or alleviate one or more of the local or systemic symptoms or features of a disease or condition being treated. In some embodiments, the disease or condition is a disease or condition of or affecting the CNS or cell thereof. Exemplary diseases and disorders of and/or affecting the CNS are described in greater detail elsewhere herein.

In some embodiments, the gene of interest may lead to altered expression in the target cell. As used herein the term “altered expression” may particularly denote altered production of the recited gene products by a cell. As used herein, the term “gene product(s)” includes RNA transcribed from a gene (e.g., mRNA), or a polypeptide encoded by a gene or translated from RNA.

Also, “altered expression” as intended herein may encompass modulating the activity of one or more endogenous gene products. Accordingly, “altered expression”, “altering expression”, “modulating expression”, or “detecting expression” or similar may be used interchangeably with respectively “altered expression or activity”, “altering expression or activity”, “modulating expression or activity”, or “detecting expression or activity” or similar. As used herein, “modulating” or “to modulate” generally means either reducing or inhibiting the activity of a target or antigen, or alternatively increasing the activity of the target or antigen, as measured using a suitable in vitro, cellular, or in vivo assay. In particular, “modulating” or “to modulate” can mean either reducing or inhibiting the (relevant or intended) activity of, or alternatively increasing the (relevant or intended) biological activity of the target or antigen, as measured using a suitable in vitro, cellular or in vivo assay (which will usually depend on the target or antigen involved), by at least 5%, at least 10%, at least 25%, at least 50%, at least 60%, at least 70%, at least 80%, or 90% or more, compared to activity of the target or antigen in the same assay under the same conditions but without the presence of the inhibitor/antagonist agents or activator/agonist agents described herein.

As will be clear to the skilled person, “modulating” can also involve effecting a change (which can either be an increase or a decrease) in affinity, avidity, specificity and/or selectivity of a target or antigen, for one or more of its targets compared to the same conditions but without the presence of a modulating agent. Again, this can be determined in any suitable manner and/or using any suitable assay known per se, depending on the target. In particular, an action as an inhibitor/antagonist or activator/agonist can be such that an intended biological or physiological activity is increased or decreased, respectively, by at least 5%, at least 10%, at least 25%, at least 50%, at least 60%, at least 70%, at least 80%, or 90% or more, compared to the biological or physiological activity in the same assay under the same conditions but without the presence of the inhibitor/antagonist agent or activator/agonist agent. Modulating can also involve activating the target or antigen or the mechanism or pathway in which it is involved.

In certain example embodiments, the one or more polynucleotides, such as cargo polynucleotides, may encode one or more interference RNAs. Interference RNAs are RNA molecules capable of suppressing gene expressions. Example types of interference RNAs include small interfering RNA (siRNA), micro RNA (miRNA), and short hairpin RNA (shRNA). It will be appreciated that a cargo can include an RNAi molecule to be delivered to a target cell as well as a binding site for an endogenous RNAi molecule of a non-target cell. RNAi molecules that are to be delivered to a target cell as cargo can be e.g., therapeutic.

In certain example embodiments, the interference RNA may be a siRNAs. Small interfering RNA (siRNA) molecules are capable of inhibiting target gene expression by interfering RNA. siRNAs may be chemically synthesized, or may be obtained by in vitro transcription, or may be synthesized in vivo in target cell. siRNAs may comprise double-stranded RNA from 15 to 40 nucleotides in length and can contain a protuberant region 3′ and/or 5′ from 1 to 6 nucleotides in length. Length of protuberant region is independent from total length of siRNA molecule. siRNAs may act by post-transcriptional degradation or silencing of target messenger. In some cases, the exogenous polynucleotides encode shRNAs. In shRNAs, the antiparallel strands that form siRNA are connected by a loop or hairpin region.

The RNAi molecules delivered as cargo can, in some embodiments, suppress expression of genes and/or degrade a gene product (e.g., a transcript) related to a CNS disease, eye disease, or inner ear disease. Therefore, in some embodiments, the RNAi cargo treats or prevents a CNS disease, eye disease, or inner ear disease or symptom thereof.

The interference RNA (e.g., siRNA) may suppress expression of genes to promote long term survival and functionality of cells after transplanted to a subject. In some examples, the interference RNAs suppress genes in TGFβ pathway, e.g., TGFβ, TGFβ receptors, and SMAD proteins. In some examples, the interference RNAs suppress genes in colony-stimulating factor 1 (CSF1) pathway, e.g., CSF1 and CSF1 receptors. In certain embodiments, the one or more interference RNAs suppress genes in both the CSF1 pathway and the TGFβ pathway. TGFβ pathway genes may comprise one or more of ACVR1, ACVR1C, ACVR2A, ACVR2B, ACVRL1, AMH, AMHR2, BMP2, BMP4, BMP5, BMP6, BMP7, BMP8A, BMP8B, BMPR1A, BMPR1B, BMPR2, CDKN2B, CHRD, COMP, CREBBP, CUL1, DCN, E2F4, E2F5, EP300, FST, GDF5, GDF6, GDF7, ID1, ID2, ID3, ID4, IFNG, INHBA, INHBB, INHBC, INHBE, LEFTY1, LEFTY2, LOC728622, LTBP1, MAPK1, MAPK3, MYC, NODAL, NOG, PITX2, PPP2CA, PPP2CB, PPP2R1A, PPP2R1B, RBL1, RBL2, RBX1, RHOA, ROCK1, ROCK2, RPS6KB1, RPS6KB2, SKP1, SMAD1, SMAD2, SMAD3, SMAD4, SMAD5, SMAD6, SMAD7, SMAD9, SMURF1, SMURF2, SP1, TFDP1, TGFB1, TGFB2, TGFB3, TGFBR1, TGFBR2, THBS1, THBS2, THBS3, THBS4, TNF, ZFYVE16, and/or ZFYVE9.

In some embodiments, the cargo polynucleotide is an RNAi molecule, antisense molecule, and/or a gene silencing oligonucleotide or a polynucleotide that encodes an RNAi molecule, antisense molecule, and/or gene silencing oligonucleotide.

As used herein, “gene silencing oligonucleotide” refers to any oligonucleotide that can alone or with other gene silencing oligonucleotides utilize a cell's endogenous mechanisms, molecules, proteins, enzymes, and/or other cell machinery or exogenous molecule, agent, protein, enzyme, and/or polynucleotide to cause a global or specific reduction or elimination in gene expression, RNA level(s), RNA translation, RNA transcription, that can lead to a reduction or effective loss of a protein expression and/or function of a non-coding RNA as compared to wild-type or a suitable control. This is synonymous with the phrase “gene knockdown” Reduction in gene expression, RNA level(s), RNA translation, RNA transcription, and/or protein expression can range from about 100, 99, 98, 97, 96, 95, 94, 93, 92, 91, 90, 89, 88, 87, 86, 85, 84, 83, 82, 81, 80, 79, 78, 77, 76, 75, 74, 73, 72, 71, 70, 69, 68, 67, 66, 65, 64, 63, 62, 61, 60, 59, 58, 57, 56, 55, 54, 53, 52, 51, 50, 49, 48, 47, 46, 45, 44, 43, 42 41, 40, 39, 38, 37, 36, 35, 34, 33, 32, 31, 30, 29, 28, 27, 26, 25, 24, 23, 22, 21, 20, 19, 18, 17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, to 1% or less reduction. “Gene silencing oligonucleotides” include, but are not limited to, any antisense oligonucleotide, ribozyme, any oligonucleotide (single or double stranded) used to stimulate the RNA interference (RNAi) pathway in a cell (collectively RNAi oligonucleotides), small interfering RNA (siRNA), microRNA, and short-hairpin RNA (shRNA). Commercially available programs and tools are available to design the nucleotide sequence of gene silencing oligonucleotides for a desired gene, based on the gene sequence and other information available to one of ordinary skill in the art.

In some embodiments, the cargo molecule is a therapeutic polynucleotide. Therapeutic polynucleotides are those that provide a therapeutic effect when delivered to a recipient cell. The polynucleotide can be a toxic polynucleotide (a polynucleotide that when transcribed or translated results in the death of the cell) or polynucleotide that encodes a lytic peptide or protein. In embodiments, delivery vesicles having a toxic polynucleotide as a cargo molecule can act as an antimicrobial or antibiotic. This is discussed in greater detail elsewhere herein. In some embodiments, the cargo molecule can be exogenous to the producer cell and/or a first cell. In some embodiments, the cargo molecule can be endogenous to the producer cell and/or a first cell. In some embodiments, the cargo molecule can be exogenous to the recipient cell and/or a second cell. In some embodiments, the cargo molecule can be endogenous to the recipient cell and/or second cell.

As described herein the cargo polynucleotide can be any polynucleotide endogenous or exogenous to the eukaryotic cell. For example, the cargo polynucleotide can be a polynucleotide residing in the nucleus of the eukaryotic cell. The cargo polynucleotide can be a sequence coding a gene product (e.g., a protein) or a non-coding sequence (e.g., a regulatory polynucleotide).

In some embodiments, the cargo polynucleotide is a DNA or RNA (e.g., a mRNA) vaccine.

In certain example embodiments, the polynucleotide may be an aptamer. In certain embodiments, the one or more agents is an aptamer. Nucleic acid aptamers are nucleic acid species that have been engineered through repeated rounds of in vitro selection or equivalently, SELEX (systematic evolution of ligands by exponential enrichment) to bind to various molecular targets such as small molecules, proteins, nucleic acids, cells, tissues, and organisms. Nucleic acid aptamers have specific binding affinity to molecules through interactions other than classic Watson-Crick base pairing. Aptamers are useful in biotechnological and therapeutic applications as they offer molecular recognition properties similar to antibodies. In addition to their discriminate recognition, aptamers offer advantages over antibodies as they can be engineered completely in a test tube, are readily produced by chemical synthesis, possess desirable storage properties, and elicit little or no immunogenicity in therapeutic applications. In certain embodiments, RNA aptamers may be expressed from a DNA construct. In other embodiments, a nucleic acid aptamer may be linked to another polynucleotide sequence. The polynucleotide sequence may be a double stranded DNA polynucleotide sequence. The aptamer may be covalently linked to one strand of the polynucleotide sequence. The aptamer may be ligated to the polynucleotide sequence. The polynucleotide sequence may be configured, such that the polynucleotide sequence may be linked to a solid support or ligated to another polynucleotide sequence.

Aptamers, like peptides generated by phage display or monoclonal antibodies (“mAbs”), are capable of specifically binding to selected targets and modulating the target's activity, e.g., through binding, aptamers may block their target's ability to function. A typical aptamer is 10-15 kDa in size (30-45 nucleotides), binds its target with sub-nanomolar affinity, and discriminates against closely related targets (e.g., aptamers will typically not bind other proteins from the same gene family). Structural studies have shown that aptamers are capable of using the same types of binding interactions (e.g., hydrogen bonding, electrostatic complementarity, hydrophobic contacts, steric exclusion) that drives affinity and specificity in antibody-antigen complexes.

Aptamers have a number of desirable characteristics for use in research and as therapeutics and diagnostics including high specificity and affinity, biological efficacy, and excellent pharmacokinetic properties. In addition, they offer specific competitive advantages over antibodies and other protein biologics. Aptamers are chemically synthesized and are readily scaled as needed to meet production demand for research, diagnostic or therapeutic applications. Aptamers are chemically robust. They are intrinsically adapted to regain activity following exposure to factors such as heat and denaturants and can be stored for extended periods (>1 yr) at room temperature as lyophilized powders. Not being bound by a theory, aptamers bound to a solid support or beads may be stored for extended periods.

Oligonucleotides in their phosphodiester form may be quickly degraded by intracellular and extracellular enzymes such as endonucleases and exonucleases. Aptamers can include modified nucleotides conferring improved characteristics on the ligand, such as improved in vivo stability or improved delivery characteristics. Examples of such modifications include chemical substitutions at the ribose and/or phosphate and/or base positions. SELEX identified nucleic acid ligands containing modified nucleotides are described, e.g., in U.S. Pat. No. 5,660,985, which describes oligonucleotides containing nucleotide derivatives chemically modified at the 2′ position of ribose, 5 position of pyrimidines, and 8 position of purines, U.S. Pat. No. 5,756,703 which describes oligonucleotides containing various 2′-modified pyrimidines, and U.S. Pat. No. 5,580,737 which describes highly specific nucleic acid ligands containing one or more nucleotides modified with 2′-amino (2′-NH2), 2′-fluoro (2′-F), and/or 2′-O-methyl (2′-OMe) substituents. Modifications of aptamers may also include modifications at exocyclic amines, substitution of 4-thiouridine, substitution of 5-bromo or 5-iodo-uracil; backbone modifications, phosphorothioate or allyl phosphate modifications, methylations, and unusual base-pairing combinations such as the isobases isocytidine and isoguanosine. Modifications can also include 3′ and 5′ modifications such as capping. As used herein, the term phosphorothioate encompasses one or more non-bridging oxygen atoms in a phosphodiester bond replaced by one or more sulfur atoms. In further embodiments, the oligonucleotides comprise modified sugar groups, for example, one or more of the hydroxyl groups is replaced with halogen, aliphatic groups, or functionalized as ethers or amines. In one embodiment, the 2′-position of the furanose residue is substituted by any of an O-methyl, O-alkyl, 0-allyl, S-alkyl, S-allyl, or halo group. Methods of synthesis of 2′-modified sugars are described, e.g., in Sproat, et al., Nucl. Acid Res. 19:733-738 (1991); Cotten, et al, Nucl. Acid Res. 19:2629-2635 (1991); and Hobbs, et al, Biochemistry 12:5138-5145 (1973). Other modifications are known to one of ordinary skill in the art. In certain embodiments, aptamers include aptamers with improved off-rates as described in International Patent Publication No. WO 2009012418, “Method for generating aptamers with improved off-rates,” incorporated herein by reference in its entirety. In certain embodiments aptamers are chosen from a library of aptamers. Such libraries include, but are not limited to, those described in Rohloffet al., “Nucleic Acid Ligands With Protein-like Side Chains: Modified Aptamers and Their Use as Diagnostic and Therapeutic Agents,” Molecular Therapy Nucleic Acids (2014) 3, e201. Aptamers are also commercially available (see e.g., SomaLogic, Inc., Boulder, Colorado). In certain embodiments, the present invention may utilize any aptamer containing any modification as described herein.

In certain other example embodiments, the polynucleotide may be a ribozyme or other enzymatically active polynucleotide.

In some embodiments, the cargo is a biologically active agent. Biologically active agents include any molecule that induces, directly or indirectly, an effect in a cell. Biologically active agents may be a protein, a nucleic acid, a small molecule, a carbohydrate, and a lipid. When the cargo is or comprises a nucleic acid, the nucleic acid may be a separate entity from the DNA-based carrier. In these embodiments, the DNA-based carrier is not itself the cargo. In other embodiments, the DNA-based carrier may itself comprise a nucleic acid cargo. Therapeutic agents include, without limitation, chemotherapeutic agents, anti-oncogenic agents, anti-angiogenic agents, tumor suppressor agents, anti-microbial agents, enzyme replacement agents, gene expression modulating agents and expression constructs comprising a nucleic acid encoding a therapeutic protein or nucleic acid, and vaccines. Therapeutic agents may be peptides, proteins (including enzymes, antibodies and peptidic hormones), ligands of cytoskeleton, nucleic acid, small molecules, non-peptidic hormones and the like. To increase affinity for the nucleus, agents may be conjugated to a nuclear localization sequence. Nucleic acids that may be delivered by the method of the invention include synthetic and natural nucleic acid material, including DNA, RNA, transposon DNA, antisense nucleic acids, dsRNA, siRNAs, transcription RNA, messenger RNA, ribosomal RNA, small nucleolar RNA, microRNA, ribozymes, plasmids, expression constructs, etc.

Imaging agents include contrast agents, such as ferrofluid-based MRI contrast agents and gadolinium agents for PET scans, fluorescein isothiocyanate and 6-TAMARA. Monitoring agents include reporter probes, biosensors, green fluorescent protein, and the like. Reporter probes include photo-emitting compounds, such as phosphors, radioactive moieties, and fluorescent moieties, such as rare earth chelates (e.g., europium chelates), Texas Red, rhodamine, fluorescein, FITC, fluor-3, 5 hexadecanoyl fluorescein, Cy2, fluor X, Cy3, Cy3.5, Cy5, Cy5.5, Cy7, dansyl, phycocrytherin, phycocyanin, spectrum orange, spectrum green, and/or derivatives of any one or more of the above. Biosensors are molecules that detect and transmit information regarding a physiological change or process, for instance, by detecting the presence or change in the presence of a chemical. The information obtained by the biosensor typically activates a signal that is detected with a transducer. The transducer typically converts the biological response into an electrical signal. Examples of biosensors include enzymes, antibodies, DNA, receptors, and regulator proteins used as recognition elements, which can be used either in whole cells or isolated and used independently (D'Souza, 2001, Biosensors and Bioelectronics 16:337-353).

One or two or more different cargoes may be delivered by the delivery particles described herein.

In some embodiments, the cargo may be linked to one or more envelope proteins by a linker, as described elsewhere herein. A suitable linker may include, but is not necessarily limited to, a glycine-serine linker. In some embodiments, the glycine-serine linker is (GGS)3 (SEQ ID NO: 27).

In some embodiments, the cargo comprises a ribonucleoprotein. In specific embodiments, the cargo comprises a genetic modulating agent.

As used herein the term “altered expression” may particularly denote altered production of the recited gene products by a cell. As used herein, the term “gene product(s)” includes RNA transcribed from a gene (e.g., mRNA), or a polypeptide encoded by a gene or translated from RNA.

In some embodiments, the cargo is a polynucleotide encoding a gene modifying system. Gene modifying systems may include, but are not limited to, zinc finger nucleases, TALE nucleases (TALENs), meganucleases, RNAi, and CRISPR-Cas systems. The generic modifying systems can, upon delivery as cargo to a target cell, such as a CNS cell, result in a genetic modification in that cell. In some embodiments, the genetic modification cures, treats, and/or prevents a disease or disorder, such as a CNS, eye, or inner ear disease or disorder.

The CRISPR-Cas system may include a Class 1 comprising a Type I, Type III or Type IV Cas proteins as described in Makarova et al. “Evolutionary classification of CRISPR-Cas systems: a burst of class 2 and derived variants” Nature Reviews Microbiology, 18:67-81 (February 2020), and incorporated in its entirety herein by reference, and particularly as described in FIG. 1, p. 326. polynucleotide modifying system or component(s) thereof. The CRISPR-Cas system may also be a Class 2 CRISPR-Cas system such as a Type II, Type V, or Type VI system, which are described in Makarova et al. “Evolutionary classification of CRISPR-Cas systems: a burst of class 2 and derived variants” Nature Reviews Microbiology, 18:67-81 (February 2020), incorporated herein by reference.

CRISPR-Cas systems may also include further modified systems where the Cas protein is rendered catalytically inactive and fused to other functional domains or polypeptides to derive new functions. Example modified systems include base editor, primer editors, and CRISPR-associated transposase (CAST) systems.

Example base editing systems include DNA base editors (Komor et al. 2016 Nature. 533:420-424; Nishida et a. 2016. Science 353; Gaudelli et al. 2017 Nature 551:464-471; Mok et al., Cell. 182, 463-480 (2020); Koblan et al., Nature 589, 608-614 (2021); Rees and Liu. 2018. 19(12):770-788. doi: 10.1038/s41576-018-0059-1; Song et al., Nat Biomed Eng. 2020 Jan; 4(1):125-130. doi: 10.1038/s41551-019-0357-8; Koblan et al. 2018. 6(9):843-846. doi: 10.1038/nbt.4172; Thuronyi et al., Nat Biotechnol. 2019 September; 37(9):1070-1079. doi: 10.1038/s41587-019-0193-0; Doman et al., Nat Biotechnol. 2020 May; 38(5):620-628. doi: 10.1038/s41587-020-0414-6; Richter et al., Nat Biotechnol. 2020 July; 38(7):883-891. doi: 10.1038/s41587-020-0453-z; Huang et al., Nat Protoc. 2021 February; 16(2):1089-1128. doi: 10.1038/s41596-020-00450-9; Koblan et al., Nat Biotechnol. 2021 Jun. 28. doi: 10.1038/s41587-021-00938-z; WO 2018/213708, WO 2018/213726, WO/2019/126709, WO/2019/1267; WO/2019/126762) and RNA base editors (Cox et al. 2017. Science 358:1019-1027, Rees and Liu. 2018. 19(12):770-788. doi: 10.1038/s41576-018-0059-1; Abudayyeh 00, et al., A cytosine deaminase for programmable single-base RNA editing, Science 26 Jul. 2019; WO 2019/005883, WO 2019/005886, WO 2019/071048, PCT/US2018/0579, PCT US/2018/067207).

Example prime editing systems include those as described in Anzalone et al. 2019 Nature 576:149-157; Gao et al. 2021 Genome Biol. 22:83; Jang et al. 2021 Nature Biomed. Eng. doi.org/10.1038/s41551-021-00788-9; WO 2021/072328; WO 2020/191248; WO 2020/191249; WO 2020/191239; WO 2020/191245; WO 2020/191246; WO 2020/191241; WO 2020/191171; WO 202/191153; WO 2020/191242; WO 2020/191233; WO 2020/191243; and WO 2020/191234.

Example CAST systems include those as described in Klompe et al. 2019 Nature 571(7764):219-225; Strecker et al. 2019 Science 365:48-53; and Saito et al. 2021 Cell 184:2441-2453; WO 2020/131862; WO 2019090173; WO 2019090174; WO 2019090175, and WO 2019/241452.

Example non-LTR retrotransposon systems include those as described in WO2021/102042.

Example Cas-associated ligase systems include those as described in WO2021/133977.

For modified CRISPR-Cas system that exceed the cargo capacity for a delivery vehicle incorporating the targeting moieties disclosed herein, a split-intein approach to divide CBE and ABE into reconstitutable halves, is described in Levy et al. Nature Biomedical Engineering doi.org/10.1038/s41441-019-0505-5 (2019), which is incorporated herein by reference.

Zinc Finger proteins can comprise a functional domain. The first synthetic zinc finger nucleases (ZFNs) were developed by fusing a ZF protein to the catalytic domain of the Type IIS restriction enzyme FokI. (Kim, Y. G. et al., 1994, Chimeric restriction endonuclease, Proc. Natl. Acad. Sci. U.S.A. 91, 883-887; Kim, Y. G. et al., 1996, Hybrid restriction enzymes: zinc finger fusions to Fok I cleavage domain. Proc. Natl. Acad. Sci. U.S.A. 93, 1156-1160). Increased cleavage specificity can be attained with decreased off target activity by use of paired ZFN heterodimers, each targeting different nucleotide sequences separated by a short spacer. (Doyon, Y. et al., 2011, Enhancing zinc-finger-nuclease activity with improved obligate heterodimeric architectures. Nat. Methods 8, 74-79). ZFPs can also be designed as transcription activators and repressors and have been used to target many genes in a wide variety of organisms. Exemplary methods of genome editing using ZFNs can be found for example in U.S. Pat. Nos. 6,534,261, 6,607,882, 6,746,838, 6,794,136, 6,824,978, 6,866,997, 6,933,113, 6,979,539, 7,013,219, 7,030,215, 7,220,719, 7,241,573, 7,241,574, 7,585,849, 7,595,376, 6,903,185, and 6,479,626, all of which are specifically incorporated by reference.

In some embodiments, a meganuclease or system thereof can be used to modify a polynucleotide. Meganucleases, which are endodeoxyribonucleases characterized by a large recognition site (double-stranded DNA sequences of 12 to 40 base pairs). Exemplary methods for using meganucleases can be found in U.S. Pat. Nos. 8,163,514, 8,133,697, 8,021,867, 8,119,361, 8,119,381, 8,124,369, and 8,129,134, which are specifically incorporated herein by reference.

In certain embodiments, the genetic modifying agent is RNAi (e.g., shRNA). As used herein, “gene silencing” or “gene silenced” in reference to an activity of an RNAi molecule, for example a siRNA or miRNA refers to a decrease in the mRNA level in a cell for a target gene by at least about 5%, about 10%, about 20%, about 30%, about 40%, about 50%, about 60%, about 70%, about 80%, about 90%, about 95%, about 99%, about 100% of the mRNA level found in the cell without the presence of the miRNA or RNA interference molecule. In one preferred embodiment, the mRNA levels are decreased by at least about 70%, about 80%, about 90%, about 95%, about 99%, about 100%.

As used herein, the term “RNAi” refers to any type of interfering RNA, including but not limited to, siRNAi, shRNAi, endogenous microRNA and artificial microRNA. For instance, it includes sequences previously identified as siRNA, regardless of the mechanism of down-stream processing of the RNA (i.e., although siRNAs are believed to have a specific method of in vivo processing resulting in the cleavage of mRNA, such sequences can be incorporated into the vectors in the context of the flanking sequences described herein). The term “RNAi” can include both gene silencing RNAi molecules, and also RNAi effector molecules which activate the expression of a gene.

As used herein, a “siRNA” refers to a nucleic acid that forms a double stranded RNA, which double stranded RNA has the ability to reduce or inhibit expression of a gene or target gene when the siRNA is present or expressed in the same cell as the target gene. The double stranded RNA siRNA can be formed by the complementary strands. In one embodiment, a siRNA refers to a nucleic acid that can form a double stranded siRNA. The sequence of the siRNA can correspond to the full-length target gene, or a subsequence thereof. Typically, the siRNA is at least about 15-50 nucleotides in length (e.g., each complementary sequence of the double stranded siRNA is about 15-50 nucleotides in length, and the double stranded siRNA is about 15-50 base pairs in length, preferably about 19-30 base nucleotides, preferably about 20-25 nucleotides in length, e.g., 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30 nucleotides in length).

As used herein “shRNA” or “small hairpin RNA” (also called stem loop) is a type of siRNA. In one embodiment, these shRNAs are composed of a short, e.g., about 19 to about 25 nucleotide, antisense strand, followed by a nucleotide loop of about 5 to about 9 nucleotides, and the analogous sense strand. Alternatively, the sense strand can precede the nucleotide loop structure and the antisense strand can follow.

The terms “microRNA” or “miRNA” are used interchangeably herein are endogenous RNAs, some of which are known to regulate the expression of protein-coding genes at the posttranscriptional level. Endogenous microRNAs are small RNAs naturally present in the genome that are capable of modulating the productive utilization of mRNA. The term artificial microRNA includes any type of RNA sequence, other than endogenous microRNA, which is capable of modulating the productive utilization of mRNA. MicroRNA sequences have been described in publications such as Lim, et al., Genes & Development, 17, p. 991-1008 (2003), Lim et al Science 299, 1540 (2003), Lee and Ambros Science, 294, 862 (2001), Lau et al., Science 294, 858-861 (2001), Lagos-Quintana et al, Current Biology, 12, 735-739 (2002), Lagos Quintana et al, Science 294, 853-857 (2001), and Lagos-Quintana et al, RNA, 9, 175-179 (2003), which are incorporated herein by reference. Multiple microRNAs can also be incorporated into a precursor molecule. Furthermore, miRNA-like stem-loops can be expressed in cells as a vehicle to deliver artificial miRNAs and short interfering RNAs (siRNAs) for the purpose of modulating the expression of endogenous genes through the miRNA and or RNAi pathways.

As used herein, “double stranded RNA” or “dsRNA” refers to RNA molecules that are comprised of two strands. Double-stranded molecules include those comprised of a single RNA molecule that doubles back on itself to form a two-stranded structure. For example, the stem loop structure of the progenitor molecules from which the single-stranded miRNA is derived, called the pre-miRNA (Bartel et al. 2004. Cell 1 16:281-297), comprises a dsRNA molecule.

In certain example embodiments, the cargo molecule may one or more polypeptides. The polypeptide may be a full-length protein or a functional fragment or functional domain thereof, that is a fragment or domain that maintains the desired functionality of the full-length protein. As used within this section “protein” is meant to refer to full-length proteins and functional fragments and domains thereof. A wide array of polypeptides may be delivered using the engineered delivery vesicles described herein, including but not limited to, secretory proteins, immunomodulatory proteins, anti-fibrotic proteins, proteins that promote tissue regeneration and/or transplant survival functions, hormones, anti-microbial proteins, anti-fibrillating polypeptides, and antibodies. The one or more polypeptides may also comprise combinations of the aforementioned example classes of polypeptides. It will be appreciated that any of the polypeptides described herein can also be delivered via the engineered delivery vesicles and systems described herein via delivery of the corresponding encoding polynucleotide.

In certain example embodiments, the one or more polypeptides may comprise one or more secretory proteins. A secretory is a protein that is actively transported out of the cell, for example, the protein, whether it be endocrine or exocrine, is secreted by a cell. Secretory pathways have been shown conserved from yeast to mammals, and both conventional and unconventional protein secretion pathways have been demonstrated in plants. Chung et al., “An Overview of Protein Secretion in Plant Cells,” MIMB, 1662:19-32, Sep. 1, 2017. Accordingly, identification of secretory proteins in which one or more polynucleotides may be inserted can be identified for particular cells and applications. In embodiments, one of skill in the art can identify secretory proteins based on the presence of a signal peptide, which consists of a short hydrophobic N-terminal sequence.

In embodiments, the protein is secreted by the secretory pathway. In embodiments, the proteins are exocrine secretion proteins or peptides, comprising enzymes in the digestive tract. In embodiments the protein is endocrine secretion protein or peptide, for example, insulin and other hormones released into the blood stream. In other embodiments, the protein is involved in signaling between or within cells via secreted signaling molecules, for example, paracrine, autocrine, endocrine or neuroendocrine. In embodiments, the secretory protein is selected from the group of cytokines, kinases, hormones and growth factors that bind to receptors on the surface of target cells.

As described, secretory proteins include hormones, enzymes, toxins, and antimicrobial peptides. Examples of secretory proteins include serine proteases (e.g., pepsins, trypsin, chymotrypsin, elastase and plasminogen activators), amylases, lipases, nucleases (e.g. deoxyribonucleases and ribonucleases), peptidases enzyme inhibitors such as serpins (e.g., al-antitrypsin and plasminogen activator inhibitors), cell attachment proteins such as collagen, fibronectin and laminin, hormones and growth factors such as insulin, growth hormone, prolactin platelet-derived growth factor, epidermal growth factor, fibroblast growth factors, interleukins, interferons, apolipoproteins, and carrier proteins such as transferrin and albumins. In some examples, the secretory protein is insulin or a fragment thereof. In one example, the secretory protein is a precursor of insulin or a fragment thereof. In certain examples, the secretory protein is c-peptide. In a preferred embodiment, the one or more polynucleotides is inserted in the middle of the c-peptide. In some aspects, the secretory protein is GLP-1, glucagon, betatrophin, pancreatic amylase, pancreatic lipase, carboxypeptidase, secretin, CCK, a PPAR (e.g. PPAR-alpha, PPAR-gamma, PPAR-delta or a precursor thereof (e.g. preprotein or preproprotein). In aspects, the secretory protein is fibronectin, a clotting factor protein (e.g. Factor VII, VIII, IX, etc.), α2-macroglobulin, al-antitrypsin, antithrombin III, protein S, protein C, plasminogen, α2-antiplasmin, complement components (e.g. complement component C1-9), albumin, ceruloplasmin, transcortin, haptoglobin, hemopexin, IGF binding protein, retinol binding protein, transferrin, vitamin-D binding protein, transthyretin, IGF-1, thrombopoietin, hepcidin, angiotensinogen, or a precursor protein thereof. In aspects, the secretory protein is pepsinogen, gastric lipase, sucrase, gastrin, lactase, maltase, peptidase, or a precursor thereof. In aspects, the secretory protein is renin, erythropoietin, angiotensin, adrenocorticotropic hormone (ACM), amylin, atrial natriuretic peptide (ANP), calcitonin, ghrelin, growth hormone (GH), leptin, melanocyte-stimulating hormone (MSH), oxytocin, prolactin, follicle-stimulating hormone (FSH), thyroid stimulating hormone (TSH), thyrotropin-releasing hormone (TRH), vasopressin, vasoactive intestinal peptide, or a precursor thereof.

In certain example embodiments, the one or more polypeptides may comprise one or more immunomodulatory protein. In certain embodiments, the present invention provides for modulating immune states. The immune state can be modulated by modulating T cell function or dysfunction. In particular embodiments, the immune state is modulated by expression and secretion of IL-10 and/or other cytokines as described elsewhere herein. In certain embodiments, T cells can affect the overall immune state, such as other immune cells in proximity.

The polynucleotides may encode one or more immunomodulatory proteins, including immunosuppressive proteins. The term “immunosuppressive” means that immune response in an organism is reduced or depressed. An immunosuppressive protein may suppress, reduce, or mask the immune system or degree of response of the subject being treated. For example, an immunosuppressive protein may suppress cytokine production, downregulate or suppress self-antigen expression, or mask the MHC antigens. As used herein, the term “immune response” refers to a response by a cell of the immune system, such as a B cell, T cell (CD4+ or CD8+), regulatory T cell, antigen-presenting cell, dendritic cell, monocyte, macrophage, NKT cell, NK cell, basophil, eosinophil, or neutrophil, to a stimulus. In some embodiments, the response is specific for a particular antigen (an “antigen-specific response”) and refers to a response by a CD4 T cell, CD8 T cell, or B cell via their antigen-specific receptor. In some embodiments, an immune response is a T cell response, such as a CD4+ response or a CD8+ response. Such responses by these cells can include, for example, cytotoxicity, proliferation, cytokine or chemokine production, trafficking, or phagocytosis, and can be dependent on the nature of the immune cell undergoing the response. In some cases, the immunosuppressive proteins may exert pleiotropic functions. In some cases, the immunomodulatory proteins may maintain proper regulatory T cells versus effector T cells (Treg/Teff) balance. For examples, the immunomodulatory proteins may expand and/or activate the Tregs and blocks the actions of Teffs, thus providing immunoregulation without global immunosuppression. Target genes associated with immune suppression include, for example, checkpoint inhibitors such PD1, Tim3, Lag3, TIGIT, CTLA-4, and combinations thereof.

The term “immune cell” as used throughout this specification generally encompasses any cell derived from a hematopoietic stem cell that plays a role in the immune response. The term is intended to encompass immune cells both of the innate or adaptive immune system. The immune cell as referred to herein may be a leukocyte, at any stage of differentiation (e.g., a stem cell, a progenitor cell, a mature cell) or any activation stage. Immune cells include lymphocytes (such as natural killer cells, T-cells (including, e.g., thymocytes, Th or Tc; Th1, Th2, Th17, Thαβ, CD4+, CD8+, effector Th, memory Th, regulatory Th, CD4+/CD8+ thymocytes, CD4−/CD8− thymocytes, γδ T cells, etc.) or B-cells (including, e.g., pro-B cells, early pro-B cells, late pro-B cells, pre-B cells, large pre-B cells, small pre-B cells, immature or mature B-cells, producing antibodies of any isotype, T1 B-cells, T2, B-cells, naïve B-cells, GC B-cells, plasmablasts, memory B-cells, plasma cells, follicular B-cells, marginal zone B-cells, B-1 cells, B-2 cells, regulatory B cells, etc.), such as for instance, monocytes (including, e.g., classical, non-classical, or intermediate monocytes), (segmented or banded) neutrophils, eosinophils, basophils, mast cells, histiocytes, microglia, including various subtypes, maturation, differentiation, or activation stages, such as for instance hematopoietic stem cells, myeloid progenitors, lymphoid progenitors, myeloblasts, promyelocytes, myelocytes, metamyelocytes, monoblasts, promonocytes, lymphoblasts, prolymphocytes, small lymphocytes, macrophages (including, e.g., Kupffer cells, stellate macrophages, M1 or M2 macrophages), (myeloid or lymphoid) dendritic cells (including, e.g., Langerhans cells, conventional or myeloid dendritic cells, plasmacytoid dendritic cells, mDC-1, mDC-2, Mo-DC, HP-DC, veiled cells), granulocytes, polymorphonuclear cells, antigen-presenting cells (APC), etc.

T cell response refers more specifically to an immune response in which T cells directly or indirectly mediate or otherwise contribute to an immune response in a subject. T cell-mediated response may be associated with cell mediated effects, cytokine mediated effects, and even effects associated with B cells if the B cells are stimulated, for example, by cytokines secreted by T cells. By means of an example but without limitation, effector functions of MHC class I restricted Cytotoxic T lymphocytes (CTLs), may include cytokine and/or cytolytic capabilities, such as lysis of target cells presenting an antigen peptide recognized by the T cell receptor (naturally-occurring TCR or genetically engineered TCR, e.g., chimeric antigen receptor, CAR), secretion of cytokines, preferably IFN gamma, TNF alpha and/or or more immunostimulatory cytokines, such as IL-2, and/or antigen peptide-induced secretion of cytotoxic effector molecules, such as granzymes, perforins or granulysin. By means of example but without limitation, for MHC class II restricted T helper (h) cells, effector functions may be antigen peptide-induced secretion of cytokines, preferably, IFN gamma, TNF alpha, IL-4, IL5, IL-10, and/or IL-2. By means of example but without limitation, for T regulatory (Treg) cells, effector functions may be antigen peptide-induced secretion of cytokines, preferably, IL-10, IL-35, and/or TGF-beta. B cell response refers more specifically to an immune response in which B cells directly or indirectly mediate or otherwise contribute to an immune response in a subject. Effector functions of B cells may include in particular production and secretion of antigen-specific antibodies by B cells (e.g., polyclonal B cell response to a plurality of the epitopes of an antigen (antigen-specific antibody response)), antigen presentation, and/or cytokine secretion.

During persistent immune activation, such as during uncontrolled tumor growth or chronic infections, subpopulations of immune cells, particularly of CD8+ or CD4+ T cells, become compromised to different extents with respect to their cytokine and/or cytolytic capabilities. Such immune cells, particularly CD8+ or CD4+ T cells, are commonly referred to as “dysfunctional” or as “functionally exhausted” or “exhausted”. As used herein, the term “dysfunctional” or “functional exhaustion” refer to a state of a cell where the cell does not perform its usual function or activity in response to normal input signals, and includes refractivity of immune cells to stimulation, such as stimulation via an activating receptor or a cytokine. Such a function or activity includes, but is not limited to, proliferation (e.g., in response to a cytokine, such as IFN-gamma) or cell division, entrance into the cell cycle, cytokine production, cytotoxicity, migration and trafficking, phagocytotic activity, or any combination thereof. Normal input signals can include, but are not limited to, stimulation via a receptor (e.g., T cell receptor, B cell receptor, co-stimulatory receptor). Unresponsive immune cells can have a reduction of at least 10%, 20%, 300%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, or even 100% in cytotoxic activity, cytokine production, proliferation, trafficking, phagocytotic activity, or any combination thereof, relative to a corresponding control immune cell of the same type. In some particular embodiments of the aspects described herein, a cell that is dysfunctional is a CD8+ T cell that expresses the CD8+ cell surface marker. Such CD8+ cells normally proliferate and produce cell killing enzymes, e.g., they can release the cytotoxins perforin, granzymes, and granulysin. However, exhausted/dysfunctional T cells do not respond adequately to TCR stimulation, and display poor effector function, sustained expression of inhibitory receptors and a transcriptional state distinct from that of functional effector or memory T cells. Dysfunction/exhaustion of T cells thus prevents optimal control of infection and tumors. Exhausted/dysfunctional immune cells, such as T cells, such as CD8+ T cells, may produce reduced amounts of IFN-gamma, TNF-alpha and/or one or more immunostimulatory cytokines, such as IL-2, compared to functional immune cells. Exhausted/dysfunctional immune cells, such as T cells, such as CD8+ T cells, may further produce (increased amounts of) one or more immunosuppressive transcription factors or cytokines, such as IL-10 and/or Foxp3, compared to functional immune cells, thereby contributing to local immunosuppression. Dysfunctional CD8+ T cells can be both protective and detrimental against disease control. As used herein, a “dysfunctional immune state” refers to an overall suppressive immune state in a subject or microenvironment of the subject (e.g., tumor microenvironment). For example, increased IL-10 production leads to suppression of other immune cells in a population of immune cells.

CD8+ T cell function is associated with their cytokine profiles. It has been reported that effector CD8+ T cells with the ability to simultaneously produce multiple cytokines (polyfunctional CD8+ T cells) are associated with protective immunity in patients with controlled chronic viral infections as well as cancer patients responsive to immune therapy (Spranger et al., 2014, J. Immunother. Cancer, vol. 2, 3). In the presence of persistent antigen CD8+ T cells were found to have lost cytolytic activity completely over time (Moskophidis et al., 1993, Nature, vol. 362, 758-761). It was subsequently found that dysfunctional T cells can differentially produce IL-2, TNFa and IFNg in a hierarchical order (Wherry et al., 2003, J. Virol., vol. 77, 4911-4927). Decoupled dysfunctional and activated CD8+ cell states have also been described (see, e.g., Singer, et al. (2016). A Distinct Gene Module for Dysfunction Uncoupled from Activation in Tumor-Infiltrating T Cells. Cell 166, 1500-1511 e1509; WO/2017/075478; and WO/2018/049025).

The invention provides compositions and methods for modulating T cell balance. The invention provides T cell modulating agents that modulate T cell balance. For example, in some embodiments, the invention provides T cell modulating agents and methods of using these T cell modulating agents to regulate, influence or otherwise impact the level of and/or balance between T cell types, e.g., between Th17 and other T cell types, for example, Th1-like cells. For example, in some embodiments, the invention provides T cell modulating agents and methods of using these T cell modulating agents to regulate, influence or otherwise impact the level of and/or balance between Th17 activity and inflammatory potential. As used herein, terms such as “h17 cell” and/or “Th17 phenotype” and all grammatical variations thereof refer to a differentiated T helper cell that expresses one or more cytokines selected from the group the consisting of interleukin 17A (IL-17A), interleukin 17F (IL-17F), and interleukin 17A/F heterodimer (IL17-AF). As used herein, terms such as “Th1 cell” and/or “Th1 phenotype” and all grammatical variations thereof refer to a differentiated T helper cell that expresses interferon gamma (IFNγ). As used herein, terms such as “M2 cell” and/or “Th2 phenotype” and all grammatical variations thereof refer to a differentiated T helper cell that expresses one or more cytokines selected from the group the consisting of interleukin 4 (IL-4), interleukin 5 (IL-5) and interleukin 13 (IL-13). As used herein, terms such as “Treg cell” and/or “Treg phenotype” and all grammatical variations thereof refer to a differentiated T cell that expresses Foxp3.

In some examples, immunomodulatory proteins may be immunosuppressive cytokines. In general, cytokines are small proteins and include interleukins, lymphokines and cell signal molecules, such as tumor necrosis factor and the interferons, which regulate inflammation, hematopoiesis, and response to infections. Examples of immunosuppressive cytokines include interleukin 10 (IL-10), TGF-β, IL-Ra, IL-18Ra, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-11, IL-12, IL-13, IL-14, IL-15, IL-16, IL-17, IL-19, IL-20, IL-21, IL-22, IL-23, IL-24, IL-25, IL-26, IL-27, IL-28, IL-29, IL-30, IL-31, IL-32, IL-33, IL-34, IL-35, IL-36, IL-37, PGE2, SCF, G-CSF, CSF-1R, M-CSF, GM-CSF, IFN-α, IFN-β, IFN-γ, IFN-λ, bFGF, CCL2, CXCL1, CXCL8, CXCL12, CX3CL1, CXCR4, TNF-α and VEGF. Examples of immunosuppressive proteins may further include FOXP3, AHR, TRP53, IKZF3, IRF4, IRF1, and SMAD3. In one example, the immunosuppressive protein is IL-10. In one example, the immunosuppressive protein is IL-6. In one example, the immunosuppressive protein is IL-2.

In certain example embodiments, the one or more polypeptides may comprise an anti-fibrotic protein. Examples of anti-fibrotic proteins include any protein that reduces or inhibits the production of extracellular matrix components, fibronectin, proteoglycan, collagen, elastin, TGIFs, and SMAD7. In embodiments, the anti-fibrotic protein is a peroxisome proliferator-activated receptor (PPAR), or may include one or more PPARs. In some embodiments, the protein is PPARα, PPAR γ is a dual PPARα/γ. Derosa et al., “The role of various peroxisome proliferator-activated receptors and their ligands in clinical practice” Jan. 18, 2017 J. Cell. Phys. 223:1 153-161.

Proteins that Promote Tissue Regeneration and/or Transplant Survival Functions

In certain example embodiments, the one or more polypeptides may comprise proteins that promote tissue regeneration and/or transplant survival functions. In some cases, such proteins may induce and/or up-regulate the expression of genes for pancreatic β cell regeneration. In some cases, the proteins that promote transplant survival and functions include the products of genes for pancreatic β cell regeneration. Such genes may include proislet peptides that are proteins or peptides derived from such proteins that stimulate islet cell neogenesis. Examples of genes for pancreatic β cell regeneration include Reg1, Reg2, Reg3, Reg4, human proislet peptide, parathyroid hormone-related peptide (1-36), glucagon-like peptide-1 (GLP-1), extendin-4, prolactin, Hgf, Igf-1, Gip-1, adipsin, resistin, leptin, IL-6, IL-10, Pdx1, Ptfa1, Mafa, Pax6, Pax4, Nkx6.1, Nkx2.2, PDGF, vglycin, placental lactogens (somatomammotropins, e.g., CSH1, CHS2), isoforms thereof, homologs thereof, and orthologs thereof. In certain embodiments, the protein promoting pancreatic B cell regeneration is a cytokine, myokine, and/or adipokine.

In certain embodiments, the one or more polynucleotides may comprise one or more hormones. The term “hormone” refers to polypeptide hormones, which are generally secreted by glandular organs with ducts. Hormones include proteins from natural sources or from recombinant cell culture and biologically active equivalents of the native sequence hormone, including synthetically produced small-molecule entities and pharmaceutically acceptable derivatives and salts thereof. Included among the hormones are, for example, growth hormone such as human growth hormone, N-methionyl human growth hormone, and bovine growth hormone; parathyroid hormone; thyroxine; insulin; proinsulin; relaxin; prorelaxin; glycoprotein hormones such as follicle stimulating hormone (FSH), thyroid stimulating hormone (TSH), and luteinizing hormone (LH); prolactin, placental lactogen, mouse gonadotropin-associated peptide, inhibin; activin; mullerian-inhibiting substance; and thrombopoietin, growth hormone (GH), adrenocorticotropic hormone (ACTH), dehydroepiandrosterone (DHEA), cortisol, epinephrine, thyroid hormone, estrogen, progesterone, placental lactogens (somatomammotropins, e.g. CSH1, CHS2), testosterone. and neuroendocrine hormones. In certain examples, the hormone is secreted from pancreas, e.g., insulin, glucagon, somatostatin, pancreatic polypeptide and ghrelin. In some examples, the hormone is insulin.

Hormones herein may also include growth factors, e.g., fibroblast growth factor (FGF) family, bone morphogenic protein (BMP) family, platelet derived growth factor (PDGF) family, transforming growth factor beta (TGFbeta) family, nerve growth factor (NGF) family, epidermal growth factor (EGF) family, insulin related growth factor (IGF) family, hepatocyte growth factor (HGF) family, hematopoietic growth factors (HeGFs), platelet-derived endothelial cell growth factor (PD-ECGF), angiopoietin, vascular endothelial growth factor (VEGF) family, and glucocorticoids. In a particular embodiment, the hormone is insulin or incretins such as exenatide, GLP-1.

In embodiments, the secreted peptide is a neurohormone, a hormone produced and released by neuroendocrine cells. Example neurohormones include Thyrotropin-releasing hormone, Corticotropin-releasing hormone, Histamine, Growth hormone-releasing hormone, Somatostatin, Gonadotropin-releasing hormone, Serotonin, Dopamine, Neurotensin, Oxytocin, Vasopressin, Epinephrine, and Norepinephrine.

In some embodiments, the one or more polypeptides may comprise one or more anti-microbial proteins. In embodiments where the cell is mammalian cell, human host defense antimicrobial peptides and proteins (AMPs) play a critical role in warding off invading microbial pathogens. In certain embodiments, the anti-microbial is a-defensin HD-6, HNP-1 and β-defensin hBD-3, lysozyme, cathelcidin LL-37, C-type lectin RegIIIalpha, for example. See, e.g., Wang, “Human Antimicrobial Peptide and Proteins” Pharma, May 2014, 7(5): 545-594, incorporated herein by reference.

In certain example embodiments, the one or more polypeptides may comprise one or more anti-fibrillating polypeptides. The anti-fibrillating polypeptide can be the secreted polypeptide. In some embodiments, the anti-fibrillating polypeptide is co-expressed with one or more other polynucleotides and/or polypeptides described elsewhere herein. The anti-fibrillating agent can be secreted and act to inhibit the fibrillation and/or aggregation of endogenous proteins and/or exogenous proteins that it may be co-expressed therewith. In some embodiments, the anti-fibrillating agent is P4 (VITYF (SEQ ID NO: 55)), P5 (VVVVV (SEQ ID NO: 56)), KR7 (KPWWPRR (SEQ ID NO: 57)), NK9 (NIVNVSLVK (SEQ ID NO: 58)), iAb5p (Leu-Pro-Phe-Phe-Asp (SEQ ID NO: 59)), KLVF (SEQ ID NO: 60) and derivatives thereof, indolicidin, carnosine, a hexapeptide as set forth in Wang et al. 2014. ACS Chem Neurosci. 5:972-981, alpha sheet peptides having alternating D-amino acids and L-amino acids as set forth in Hopping et al. 2014. Elife 3:e01681, D-(PGKLVYA (SEQ ID NO: 61)), RI-OR2-TAT, cyclo(17, 21)-(Lys17, Asp21)A_(1-28), SEN304, SEN1576, D3, R8-Aβ(25-35), human yD-crystallin (HGD), poly-lysine, heparin, poly-Asp, polyGl, poly-L-lysine, poly-L-glutamic acid, LVEALYL (SEQ ID NO: 62), RGFFYT (SEQ ID NO: 63), a peptide set forth or as designed/generated by the method set forth in U.S. Pat. No. 8,754,034, and combinations thereof. In aspects, the anti-fibrillating agent is a D-peptide. In aspects, the anti-fibrillating agent is an L-peptide. In aspects, the anti-fibrillating agent is a retro-inverso modified peptide. Retro-inverso modified peptides are derived from peptides by substituting the L-amino acids for their D-counterparts and reversing the sequence to mimic the original peptide since they retain the same spatial positioning of the side chains and 3D structure. In aspects, the retro-inverso modified peptide is derived from a natural or synthetic Aβ peptide. In some embodiments, the polynucleotide encodes a fibrillation resistant protein. In some embodiments, the fibrillation resistant protein is a modified insulin, see e.g., U.S. Pat. No. 8,343,914.

In certain embodiments, the one or more polypeptides may comprise one or more antibodies. The term “antibody” is used interchangeably with the term “immunoglobulin” herein, and includes intact antibodies, fragments of antibodies, e.g., Fab, F(ab′)2 fragments, and intact antibodies and fragments that have been mutated either in their constant and/or variable region (e.g., mutations to produce chimeric, partially humanized, or fully humanized antibodies, as well as to produce antibodies with a desired trait, e.g., enhanced binding and/or reduced FcR binding). The term “fragment” refers to a part or portion of an antibody or antibody chain comprising fewer amino acid residues than an intact or complete antibody or antibody chain. Fragments can be obtained via chemical or enzymatic treatment of an intact or complete antibody or antibody chain. Fragments can also be obtained by recombinant means. Exemplary fragments include Fab, Fab′, F(ab′)2, Fabc, Fd, dAb, VHH and scFv and/or Fv fragments.

The one or more cargo polypeptides, as exemplified above, may comprise one or more protease cleavage sites, i.e., amino acid sequences that can be recognized and cleaved by a protease. The protease cleavage sites may be used for generating desired gene products (e.g., intact gene products without any tags or portion of other proteins). The protease cleavage site may be one end or both ends of the protein. Examples of protease cleavage sites that can be used herein include an enterokinase cleavage site, a thrombin cleavage site, a Factor Xa cleavage site, a human rhinovirus 3C protease cleavage site, a tobacco etch virus (TEV) protease cleavage site, a dipeptidyl aminopeptidase cleavage site and a small ubiquitin-like modifier (SUMO)/ubiquitin-like protein-1 (ULP-1) protease cleavage site. In certain examples, the protease cleavage site comprises Lys-Arg.

In some embodiments, the cargo molecule is a small molecule. Techniques and methods of coupling peptides to small molecule agents are generally known in the art and can be applied here to couple a targeting moiety effective to target a CNS cell to a small molecule cargo. Small molecules include, without limitation, hormones, immunomodulators, antipyretics, anxiolytics, antipsychotics, analgesics, antispasmodics, anti-inflammatories, anti-histamines, anti-infectives, radiation sensitizers, chemotherapeutics.

Suitable hormones include, but are not limited to, amino-acid derived hormones (e.g., melatonin and thyroxine), small peptide hormones and protein hormones (e.g., thyrotropin-releasing hormone, vasopressin, insulin, growth hormone, luteinizing hormone, follicle-stimulating hormone, and thyroid-stimulating hormone), eicosanoids (e.g., arachidonic acid, lipoxins, and prostaglandins), and steroid hormones (e.g., estradiol, testosterone, tetrahydro testosteron Cortisol). Suitable immunomodulators include, but are not limited to, prednisone, azathioprine, 6-MP, cyclosporine, tacrolimus, methotrexate, interleukins (e.g., IL-2, IL-7, and IL-12), cytokines (e.g., interferons (e.g., IFN-α, IFN-β, IFN-ε, IFN-K, IFN-ω, and IFN-γ), granulocyte colony-stimulating factor, and imiquimod), chemokines (e.g., CCL3, CCL26 and CXCL7), cytosine phosphate-guanosine, oligodeoxynucleotides, glucans, antibodies, and aptamers).

Suitable antipyretics include, but are not limited to, non-steroidal anti-inflammants (e.g., ibuprofen, naproxen, ketoprofen, and nimesulide), aspirin and related salicylates (e.g., choline salicylate, magnesium salicylae, and sodium salicaylate), paracetamol/acetaminophen, metamizole, nabumetone, phenazone, and quinine.

Suitable anxiolytics include, but are not limited to, benzodiazepines (e.g., alprazolam, bromazepam, chlordiazepoxide, clonazepam, clorazepate, diazepam, flurazepam, lorazepam, oxazepam, temazepam, triazolam, and tofisopam), serotenergic antidepressants (e.g. selective serotonin reuptake inhibitors, tricyclic antidepresents, and monoamine oxidase inhibitors), mebicar, afobazole, selank, bromantane, emoxypine, azapirones, barbiturates, hydroxyzine, pregabalin, validol, and beta blockers.

Suitable antipsychotics include, but are not limited to, benperidol, bromoperidol, droperidol, haloperidol, moperone, pipaperone, timiperone, fluspirilene, penfluridol, pimozide, acepromazine, chlorpromazine, cyamemazine, dizyrazine, fluphenazine, levomepromazine, mesoridazine, perazine, pericyazine, perphenazine, pipotiazine, prochlorperazine, promazine, promethazine, prothipendyl, thioproperazine, thioridazine, trifluoperazine, triflupromazine, chlorprothixene, clopenthixol, flupentixol, tiotixene, zuclopenthixol, clotiapine, loxapine, prothipendyl, carpipramine, clocapramine, molindone, mosapramine, sulpiride, veralipride, amisulpride, amoxapine, aripiprazole, asenapine, clozapine, blonanserin, iloperidone, lurasidone, melperone, nemonapride, olanzapine, paliperidone, perospirone, quetiapine, remoxipride, risperidone, sertindole, trimipramine, ziprasidone, zotepine, alstonie, befeprunox, bitopertin, brexpiprazole, cannabidiol, cariprazine, pimavanserin, pomaglumetad methionil, vabicaserin, xanomeline, and zicronapine.

Suitable analgesics include, but are not limited to, paracetamol/acetaminophen, nonsteroidal anti-inflammants (e.g. ibuprofen, naproxen, ketoprofen, and nimesulide), COX-2 inhibitors (e.g. rofecoxib, celecoxib, and etoricoxib), opioids (e.g. morphine, codeine, oxycodone, hydrocodone, dihydromorphine, pethidine, buprenorphine), tramadol, norepinephrine, flupiretine, nefopam, orphenadrine, pregabalin, gabapentin, cyclobenzaprine, scopolamine, methadone, ketobemidone, piritramide, and aspirin and related salicylates (e.g., choline salicylate, magnesium salicylate, and sodium salicylate).

Suitable antispasmodics include, but are not limited to, mebeverine, papverine, cyclobenzaprine, carisoprodol, orphenadrine, tizanidine, metaxalone, methodcarbamol, chlorzoxazone, baclofen, dantrolene, baclofen, tizanidine, and dantrolene. Suitable anti-inflammatories include, but are not limited to, prednisone, non-steroidal anti-inflammants (e.g., ibuprofen, naproxen, ketoprofen, and nimesulide), COX-2 inhibitors (e.g., rofecoxib, celecoxib, and etoricoxib), and immune selective anti-inflammatory derivatives (e.g., submandibular gland peptide-T and its derivatives).

Suitable anti-histamines include, but are not limited to, H1-receptor antagonists (e.g., acrivastine, azelastine, bilastine, brompheniramine, buclizine, bromodiphenhydramine, carbinoxamine, cetirizine, chlorpromazine, cyclizine, chlorpheniramine, clemastine, cyproheptadine, desloratadine, dexbromapheniramine, dexchlorpheniramine, dimenhydrinate, dimetindene, diphenhydramine, doxylamine, ebasine, embramine, fexofenadine, hydroxyzine, levocetirzine, loratadine, meclozine, mirtazapine, olopatadine, orphenadrine, phenindamine, pheniramine, phenyltoloxamine, promethazine, pyrilamine, quetiapine, rupatadine, tripelennamine, and triprolidine), H2-receptor antagonists (e.g., cimetidine, famotidine, lafutidine, nizatidine, rafitidine, and roxatidine), tritoqualine, catechin, cromoglicate, nedocromil, and p2-adrenergic agonists.

Suitable anti-infectives include, but are not limited to, amebicides (e.g., nitazoxanide, paromomycin, metronidazole, tinidazole, chloroquine, miltefosine, amphotericin b, and iodoquinol), aminoglycosides (e.g., paromomycin, tobramycin, gentamicin, amikacin, kanamycin, and neomycin), anthelmintics (e.g., pyrantel, mebendazole, ivermectin, praziquantel, abendazole, thiabendazole, oxamniquine), antifungals (e.g., azole antifungals (e.g., itraconazole, fluconazole, posaconazole, ketoconazole, clotrimazole, miconazole, and voriconazole), echinocandins (e.g., caspofungin, anidulafungin, and micafungin), griseofulvin, terbinafine, flucytosine, and polyenes (e.g., nystatin, and amphotericin b), antimalarial agents (e.g., pyrimethamine/sulfadoxine, artemether/lumefantrine, atovaquone/proquanil, quinine, hydroxychloroquine, mefloquine, chloroquine, doxycycline, pyrimethamine, and halofantrine), antituberculosis agents (e.g., aminosalicylates (e.g., aminosalicylic acid), isoniazid/rifampin, isoniazid/pyrazinamide/rifampin, bedaquiline, isoniazid, ethambutol, rifampin, rifabutin, rifapentine, capreomycin, and cycloserine), antivirals (e.g., amantadine, rimantadine, abacavir/lamivudine, emtricitabine/tenofovir, cobicistat/elvitegravir/emtricitabine/tenofovir, efavirenz/emtricitabine/tenofovir, avacavir/lamivudine/zidovudine, lamivudine/zidovudine, emtricitabine/tenofovir, emtricitabine/opinavir/ritonavir/tenofovir, interferon alfa-2v/ribavirin, peginterferon alfa-2b, maraviroc, raltegravir, dolutegravir, enfuvirtide, foscamet, fomivirsen, oseltamivir, zanamivir, nevirapine, efavirenz, etravirine, rilpivirine, delaviridine, nevirapine, entecavir, lamivudine, adefovir, sofosbuvir, didanosine, tenofovir, avacivr, zidovudine, stavudine, emtricitabine, xalcitabine, telbivudine, simeprevir, boceprevir, telaprevir, lopinavir/ritonavir, fosamprenvir, dranuavir, ritonavir, tipranavir, atazanavir, nelfinavir, amprenavir, indinavir, sawuinavir, ribavirin, valcyclovir, acyclovir, famciclovir, ganciclovir, and valganciclovir), carbapenems (e.g., doripenem, meropenem, ertapenem, and cilastatin/imipenem), cephalosporins (e.g., cefadroxil, cephradine, cefazolin, cephalexin, cefepime, ceflaroline, loracarbef, cefotetan, cefuroxime, cefprozil, loracarbef, cefoxitin, cefaclor, ceftibuten, ceftriaxone, cefotaxime, cefpodoxime, cefdinir, cefixime, cefditoren, cefizoxime, and ceftazidime), glycopeptide antibiotics (e.g., vancomycin, dalbavancin, oritavancin, and telvancin), glycylcyclines (e.g., tigecycline), leprostatics (e.g., clofazimine and thalidomide), lincomycin and derivatives thereof (e.g., clindamycin and lincomycin), macrolides and derivatives thereof (e.g., telithromycin, fidaxomicin, erthromycin, azithromycin, clarithromycin, dirithromycin, and troleandomycin), linezolid, sulfamethoxazole/trimethoprim, rifaximin, chloramphenicol, fosfomycin, metronidazole, aztreonam, bacitracin, penicillins (amoxicillin, ampicillin, bacampicillin, carbenicillin, piperacillin, ticarcillin, amoxicillin/clavulanate, ampicillin/sulbactam, piperacillin/tazobactam, clavulanate/ticarcillin, penicillin, procaine penicillin, oxaxillin, dicloxacillin, and nafcillin), quinolones (e.g., lomefloxacin, norfloxacin, ofloxacin, qatifloxacin, moxifloxacin, ciprofloxacin, levofloxacin, gemifloxacin, moxifloxacin, cinoxacin, nalidixic acid, enoxacin, grepafloxacin, gatifloxacin, trovafloxacin, and sparfloxacin), sulfonamides (e.g., sulfamethoxazole/trimethoprim, sulfasalazine, and sulfasoxazole), tetracyclines (e.g., doxycycline, demeclocycline, minocycline, doxycycline/salicyclic acid, doxycycline/omega-3 polyunsaturated fatty acids, and tetracycline), and urinary anti-infectives (e.g., nitrofurantoin, methenamine, fosfomycin, cinoxacin, nalidixic acid, trimethoprim, and methylene blue).

Suitable chemotherapeutics include, but are not limited to, paclitaxel, brentuximab vedotin, doxorubicin, 5-FU (fluorouracil), everolimus, pemetrexed, melphalan, pamidronate, anastrozole, exemestane, nelarabine, ofatumumab, bevacizumab, belinostat, tositumomab, carmustine, bleomycin, bosutinib, busulfan, alemtuzumab, irinotecan, vandetanib, bicalutamide, lomustine, daunorubicin, clofarabine, cabozantinib, dactinomycin, ramucirumab, cytarabine, Cytoxan, cyclophosphamide, decitabine, dexamethasone, docetaxel, hydroxyurea, decarbazine, leuprolide, epirubicin, oxaliplatin, asparaginase, estramustine, cetuximab, vismodegib, asparginase Erwinia chrysanthemi, amifostine, etoposide, flutamide, toremifene, fulvestrant, letrozole, degarelix, pralatrexate, methotrexate, floxuridine, obinutuzumab, gemcitabine, afatinib, imatinib mesylatem, carmustine, eribulin, trastuzumab, altretamine, topotecan, ponatinib, idarubicin, ifosfamide, ibrutinib, axitinib, interferon alfa-2a, gefitinib, romidepsin, ixabepilone, ruxolitinib, cabazitaxel, ado-trastuzumab emtansine, carfilzomib, chlorambucil, sargramostim, cladribine, mitotane, vincristine, procarbazine, megestrol, trametinib, mesna, strontium-89 chloride, mechlorethamine, mitomycin, busulfan, gemtuzumab ozogamicin, vinorelbine, filgrastim, pegfilgrastim, sorafenib, nilutamide, pentostatin, tamoxifen, mitoxantrone, pegaspargase, denileukin diftitox, alitretinoin, carboplatin, pertuzumab, cisplatin, pomalidomide, prednisone, aldesleukin, mercaptopurine, zoledronic acid, lenalidomide, rituximab, octretide, dasatinib, regorafenib, histrelin, sunitinib, siltuximab, omacetaxine, thioguanine (tioguanine), dabrafenib, erlotinib, bexarotene, temozolomide, thiotepa, thalidomide, BCG, temsirolimus, bendamustine hydrochloride, triptorelin, aresnic trioxide, lapatinib, valrubicin, panitumumab, vinblastine, bortezomib, tretinoin, azacitidine, pazopanib, teniposide, leucovorin, crizotinib, capecitabine, enzalutamide, ipilimumab, goserelin, vorinostat, idelalisib, ceritinib, abiraterone, epothilone, tafluposide, azathioprine, doxifluridine, vindesine, and all-trans retinoic acid.

Described herein are exemplary embodiments of engineered viral polypeptides, (e.g., capsid polypeptides), such as adeno-associated virus (AAV) viral polypeptides (e.g., capsid polypeptides), that can be engineered to confer cell-specific tropism to an engineered viral particle (AAV particle) that contains the engineered viral polypeptide (s). The engineered viral polypeptide (s) (e.g., capsid(s)) can be included in an engineered virus particle, and can confer cell-specific tropism, such as CNS-specific tropism, reduced immunogenicity, or both to the engineered viral (e.g., an AAV) particle. As is described elsewhere herein, the particles can include a cargo. In this way, the particles can be a cell-specific delivery vehicle for a cargo. The engineered viral capsids described herein can include one or more engineered viral capsid polypeptides described herein. Engineered viral capsid polypeptides can be lentiviral, retroviral, adenoviral, or AAV. Engineered capsids can contain one or more of the viral capsid polypeptides. Engineered virus particles can include one or more of the engineered viral capsid polypeptides and thus contain an engineered viral capsid. The engineered viral capsid polypeptides, capsids, and/or viral particles that contain one or more CNS-specific targeting moieties containing or composed of one or more n-mer inserts described elsewhere herein. In some embodiments, the engineered viral capsid polypeptides, viral capsids, and/or viral particles can have a CNS-specific tropism conferred to it by the one or more n-mer inserts contained therein.

The CNS-specific n-mer inserts and targeting moieties can be encoded in whole or in part by a polynucleotide. The engineered viral capsid and/or viral capsid polypeptides can be encoded by one or more engineered viral capsid polynucleotides. In some embodiments, the engineered viral capsid polynucleotide is an engineered AAV capsid polynucleotide, engineered lentiviral capsid polynucleotide, engineered retroviral capsid polynucleotide, or engineered adenovirus capsid polynucleotide. In some embodiments, an engineered viral capsid polynucleotide (e.g., an engineered AAV capsid polynucleotide, engineered lentiviral capsid polynucleotide, engineered retroviral capsid polynucleotide, or engineered adenovirus capsid polynucleotide) can include a 3′ polyadenylation signal. The polyadenylation signal can be an SV40 polyadenylation signal.

The engineered AAV capsids can be variants of wild-type AAV capsids. In some embodiments, the wild-type AAV capsids can be composed of VP1, VP2, VP3 capsid polypeptides or a combination thereof. In other words, the engineered AAV capsids can include one or more variants of a wild-type VP1, wild-type VP2, and/or wild-type VP3 capsid polypeptides. In some embodiments, the serotype of the reference wild-type AAV capsid can be AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV rh.74, AAV rh.10, AAV12, AAV.DJ, AAV.ie, AAV1.9-3, AAV.Anc80, AAV.Anc80L65, AAV2.7m8, or AAV8BP2 or any combination thereof. In some embodiments, the serotype of the wild-type AAV capsid can be AAV-9. The engineered AAV capsids can have a different tropism than that of the reference wild-type AAV capsid.

The engineered AAV capsid can contain 1-60 engineered capsid polypeptides. In some embodiments, the engineered AAV capsids can contain 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, or 60 engineered capsid polypeptides. In some embodiments, the engineered AAV capsid can contain 0-59 wild-type AAV capsid polypeptides. In some embodiments, the engineered AAV capsid can contain 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, or 59 wild-type AAV capsid polypeptides.

In some embodiments, the engineered AAV capsid polypeptide can have an n-mer amino acid insert (also referred herein as an “n-mer insert”), where n can be at least 3 amino acids. In some embodiments, n can be 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, or 15 amino acids. In some embodiments, the engineered AAV capsid can have a 6-mer or 7-mer amino acid insert. In some embodiments, the n-mer amino acid inset can be inserted between two amino acids in the wild-type viral polypeptide (VP) (or capsid polypeptide). In some embodiments, the n-mer insert can be inserted between two amino acids in a variable amino acid region in an AAV capsid polypeptide. The core of each wild-type AAV viral polypeptide contains an eight-stranded beta-barrel motif (betaB to betaI) and an alpha-helix (alphaA) that are conserved in autonomous parvovirus capsids (see e.g., DiMattia et al. 2012. J. Virol. 86(12):6947-6958). Structural variable regions (VRs) occur in the surface loops that connect the beta-strands, which cluster to produce local variations in the capsid surface. AAVs have 12 variable regions (also referred to as hypervariable regions) (see e.g., Weitzman and Linden. 2011. “Adeno-Associated Virus Biology.” In Snyder, R. O., Moullier, P. (eds.) Totowa, NJ: Humana Press). In some embodiments, one or more n-mer inserts can be inserted between two amino acids in one or more of the 12 variable regions in the wild-type AVV capsid polypeptides. In some embodiments, the one or more n-mer inserts can be each be inserted between two amino acids in VR-I, VR-II, VR-III, VR-IV, VR-V, VR-VI, VR-VII, VR-III, VR-IX, VR-X, VR-XI, VR-XII, or a combination thereof. In some embodiments, the n-mer can be inserted between two amino acids in the VR-III of a capsid polypeptide. In some embodiments, the engineered capsid can have an n-mer inserted between any two contiguous amino acids between amino acids 262 and 269, between any two contiguous amino acids between amino acids 327 and 332, between any two contiguous amino acids between amino acids 382 and 386, between any two contiguous amino acids between amino acids 452 and 460, between any two contiguous amino acids between amino acids 488 and 505, between any two contiguous amino acids between amino acids 545 and 558, between any two contiguous amino acids between amino acids 581 and 593, between any two contiguous amino acids between amino acids 704 and 714 of an AAV9 viral polypeptide. In some embodiments, the engineered capsid can have an n-mer inserted between amino acids 588 and 589 of an AAV9 viral polypeptide. In some embodiments, the engineered capsid can have an n-mer insert inserted between amino acids 588 and 589 of an AAV9 viral polypeptide. In some embodiments, the engineered capsid can have an n-mer insert inserted between amino acids 598-599 of an AAV9 viral polypeptide SEQ ID NO: 1 is a reference AAV9 capsid sequence for at least referencing the insertion sites discussed above. It will be appreciated that n-mers can be inserted in analogous positions in AAV viral polypeptides of other serotypes. In some embodiments as previously discussed, the n-mer(s) can be inserted between any two contiguous amino acids within the AAV viral polypeptide and in some embodiments the insertion is made in a variable region.

In certain example embodiments, the targeting moiety comprises a viral polypeptide.

In certain example embodiments, the viral polypeptide is a capsid polypeptide.

In certain example embodiments, wherein the n-mer insert(s) is/are incorporated into the viral polypeptide such that the n-mer insert, or at least the P motif, or at least the double valine motifs located between two amino acids of the viral polypeptide such that the n-mer insert, or at least the P motif, or at least the double valine motif is external to a viral capsid.

In certain example embodiments, the viral polypeptide is an adeno associated virus (AAV) polypeptide.

In certain example embodiments, the AAV polypeptide is an AAV capsid polypeptide.

In certain example embodiments, one or more of the n-mer insert(s) are each incorporated into the AAV polypeptide such that n-mer motif, or at least the P motif, or at least the double valine motif is inserted between any two contiguous amino acids independently selected from amino acids 262-269, 327-332, 382-386, 452-460, 488-505, 527-539, 545-558, 581-593, 598-599, 704-714, or any combination thereof in an AAV9 capsid polypeptide or in an analogous position in an AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV rh.74, AAV rh.10, AAV12, AAV.DJ, AAV.ie, AAV1.9-3, AAV.Anc80, AAV.Anc80L65, AAV2.7m8, or AAV8BP2 capsid polypeptide.

In certain example embodiments, at least one of the n-mer inserts is incorporated into the AAV polypeptide such that n-mer insert(s), or at least the P motif(s), or at least the double valine motif(s) is inserted between amino acids 588 and 589 in an AAV9 capsid polypeptide or in an analogous position in an AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV rh.74, AAV rh.10, AAV12, AAV.DJ, AAV.ie, AAV1.9-3, AAV.Anc80, AAV.Anc80L65, AAV2.7m8, or AAV8BP2 capsid polypeptide.

In certain example embodiments, at least one of the n-mer insert(s) is incorporated into the AAV polypeptide such that the n-mer insert(s), or at least the P motif, or at least the double valine motif is inserted between amino acids 598-599 in an AAV9 capsid polypeptide or in an analogous position in an AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV rh.74, AAV rh.10, AAV12, AAV.DJ, AAV.ie, AAV1.9-3, AAV.Anc80, AAV.Anc80L65, AAV2.7m8, or AAV8BP2 capsid polypeptide

SEQ ID NO: 1 AAV9 capsid (wild-type) reference
Sequence:
MAADGYLPDWLEDNLSEGIREWWALKPGAPQPKANQQHQDNARGLVLPG
YKYLGPGNGLDKGEPVNAADAAALEHDKAYDQQLKAGDNPYLKYNHADA
EFQERLKEDTSFGGNLGRAVFQAKKRLLEPLGLVEEAAKTAPGKKRPVE
QSPQEPDSSAGIGKSGAQPAKKRLNFGQTGDTESVPDPQPIGEPPAAPS
GVGSLTMASGGGAPVADNNEGADGVGSSSGNWHCDSQWLGDRVITTSTR
TWALPTYNNHLYKQISNSTSGGSSNDNAYFGYSTPWGYFDFNRFHCHFS
PRDWQRLINNNWGFRPKRLNFKLFNIQVKEVTDNNGVKTIANNLTSTVQ
VFTDSDYQLPYVLGSAHEGCLPPFPADVFMIPQYGYLTLNDGSQAVGRS
SFYCLEYFPSQMLRTGNNFQFSYEFENVPFHSSYAHSQSLDRLMNPLID
QYLYYLSKTINGSGQNQQTLKFSVAGPSNMAVQGRNYIPGPSYRQQRVS
TTVTQNNNSEFAWPGASSWALNGRNSLMNPGPAMASHKEGEDRFFPLSG
SLIFGKQGTGRDNVDADKVMITNEEEIKTTNPVATESYGQVATNHQSAQ
AQAQTGWVQNQGILPGMVWQDRDVYLQGPIWAKIPHTDGNFHPSPLMGG
FGMKHPPPQILIKNTPVPADPPTAFNKDKLNSFITQYSTGQVSVEIEWE
LQKENSKRWNPEIQYTSNYYKSNNVEFAVNTEGVYSEPRPIGTRYLTRN
L

In some embodiments, an AAV capsid and/or AAV vector can contain one or more targeting moieties having one or more n-mer inserts containing one or more P-motifs. n-mer inserts containing or being P-motifs are described in greater detail elsewhere herein. In some embodiments, an AAV capsid and/or AAV vector can contain one or more targeting moieties having one or more n-mer inserts that are each immediately preceded by AQ or DG in the AAV capsid and/or vector in which they are inserted. In other words, the n-mer insert can be inserted into an AAV capsid and/or AAV vector between two contiguous amino acids such that the two residues preceding the n-mer insert are AQ or DG. In some embodiments, the n-mer insert is engineered such that the two C-terminal residues of the n-mer insert and/or preceding a P-motif of an n-mer insert are AQ or DG. In some embodiments, amino acids 587 and 588 of the AAV capsid or vector or analogous amino acids thereto are DG or DG.

In some embodiments, an AAV capsid (such as a CNS-specific AAV capsid) contains an n-mer insert that is or contains an n-mer motif, a P-motif, and/or a double valine motif such as any one or more as set forth in Tables 1-38, S1, or FIGS. 15A, 15B, 16A, 16B, 16C, 19A-19C. In some embodiments, insertion of the n-mer insert in an AAV capsid can result in cell, tissue, organ, specific engineered AAV capsids. In some embodiments, the engineered viral polypeptide, engineered viral capsid polypeptide, engineered viral capsid, and/or engineered viral particle has specificity for one or more types of CNS cells and/or tissue. In some embodiments, an engineered viral polypeptide, engineered viral capsid polypeptide, engineered viral capsid and/or engineered viral particle having an n-mer insert that is or contains a P-motif (e.g., those described in Tables 8 and S1 or FIGS. 15A, 15B, 16A, 16B, 16C, 19A-19C and elsewhere herein), has specificity for one or more types of CNS cells and/or tissue.

In some embodiments, the n-mer insert(s) in an AAV capsid is or includes a “P motif” and/or double valine motif. N-mer inserts, P motifs and double valine motifs are described in greater detail elsewhere herein. In some embodiments, an AAV capsid includes an n-mer insert comprising or consisting of a P-motif having the amino acid sequence XmPX1X2GTX3RXn (SEQ ID NO: 8579), wherein X1, X2, X3, Xm, and Xn, are each independently selected from any amino acid, wherein m is 0, 1, 2, or 3, and wherein n is 0, 1, 2, 3, 4, 5, 6, or 7. In some embodiments, the an AAV capsid includes an n-mer insert comprising or consisting of a P-motif having the amino acid sequence XmPX1QGTX3RXn (SEQ ID NO: 8581), where X1, X3, Xn, are each selected from any amino acid, where m is 0, 1, 2, or 3, and where n is 0, 1, 2, 3, 4, 5, 6, or 7. In some embodiments, an AAV capsid includes an n-mer insert comprising or consisting of a P-motif having the amino acid sequence PX1QGTX3RXn (SEQ ID NO: 2), where X1, X3, Xn, are each selected from any amino acid and where n is 0, 1, 2, 3, 4, 5, 6, or 7. In certain example embodiments, X2 of the P motif is Q, P, E, or H. In certain example embodiments, X1 of the P motif is a polar amino acid, optionally a polar uncharged amino acid. In certain example embodiments, X3 of the P motif is a nonpolar amino acid. In certain example embodiments, X1 of the double valine motif is R, K, V, or W. In certain example embodiments, X2 of the double valine motif is T, S, V, Y or R.

In some embodiments, the AAV capsid includes an n-mer insert that is or includes a double valine motif having the amino acid sequence of the amino acid sequence XmX1X2VX3X4VX5Xn, wherein X1, X2, X3, Xm, and Xn, are each independently selected from any amino acid, wherein m is 0, 1, 2, or 3, and wherein n is 0, 1, 2, 3, 4, 5, 6, or 7. Double valine motifs are further described in greater detail elsewhere herein. In certain example embodiments, X3 of the double valine motif is G, P, or S. In certain example embodiments, X4 of the double valine motif is S, D, or T. In certain example embodiments, X5 of the double valine motif is Y, G, S, or L.

Exemplary, non-limiting n-mer inserts, P motifs, and double valine motifs are shown at least in e.g., Table 1-38, S1 and FIGS. 15A, 15B, 16A, 16B, 16C, 19A-19C. N-mer inserts, P-motifs, and double valine motifs are further described elsewhere herein.

In some embodiments, one or more n-mer inserts can be as set forth in any one or more of Tables 1, 2, 3, 8, S1 and FIG. 15A, 15B, 16A, 16B, 16C, 19A, 19B, or 19C can be included in a CNS specific engineered capsid.

As is described above and demonstrated in e.g., Table 1 and the Working Examples, the n-mer insert can be inserted into an AAV vector between two contiguous amino acids where the amino acids in the AAV vector immediately preceding the n-mer insert can be DG or AQ. In connection with Table 1, the first two amino acids shown in the variants are either AQ or DG, which denote amino acid residues (e.g., residues 587 and 588 that were either endogenous to the vector or show amino acid residues that were part of the n-mer insert that replaced residues at position 587 and 588 in the AAV vector to which the n-mer insert was introduced. Each n-mer insert of Table 1 was tested in both configurations (e.g., with AQ and DG as amino acids 587 and 588 of the AAV).

In some embodiments, the n-mer insert (such as a 7-mer insert) can be inserted into an AAV vector between two contiguous amino acids where the amino acids in the AAV vector immediately preceding the n-mer insert can be DG or AQ. In some embodiments, the DG or AQ are the amino acids immediately preceding the n-mer insert in the capsid polypeptide when the n-mer insert is included in a capsid polypeptide, particularly an AAV capsid polypeptide. Without being bound by theory, inserts including a DG or AQ at the C terminal end or are inserted into a capsid polypeptide, such as an AAV capsid polypeptide, such that the insert(s) are immediately following an AQ or DG of the capsid polypeptide, may be able to transduce more hosts, such as more strains or species. In some embodiments, amino acids 587 and 588 of the AAV or analogous amino acids thereto are DG. In some embodiments, amino acids 587 and 588 of the AAV or analogous amino acids thereto are AQ. In some embodiments, amino acids 587 and 588 of the AAV or analogous amino acids thereto are AQ and are followed by an n-mer insert. In some embodiments, amino acids 587 and 588 of the AAV or analogous amino acids thereto are DG and are followed by an n-mer insert.

In some embodiments, the n-mer insert is such that when included in a host polypeptide (e.g., viral or AAV polypeptide, such as a capsid polypeptide) one or more residues of the host polypeptide are replaced with one or more of that from the n-mer insert. In some embodiments, when a C terminal AQ or DG are included in the n-mer insert but are not part of a P motif, the AQ or DG can optionally replace 1 or 2 amino acid residues immediately preceding where the P motif or double valine motif is to be inserted. For example, in some embodiments, where the P motif is desired to be inserted between e.g., 588 and 589 in an AAV9 or position analogous thereto in other AAVs, the n-mer insert can contain e.g., [e.g., AQ or DG]-[P motif or double valine motif]-Xn, where Xn is as described elsewhere herein with respect to the P motifs, where AQ or DG replaces residues 587 and 588 of the AAV9 or position analogous thereto in other AAVs leaving the P motif or double valine motif to be effectively inserted between positions 588 and 589 of the AAV9 or position analogous thereto in other AAVs. It will be appreciated that such an approach can be extrapolated to other host polypeptides besides AAVs as well as other positions within AAVs. Further this can be extrapolated to other C-terminal amino acids besides AQ or DG as the case may be (e.g., Xm in the context of P motifs or double valine motifs).

In some embodiments, the n-mer insert confers CNS transduction efficiency to the targeting moiety. At least Tables 1-3, 7-8, S1, FIGS. 15A, 15B, 16A, 16B, 16C, 19A-19C represent exemplary variants having CNS transduction efficiency. As further discussed in the Working Examples herein, engineered AAV variants such as at least in Table 1 were able to transduce cells from multiple strains of mice. This is in contrast to other AAVs, which at least in some cases, can only transduce certain strains of mice.

In some embodiments, an AAV capsid can contain one or more targeting moieties having one or more n-mer inserts that are each immediately preceded by AQ and wherein the n-mer insert is KTVGTVY (SEQ ID NO: 3), RSVGSVY (SEQ ID NO: 4), RYLGDAS (SEQ ID NO: 5), WVLPSGG (SEQ ID NO: 6), VTVGSIY (SEQ ID NO: 7), VRGSSIL (SEQ ID NO: 8), RHHGDAA (SEQ ID NO: 9), VIQAMKL (SEQ ID NO: 10), LTYGMAQ (SEQ ID NO: 11), LRIGLSQ (SEQ ID NO: 12), GDYSMIV (SEQ ID NO: 13), VNYSVAL (SEQ ID NO: 14), RHIADAS (SEQ ID NO: 15), RYLGDAT (SEQ ID NO: 16), QRVGFAQ (SEQ ID NO: 17), QIAHGYST (SEQ ID NO: 18), WTLESGH (SEQ ID NO: 19); or GENSARW (SEQ ID NO: 20). In some embodiments, an AAV capsid can contain one or more targeting moieties having one or more n-mer inserts that are each immediately preceded by DG and wherein the n-mer insert is REQQKLW (SEQ ID NO: 21), ASNPGRW (SEQ ID NO: 22), WTLESGH (SEQ ID NO: 23), REQKKLW (SEQ ID NO: 24), ERLLVQL (SEQ ID NO: 25); or RMQRTLY (SEQ ID NO: 26). In some embodiments, amino acids 587 and 588 of the AAV or analogous amino acids thereto are DG and are followed by a 7-mer amino acid insert. In some embodiments, amino acids 587 and 588 of the AAV or analogous amino acids thereto are DG and are followed by a 7-mer amino acid insert, where the 7-mer insert is REQQKLY (SEQ ID NO: 64), ASNPGRW (SEQ ID NO: 22), WTLESGH (SEQ ID NO: 23, REQKKLW (SEQ ID NO: 24), ERLLVQL (SEQ ID NO: 25); or RMQRTLY (SEQ ID NO: 26).

In some embodiments, the AAV capsids can be CNS-specific. In some embodiments, CNS-specificity of the engineered AAV capsid is conferred by a CNS specific n-mer insert incorporated in the engineered AAV capsid. While not intending to be bound by theory, it is believed that the n-mer insert confers a 3D structure to or within a domain or region of the engineered AAV capsid such that the interaction of an engineered AAV containing said engineered AAV capsid has increased or improved interactions (e.g., increased affinity) with a cell surface receptor and/or other molecule on the surface of an endothelial and/or a CNS cell. In some embodiments the cell surface receptor is AAV receptor (AAVR). In some embodiments, the cell surface receptor is a CNS cell specific AAV receptor. In some embodiments, a CNS specific engineered AAV containing the CNS-specific capsid can have an increased transduction rate, efficiency, amount, or a combination thereof in a CNS cell as compared to other cell types and/or other AAVs that do not contain a CNS-specific engineered AAV capsid.

TABLE 1
Exemplary CNS n-mer inserts
Variant
Initial ″AQ″ or ″DG″ in the
inserts in Table 1 correspond
to the two amino acids in the
targeting moiety that immediately
precede the ″n-mer insert″ in a
targeting moiety or composition
(e.g., AA 587 and 588 of an AAV9
that has an n-mer insert placedCNS Transduction efficiency
between AA 588 and 589).ScoreSEQ ID NO:
AQRSVGSVY4600065
AQKTVGTVY4598066
AQRYLGDAS4059267
DGREQQKLW3915168
AQWVLPSGG3759769
AQVTVGSIY3296870
AQVRGSSIL3233071
AQRHHGDAA3217172
AQVIQAMKL3212773
AQLTYGMAQ3195674
AQLRIGLSQ3171075
AQGDYSMIV3149776
AQVNYSVAL3127177
AQRYSGDAS3119878
AQRYSGDSV2986079
AQRHIADAS2955480
AQRYLGDAT2952781
AQQRVGFAQ2945482
AQIAHGYST2821683
AQWTLESGH2747184
AQGENSARW2728785
DGASNPGRW2458386
AQLAVGQKW2444587
AQVKLGYSQ2391288
AQEAGSARW2388889
AQLNYSVSL2197290
AQWAISDGY2197091
AQRGPGLSQ2173892
DGWTLESGH2053493
AQRYVGESS1963594
DGREQKKLW1769595
AQFTLTTPK1560796
DGERLLVQL1551397
AQEDLLRLR1492098
AQPIIEHAV1283799
DGRMQRTLY12453100
DGWAISDGY10828101
AQRYISDSA10788102
AQWSTSSGF10614103
AQWSLGSGH10498104
AQWSQSSGY10258105
DGVRGSSIL9714106
AQIMLGYST9404107
DGKLADSVP9356108
AQASNPGRW9173109
AQHVENWHI8680110
AQVAGSSIL8645111
DGRQQQKLW8393112
DGTVNNDRF8028113
DGMSANERT8000114
AQATVAGQF7885115
DGRDQQKLW7761116
AQGKSPGVW7685117
DGGASNGGT7674118
AQSLVTSST6782119
AQLLYGYSS6779120
DGVTELTKF6655121
AQALVQNGV6638122
AQVLESNPR6572123
AQPASHEVL6460124
AQAGVQNAL6452125
DGKEISVSV6420126
AQGLNERVA6410127
DGQVAQQGA6393128
DGGVAGTNT6386129
DGASAQGAL6382130
AQAGVSSQT6357131
AQKNRRHSV6312132
AQKVDSAQL6311133
AQYTLSQGW6310134
DGQSVDRSK6293135
AQASASSPR6266136
DGRYVGESS6252137
DGLGHNAGV6239138
AQPNERINV6142139
AQVMSGTSH6122140
DGVLVSPGP6000141
DGVGISSGV6000142
DGSGETLRI5977143
DGSTEGAAL5954144
AQTSLSQDR5943145
AQSANPVVT5937146
DGVLASNGP5898147
AQAHLDNAP5893148
DGVVQVTGR5875149
DGFAVRLSS5855150
DGLVRDTKT5811151
DGSGESLSR5804152
AQTNEQAQR5796153
DGTLANSQR5746154
AQLLADKSV5680155
DGSQEQRAR5679156
AQVNGNTTY5655157
AQALAEAGA5624158
DGSREGGNV5580159
AQMGDSVTI5574160
DGLGGSSMG5565161
AQGVRDTNI5562162
DGSGSTDKL 5556163
AQASQNSTV5493164
AQGGTSSGH5462165
AQAADSSVR5404166
AQAANSSVR5387167
AQWADSKDQ5374168
AQPTQGTVR5353169
AQGSTDFKT5344170
AQVDHGGVV5342171
AQGEQQKGW5322172
DGIANLAAS5311173
DGAGGVRDR5299174
DGGSGSGGL5252175
DGTLANSER5237176
AQKGASVTL5236177
AQSNVALTG5235178
DGVNYSVAL5206179
AQGLNEHGA5193180
DGKNPGVYT5173181
DGQREAARI5173182
AQGLVDSSR5168183
DGNGSEGDR5157184
DGNVGVVQL5144185
AQVTDGVRS5109186
AQVIASNEH5109187
AQMSVGQSW5098188
DGHSLQTSA5096189
AQQDGYGTR5093190
AQLSNGQGP5071191
AQPVTDSKM5068192
AQNGTAADR5057193
AQIIVDNGS5024194
AQEADNHGR5023195
AQAADSSGR4995196
AQVVDSNNL4986197
DGSGANLSY4985198
DGKAHDGEV4978199
TABLE 2
Additional Exemplary CNS n-mer inserts
N-merSEQ IDSEQ ID
RankinsertNO:Encoding sequenceNO:
 1PSQGTLR200CCTTCTCAGGGGACGCTTCGG201
 2TDALTTK202ACTGATGCGCTTACGACTAAG203
 3PTQGTVR204CCCACACAAGGCACAGTCCGT205
 4PTQGTLR206CCTACTCAGGGGACGCTTCGG207
 5PTQGTVR208CCTACTCAGGGGACGGTTCGG209
 6STIPTMK210AGTACTATTCCTACTATGAAG211
 7TDAGDGK212ACAGACGCGGGGGACGGCAAA213
 8YQRTESL214TATCAGAGGACGGAGTCTCTG215
 9RVDPSGL216AGAGTCGACCCCAGTGGACTA217
10SLVTSST218TCGCTTGTTACTTCTAGTACG219
11LLAGADR220TTGCTTGCTGGTGCTGATCGT221
12STDRESR222TCCACGGACCGTGAAAGCCGA223
13NGYTEGR224AATGGGTATACGGAGGGGCGT225
14PTQGTFR226CCGACACAAGGAACATTCAGG227
15MTGISIV228ATGACAGGCATCTCTATCGTA229
16DGRAELR230GATGGGCGGGCGGAGTTGCGT231
17AADSSAR232GCCGCTGACTCATCGGCCCGT233
18PTQGTIR234CCTACTCAGGGGACGATTCGG235
19LSRGEEK236CTTTCGAGGGGTGAGGAGAAG237
20AIVSIAQ238GCGATTGTGTCGATTGCTCAG239
21LTSGLAA240TTGACGTCTGGTTTGGCGGCG241
22PTQGTFR242CCTACTCAGGGGACGTTTCGG243
23TLAISGR244ACTTTGGCGATTTCTGGGCGG245
24VHSQDVS246GTCCACAGTCAAGACGTTTCC247
25FQVEQVK248TTTCAGGTTGAGCAGGTTAAG249
26NRELALG250AACCGCGAACTCGCACTCGGG251
27SIGDLGK252AGTATCGGTGACCTAGGTAAA253
28TVGHDNK254ACCGTAGGACACGACAACAAA255
29HSKGFDY256CACAGTAAAGGTTTCGACTAC257
30HTQGTLR258CATACTCAGGGGACGCTTCGG259
31PAQGTLR260CCGGCGCAAGGAACACTACGA261
32AGGGDPR262GCTGGTGGAGGTGACCCCCGA263
33LGKADPV264TTGGGAAAAGCTGACCCAGTA265
34ALNEHVA266GCTCTGAATGAGCATGTGGCG267
35GSGGVSV268GGTTCGGGTGGTGTTAGTGTG269
36PSQGTLR270CCGTCCCAAGGAACACTCAGG271
37TGGRDQY272ACTGGTGGTCGGGATCAGTAT273
38YLVTTEN274TATTTGGTTACTACTGAGAAT275
39LSRDVAV276TTGTCGAGGGATGTGGCGGTT277
40RIVDSVP278AGGATTGTGGATAGTGTTCCG279
41KGYDTPM280AAAGGCTACGACACACCCATG281
42TSREEQW282ACTTCTCGTGAGGAGCAGTGG283
43RASADVV284AGGGCGAGTGCGGATGTTGTG285
44NLGAALS286AACCTTGGGGCTGCCCTATCG287
45SVTDIKH288TCGGTGACGGACATAAAACAC289
46FQDTIGV290TTTCAGGATACGATTGGGGTG291
47PNERLAV292CCTAACGAACGATTGGCAGTC293
48HTIAASM294CACACCATAGCCGCAAGTATG295
49NSDLMGR296AACAGTGACCTAATGGGCCGA297
50AGVSASL298GCGGGTGTTTCTGCGTCGTTG299
TABLE 3
Exemplary P-motifs
n-merSEQSEQ
insertID NO:Encoding Sequence(s)ID NO:
PSQGTLR300CCTTCTCAGGGGACGCTTCGG;301
CCGTCCCAAGGAACACTCAGG302
PTQGTVR303CCCACACAAGGCACAGTCCGT;304
CCTACTCAGGGGACGGTTCGG305
PTQGTLR306CCTACTCAGGGGACGCTTCGG307
PTQGTFR308CCGACACAAGGAACATTCAGG;309
CCTACTCAGGGGACGTTTCGG310
PTQGTIR311CCTACTCAGGGGACGATTCGG312
PAQGTLR313CCGGCGCAAGGAACACTACGA314

Also described herein are polynucleotides that encode the engineered targeting moieties, viral polypeptides (e.g., capsid polypeptides), and other polypeptides described herein, including but not limited to, the engineered AAV capsids described herein. In some embodiments, the engineered AAV capsid encoding polynucleotide can be included in a polynucleotide that is configured to be an AAV genome donor in an AAV vector system that can be used to generate engineered AAV particles described elsewhere herein.

In some embodiments, the AAV capsids or other viral capsids or compositions can be CNS-specific. In some embodiments, CNS-specificity of the engineered AAV or other viral capsid or other composition is conferred by one or more CNS specific n-mer inserts incorporated in the engineered AAV or other viral capsid or other composition described herein. While not intending to be bound by theory, it is believed that the n-mer insert confers a 3D structure to or within a domain or region of the engineered AAV capsid or other viral capsid or other composition such that the interaction of the viral particle or other composition containing the engineered AAV capsid or other viral capsid or other composition described herein has increased or improved interactions (e.g., increased affinity) with a cell surface receptor and/or other molecule on the surface of a CNS cell. In some embodiments, the cell surface receptor is AAV receptor (AAVR). In some embodiments, the cell surface receptor is a CNS cell specific AAV receptor. In some embodiments, the cell surface receptor or other molecule is a cell surface receptor or other molecule selectively expressed on the surface of a CNS cell.

In some embodiments the engineered viral (e.g., AAV) capsid encoding polynucleotide can be operably coupled to a poly adenylation tail. In some embodiments, the poly adenylation tail can be an SV40 poly adenylation tail. In some embodiments, the viral (e.g., AAV) capsid encoding polynucleotide can be operably coupled to a promoter. In some embodiments, the promoter can be a tissue specific promoter. In some embodiments, neurons an/or supporting cells (e.g., astrocytes, glial cells, Schwann cells, etc.), and combinations thereof. In some embodiments, the promoter can be a constitutive promoter. Suitable tissue specific promoters and constitutive promoters are discussed elsewhere herein and are generally known in the art and can be commercially available.

Suitable neuronal tissue/cell specific promoters include, but are not limited to, GFAP promoter (astrocytes), SYN1 promoter (neurons), and NSE/RU5′ (mature neurons).

Other suitable CNS specific promoters can include, but are not limited to, neuroactive peptide cholecystokinin (CCK) (see e.g., Chhatawl et al. Gene Therapy volume 14, pages 575-583(2007)), a brain specific DNA MiniPromoter (such as any of those identified for brain or pan-neronal expression as in de Leeuw et al. Mol. Therapy. 1(5): 2014. doi:10.1038/mtm.2013.5), myelin basic promoter (MBP) (see e.g., von Jonquieres, G., Mersmann, N., Klugmann, C. B., Harasta, A. E., Lutz, B., Teahan, O., et al. (2013). Glial promoter selectivity following AAV-delivery to the immature brain. PLoS One 8 (6), e65646. doi: 10.1371/journal.pone.0065646), glial fibrillary acid protein (GFAP) for expression in astrocytes (see e.g., Smith-Arica, J. R., Morelli, A. E., Larregina, A. T., Smith, J., Lowenstein, P. R., Castro, M. G. (2000). Cell-type-specific and regulatable transgenesis in the adult brain: adenovirus-encoded combined transcriptional targeting and inducible transgene expression. Mol. Ther. 2 (6), 579-587. doi: 10.1006/mthe.2000.0215 and Lee, Y., Messing, A., Su, M., Brenner, M. (2008). GFAP promoter elements required for region-specific and astrocyte-specific expression. Glia 56 (5), 481-493. doi: 10.1002/glia.20622), human myelin associated glycoprotein promoter (full-length or truncated) (see e.g., von Jonquieres, G., Frohlich, D., Klugmann, C. B., Wen, X., Harasta, A. E., Ramkumar, R., et al. (2016). Recombinant human myelin-associated glycoprotein promoter drives selective AAV-mediated transgene expression in oligodendrocytes. Front. Mol. Neurosci. 9, 13. doi: 10.3389/fnmol.2016.00013), F4/80 promoter (see e.g., Rosario, A. M., Cruz, P. E., Ceballos-Diaz, C., Strickland, M. R., Siemienski, Z., Pardo, M., et al. (2016). Microglia-specific targeting by novel capsid-modified AAV6 vectors. Mol. Ther. Methods Clin. Dev. 3, 16026. doi: 10.1038/mtm.2016.26), phosphate-activated glutaminase (PAG) or the vesicular glutamate transporter (vGLUT) promoter (for about 90% glutamatergic neuron-specific expression) (see e.g., Rasmussen, M., Kong, L., Zhang, G. R., Liu, M., Wang, X., Szabo, G., et al. (2007). Glutamatergic or GABAergic neuron-specific, long-term expression in neocortical neurons from helper virus-free HSV-1 vectors containing the phosphate-activated glutaminase, vesicular glutamate transporter-1, or glutamic acid decarboxylase promoter. Brain Res. 1144, 19-32. doi: 10.1016/j.brainres.2007.01.125), glutamic acid decarboxylase (GAD) promoter (for about 90% GABAergic neuron-specific expression) (see e.g., Rasmussen, M., Kong, L., Zhang, G. R., Liu, M., Wang, X., Szabo, G., et al. (2007). Glutamatergic or GABAergic neuron-specific, long-term expression in neocortical neurons from helper virus-free HSV-1 vectors containing the phosphate-activated glutaminase, vesicular glutamate transporter-1, or glutamic acid decarboxylase promoter. Brain Res. 1144, 19-32. doi: 10.1016/j.brainres.2007.01.125), MeCP2 promoter (see e.g., Gray et al. Hum Gene Ther. 2011 September; 22(9):1143-53. doi: 10.1089/hum.2010.245), and retinoblastoma gene promoter (see e.g., Jiang et al., J. Biol. Chem. 2001. 276, 593-600).

Suitable constitutive promoters include, but are not limited to CMV, RSV, SV40, EF1alpha, CAG, and beta-actin.

A AVs with Reduced Non-CNS Cell Specificity

In some embodiments, the n-mer insert(s) and/or P-motif(s) are inserted into an AAV polypeptide (e.g., an AAV capsid polypeptide) that has reduced specificity (or no detectable, measurable, or clinically relevant interaction) for one or more non-CNS cell types. Exemplary non-CNS cell types include, but are not limited to, liver, kidney, lung, heart, spleen, muscle (skeletal and cardiac), bone, immune, stomach, intestine, eye, skin cells and the like. In some embodiments, the non-CNS cells are liver cells.

In certain example embodiments, the AAV capsid polypeptide is an engineered AAV capsid polypeptide having reduced or eliminated uptake in a non-CNS cell as compared to a corresponding wild-type AAV capsid polypeptide.

In certain example embodiments, the non-CNS cell is a liver cell.

In certain example embodiments, the wild-type capsid polypeptide is an AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV rh.74, AAV rh.10, AAV12, AAV.DJ, AAV.ie, AAV1.9-3, AAV.Anc80, AAV.Anc80L65, AAV2.7m8, or AAV8BP2 capsid polypeptide.

In certain example embodiments, the engineered AAV capsid polypeptide comprises one or more mutations that result in reduced or eliminated uptake in a non-CNS cell. In certain example embodiments, the engineered AAV capsid polypeptide comprises one or more mutations that result in reduced or eliminated uptake in a non-CNS cell as compared to a CNS cell. In certain example embodiments, the engineered AAV capsid polypeptide comprises one or more mutations that result in increased update in a CNS cell as compared to a non-CNS cell, where such a mutation is not the inclusion of a targeting moiety of the present invention, but a mutation that is in addition to such a targeting moiety. In some embodiments, the non-CNS cell is a liver cell or a dorsal root ganglion neuron.

In certain example embodiments, the one or more mutations are in position 267, in position 269, in position 272, in position 504, in position 505, in position 585, in position 590, or any combination thereof in the AAV9 capsid polypeptide (SEQ ID NO: 1) or in one or more positions corresponding thereto in a non-AAV9 capsid polypeptide.

In certain example embodiments, the non-AAV9 capsid polypeptide is an AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV rh.74, AAV rh.10, AAV12, AAV.DJ, AAV.ie, AAV1.9-3, AAV.Anc80, AAV.Anc80L65, AAV2.7m8, or AAV8BP2 capsid polypeptide.

In certain example embodiments, the mutation in position 267 in the AAV9 capsid polypeptide (SEQ ID NO: 1) or position corresponding thereto in a non-AAV9 capsid polypeptide is a G or X mutation to A, wherein X is any amino acid.

In certain example embodiments, the mutation in position 269 in the AAV9 capsid polypeptide (SEQ ID NO: 1) or position corresponding thereto in a non-AAV9 capsid polypeptide is an S or X to T mutation, wherein X is any amino acid.

In certain example embodiments, the mutation in position 272 in the AAV9 capsid polypeptide (SEQ ID NO: 1) or position corresponding thereto in a non-AAV9 capsid polypeptide is an N or to A mutation, wherein X is any amino acid. See also, e.g., International Patent Application Publication No. WO2018119330.

In certain example embodiments, the mutation in position 504 in the AAV9 capsid polypeptide (SEQ ID NO: 1) or position corresponding thereto in a non-AAV9 capsid polypeptide is a G or X to A mutation, wherein X is any amino acid.

In certain example embodiments, the mutation in position 505 in the AAV9 capsid polypeptide (SEQ ID NO: 1) or position corresponding thereto in a non-AAV9 capsid polypeptide is a P or X to A mutation, wherein X is any amino acid.

In certain example embodiments, the mutation in position 585 in the AAV9 capsid polypeptide (SEQ ID NO: 1) or position corresponding thereto in a non-AAV9 capsid polypeptide is an R or X to Q mutation, wherein X is any amino acid.

In certain example embodiments, the mutation in position 590 in the AAV9 capsid polypeptide (SEQ ID NO: 1) or position corresponding thereto in a non-AAV9 capsid polypeptide is a Q or X to A mutation, wherein X is any amino acid.

In certain example embodiments, the engineered AAV capsid polypeptide is an engineered AAV9 capsid polypeptide comprising a mutation at position 267, position 269 or both of a wild-type AAV9 capsid polypeptide (SEQ ID NO: 1), wherein the mutation at position 267 is a G to A mutation and wherein the mutation at position 269 is an S to T mutation.

In certain example embodiments, the engineered AAV capsid polypeptide is an engineered AAV9 capsid polypeptide comprising a mutation at position 590 of a wild-type AAV9 capsid polypeptide (SEQ ID NO: 1), wherein the mutation at position 509 is a Q to A mutation.

In certain example embodiments, the engineered AAV capsid polypeptide is an engineered AAV9 capsid polypeptide comprising a mutation at position 504, position 505, or both of a wild-type AAV9 capsid polypeptide (SEQ ID NO: 1), wherein the mutation at position 504 is a G to A mutation and wherein the mutation at position 505 is a P to A mutation.

In some embodiments, the AAV capsid polypeptide in which the n-mer insert(s) and/or P motif(s), and/or double valine motifs are inserted are 80-100 (e.g., 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, to/or 100) percent identical to SEQ ID NO: 4 or SEQ ID NO: 5 of International Patent Application Publication WO 2019/217911, which is incorporated by reference as if expressed in its entirety herein. These sequences are also incorporated herein as SEQ ID NOS: 330 and 331 respectively. It will be appreciated that when considering variants of these AAV9 capsid proteins with reduced liver specificity, that residues 267 and/or 269 must contain the relevant mutations or equivalents.

In some embodiments, the AAV capsid polypeptide in which the in which the n-mer insert(s), such as an n-mer insert containing a P-motif and/or double valine motif, is/are inserted can be 80-100 (e.g., 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, to/or 100) percent identical to any of those described in Adachi et al., (Nat. Comm. 2014. 5:3075, DOI: 10.1038/ncomms4075) that have reduced specificity for a non-CNS cell, particularly a liver cell. Adachi et al., (Nat. Comm. 2014. 5:3075, DOI: 10.1038/ncomms4075) is incorporated by reference herein as if expressed in its entirety.

In some embodiments, the modified AAV can have about a 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 percent or fold reduction in specificity for a non-CNS cells as compared to a wild-type AAV or control. In some embodiments, the modified AAV can have no measurable or detectable uptake and/or expression in one or more non-CNS cells.

In some embodiments, the AAV capsid protein in which the n-mer insert(s) and/or P motif(s), and/or double valine motifs are inserted are 80-100 (e.g., 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, to/or 100) percent identical to any one of those set forth in International Patent Application Pub. WO 2018119330.

Also provided herein are methods of generating engineered AAV capsids. The engineered AAV capsid variants can be variants of wild-type AAV capsids. FIGS. 6A-8 can illustrate various embodiments of methods capable of generating engineered AAV capsids described herein. Generally, an AAV capsid library can be generated by expressing engineered capsid vectors each containing an engineered AAV capsid polynucleotide previously described in an appropriate AAV producer cell line. See e.g., FIG. 8. It will be appreciated that although FIG. 8 shows a helper-dependent method of AAV particle production, it will be appreciated that this can be done via a helper-free method as well. This can generate an AAV capsid library that can contain one more desired cell-specific engineered AAV capsid variant. As shown in FIG. 6 the AAV capsid library can be administered to various non-human animals for a first round of mRNA-based selection. As shown in FIG. 1, the transduction process by AAVs and related vectors can result in the production of an mRNA molecule that is reflective of the genome of the virus that transduced the cell. As is at least demonstrated in the Examples herein, mRNA based-selection can be more specific and effective to determine a virus particle capable of functionally transducing a cell because it is based on the functional product produced as opposed to just detecting the presence of a virus particle in the cell by measuring the presence of viral DNA.

After first-round administration, one or more engineered AAV virus particles having a desired capsid variant can then be used to form a filtered AAV capsid library. Desirable AAV virus particles can be identified by measuring the mRNA expression of the capsid variants and determining which variants are highly expressed in the desired cell type(s) as compared to non-desired cells type(s). Those that are highly expressed in the desired cell, tissue, and/or organ type are the desired AAV capsid variant particles. In some embodiments, the AAV capsid variant encoding polynucleotide is under control of a tissue-specific promoter that has selective activity in the desired cell, tissue, or organ.

The engineered AAV capsid variant particles identified from the first round can then be administered to various non-human animals. In some embodiments, the animals used in the second round of selection and identification are not the same as those animals used for first round selection and identification. Similar to round 1, after administration the top expressing variants in the desired cell, tissue, and/or organ type(s) can be identified by measuring viral mRNA expression in the cells. The top variants identified after round two can then be optionally barcoded and optionally pooled. In some embodiments, top variants from the second round can then be administered to a non-human primate to identify the top cell-specific variant(s), particularly if the end use for the top variant is in humans. Administration at each round can be systemic.

In some embodiments, the method of generating an AAV capsid variant can include the steps of: (a) expressing a vector system described herein that contains an engineered AAV capsid polynucleotide in a cell to produce engineered AAV virus particle capsid variants; (b) harvesting the engineered AAV virus particle capsid variants produced in step (a); (c) administering engineered AAV virus particle capsid variants to one or more first subjects, wherein the engineered AAV virus particle capsid variants are produced by expressing an engineered AAV capsid variant vector or system thereof in a cell and harvesting the engineered AAV virus particle capsid variants produced by the cell; and (d) identifying one or more engineered AAV capsid variants produced at a significantly high level by one or more specific cells or specific cell types in the one or more first subjects. In this context, “significantly high” can refer to a titer that can range from between about 2×1011 to about 6×1012 vector genomes per 15 cm dish.

The method can further include the steps of: (e) administering some or all engineered AAV virus particle capsid variants identified in step (d) to one or more second subjects; and (f) identifying one or more engineered AAV virus particle capsid variants produced at a significantly high level in one or more specific cells or specific cell types in the one or more second subjects. The cell in step (a) can be a prokaryotic cell or a eukaryotic cell. In some embodiments, the administration in step (c), step (e), or both is systemic. In some embodiments, one or more first subjects, one or more second subjects, or both, are non-human mammals. In some embodiments, one or more first subjects, one or more second subjects, or both, are each independently selected from the group consisting of: a wild-type non-human mammal, a humanized non-human mammal, a disease-specific non-human mammal model, and a non-human primate.

Also provided herein are vectors and vector systems that can contain one or more of the engineered polynucleotides, (e.g., an AAV capsid polynucleotide) described herein. As used in this context, engineered viral (e.g., AAV) capsid polynucleotides refers to any one or more of the polynucleotides described herein capable of encoding an engineered viral (e.g., AAV) capsid as described elsewhere herein and/or polynucleotide(s) capable of encoding one or more engineered viral (e.g., AAV) capsid proteins described elsewhere herein. Further, where the vector includes an engineered viral (e.g., AAV) capsid polynucleotide described herein, the vector can also be referred to and considered an engineered vector or system thereof although not specifically noted as such. In embodiments, the vector can contain one or more polynucleotides encoding one or more elements of an engineered viral (e.g., AAV) capsid described herein. The vectors can be useful in producing bacterial, fungal, yeast, plant cells, animal cells, and transgenic animals that can express one or more components of the engineered viral (e.g., AAV) capsid described herein. Within the scope of this disclosure are vectors containing one or more of the polynucleotide sequences described herein. One or more of the polynucleotides that are part of the engineered viral (e.g., AAV) capsid and system thereof described herein can be included in a vector or vector system.

In some embodiments, the vector can include an engineered viral (e.g., AAV) capsid polynucleotide having a 3′ polyadenylation signal. In some embodiments, the 3′ polyadenylation is an SV40 polyadenylation signal. In some embodiments the vector does not have splice regulatory elements. In some embodiments, the vector includes one or more minimal splice regulatory elements. In some embodiments, the vector can further include a modified splice regulatory element, wherein the modification inactivates the splice regulatory element. In some embodiments, the modified splice regulatory element is a polynucleotide sequence sufficient to induce splicing, between a rep protein polynucleotide and the engineered viral (e.g., AAV) capsid protein variant polynucleotide. In some embodiments, the polynucleotide sequence can be sufficient to induce splicing is a splice acceptor or a splice donor. In some embodiments, the viral (e.g., AAV) capsid polynucleotide is an engineered viral (e.g., AAV) capsid polynucleotide as described elsewhere herein.

The vectors and/or vector systems can be used, for example, to express one or more of the engineered viral (e.g., AAV) capsid polynucleotides in a cell, such as a producer cell, to produce engineered viral (e.g., AAV) particles containing an engineered viral (e.g., AAV) capsid described elsewhere herein. Other uses for the vectors and vector systems described herein are also within the scope of this disclosure. In general, and throughout this specification, the term is a tool that allows or facilitates the transfer of an entity from one environment to another. In some contexts which will be appreciated by those of ordinary skill in the art, “vector” can be a term of art to refer to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked. A vector can be a replicon, such as a plasmid, phage, or cosmid, into which another DNA segment may be inserted so as to bring about the replication of the inserted segment. Generally, a vector is capable of replication when associated with the proper control elements.

Vectors include, but are not limited to, nucleic acid molecules that are single-stranded, double-stranded, or partially double-stranded; nucleic acid molecules that comprise one or more free ends, no free ends (e.g., circular); nucleic acid molecules that comprise DNA, RNA, or both; and other varieties of polynucleotides known in the art. One type of vector is a “plasmid,” which refers to a circular double stranded DNA loop into which additional DNA segments can be inserted, such as by standard molecular cloning techniques. Another type of vector is a viral vector, wherein virally-derived DNA or RNA sequences are present in the vector for packaging into a virus (e.g., retroviruses, replication defective retroviruses, adenoviruses, replication defective adenoviruses, and adeno-associated viruses (AAVs)). Viral vectors also include polynucleotides carried by a virus for transfection into a host cell. Certain vectors are capable of autonomous replication in a host cell into which they are introduced (e.g., bacterial vectors having a bacterial origin of replication and episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the genome of a host cell upon introduction into the host cell, and thereby are replicated along with the host genome. Moreover, certain vectors are capable of directing the expression of genes to which they are operatively-linked. Such vectors are referred to herein as “expression vectors.” Common expression vectors of utility in recombinant DNA techniques are often in the form of plasmids.

Recombinant expression vectors can be composed of a nucleic acid (e.g., a polynucleotide) of the invention in a form suitable for expression of the nucleic acid in a host cell, which means that the recombinant expression vectors include one or more regulatory elements, which can be selected on the basis of the host cells to be used for expression, that is operatively-linked to the nucleic acid sequence to be expressed. Within a recombinant expression vector, “operably linked” and “operatively-linked” are used interchangeably herein and further defined elsewhere herein. In the context of a vector, the term “operably linked” is intended to mean that the nucleotide sequence of interest is linked to the regulatory element(s) in a manner that allows for expression of the nucleotide sequence (e.g., in an in vitro transcription/translation system or in a host cell when the vector is introduced into the host cell). Advantageous vectors include adeno-associated viruses, and types of such vectors can also be selected for targeting particular types of cells, such as those engineered viral (e.g., AAV) vectors containing an engineered viral (e.g., AAV) capsid polynucleotide with a desired cell-specific tropism. These and other embodiments of the vectors and vector systems are described elsewhere herein.

In some embodiments, the vector can be a bicistronic vector. In some embodiments, a bicistronic vector can be used for one or more elements of the engineered viral (e.g., AAV) capsid system described herein. In some embodiments, expression of elements of the engineered viral (e.g., AAV) capsid system described herein can be driven by a suitable constitutive or tissue specific promoter. Where the element of the engineered viral (e.g., AAV) capsid system is an RNA, its expression can be driven by a Pol III promoter, such as a U6 promoter. In some embodiments, the two are combined.

Vectors can be designed for expression of one or more elements of the engineered targeting moieties, polypeptides, viral (e.g., AAV) capsid system described herein (e.g., nucleic acid transcripts, proteins, enzymes, and combinations thereof), etc. in a suitable host cell. In some embodiments, the suitable host cell is a prokaryotic cell. Suitable host cells include, but are not limited to, bacterial cells, yeast cells, insect cells, and mammalian cells. The vectors can be viral-based or non-viral based. In some embodiments, the suitable host cell is a eukaryotic cell. In some embodiments, the suitable host cell is a suitable bacterial cell. Suitable bacterial cells include, but are not limited to, bacterial cells from the bacteria of the species Escherichia coli. Many suitable strains of E. coli are known in the art for expression of vectors. These include, but are not limited to Pir1, Stbl2, Stbl3, Stbl4, TOP10, XL1 Blue, and XL10 Gold. In some embodiments, the host cell is a suitable insect cell. Suitable insect cells include those from Spodoptera frugiperda. Suitable strains of S. frugiperda cells include, but are not limited to, Sf9 and Sf21. In some embodiments, the host cell is a suitable yeast cell. In some embodiments, the yeast cell can be from Saccharomyces cerevisiae. In some embodiments, the host cell is a suitable mammalian cell. Many types of mammalian cells have been developed to express vectors. Suitable mammalian cells include, but are not limited to, HEK293, Chinese Hamster Ovary Cells (CHOs), mouse myeloma cells, HeLa, U20S, A549, HT1080, CAD, P19, NIH 3T3, L929, N2a, MCF-7, Y79, SO-Rb50, HepG G2, DIKX-X11, J558L, Baby hamster kidney cells (BHK), and chicken embryo fibroblasts (CEFs). Suitable host cells are discussed further in Goeddel, GENE EXPRESSION TECHNOLOGY: METHODS IN ENZYMOLOGY 185, Academic Press, San Diego, Calif. (1990).

In some embodiments, the vector can be a yeast expression vector. Examples of vectors for expression in yeast Saccharomyces cerevisiae include pYepSec1 (Baldari, et al., 1987. EMBO J. 6: 229-234), pMFa (Kuijan and Herskowitz, 1982. Cell 30: 933-943), pJRY88 (Schultz et al., 1987. Gene 54: 113-123), pYES2 (Invitrogen Corporation, San Diego, Calif.), and picZ (InVitrogen Corp, San Diego, Calif.). As used herein, a “yeast expression vector” refers to a nucleic acid that contains one or more sequences encoding an RNA and/or polypeptide and may further contain any desired elements that control the expression of the nucleic acid(s), as well as any elements that enable the replication and maintenance of the expression vector inside the yeast cell. Many suitable yeast expression vectors and features thereof are known in the art; for example, various vectors and techniques are illustrated in in Yeast Protocols, 2nd edition, Xiao, W., ed. (Humana Press, New York, 2007) and Buckholz, R. G. and Gleeson, M.A. (1991) Biotechnology (NY) 9(11): 1067-72. Yeast vectors can contain, without limitation, a centromeric (CEN) sequence, an autonomous replication sequence (ARS), a promoter, such as an RNA Polymerase III promoter, operably linked to a sequence or gene of interest, a terminator such as an RNA polymerase III terminator, an origin of replication, and a marker gene (e.g., auxotrophic, antibiotic, or other selectable markers). Examples of expression vectors for use in yeast may include plasmids, yeast artificial chromosomes, 2μ plasmids, yeast integrative plasmids, yeast replicative plasmids, shuttle vectors, and episomal plasmids.

In some embodiments, the vector is a baculovirus vector or expression vector and can be suitable for expression of polynucleotides and/or proteins in insect cells. Baculovirus vectors available for expression of proteins in cultured insect cells (e.g., SF9 cells) include the pAc series (Smith, et al., 1983. Mol. Cell. Biol. 3: 2156-2165) and the pVL series (Lucklow and Summers, 1989. Virology 170: 31-39). rAAV (recombinant Adeno-associated viral) vectors are preferably produced in insect cells, e.g., Spodoptera frugiperda Sf9 insect cells, grown in serum-free suspension culture. Serum-free insect cells can be purchased from commercial vendors, e.g., Sigma Aldrich (EX-CELL 405).

In some embodiments, the vector is a mammalian expression vector. In some embodiments, the mammalian expression vector is capable of expressing one or more polynucleotides and/or polypeptides in a mammalian cell. Examples of mammalian expression vectors include, but are not limited to, pCDM8 (Seed, 1987. Nature 329: 840) and pMT2PC (Kaufman, et al., 1987. EMBO J. 6: 187-195). The mammalian expression vector can include one or more suitable regulatory elements capable of controlling expression of the one or more polynucleotides and/or proteins in the mammalian cell. For example, commonly used promoters are derived from polyoma, adenovirus 2, cytomegalovirus, simian virus 40, and others disclosed herein and known in the art. More detail on suitable regulatory elements is described elsewhere herein.

For other suitable expression vectors and vector systems for both prokaryotic and eukaryotic cells see, e.g., Chapters 16 and 17 of Sambrook, et al., MOLECULAR CLONING: A LABORATORY MANUAL. 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989.

In some embodiments, the recombinant mammalian expression vector is capable of directing expression of the nucleic acid preferentially in a particular cell type (e.g., tissue-specific regulatory elements are used to express the nucleic acid). Tissue-specific regulatory elements are known in the art. Non-limiting examples of suitable tissue-specific promoters include the albumin promoter (liver-specific; Pinkert, et al., 1987. Genes Dev. 1: 268-277), lymphoid-specific promoters (Calame and Eaton, 1988. Adv. Immunol. 43: 235-275), in particular promoters of T cell receptors (Winoto and Baltimore, 1989. EMBO J. 8: 729-733) and immunoglobulins (Baneiji, et al., 1983. Cell 33: 729-740; Queen and Baltimore, 1983. Cell 33: 741-748), neuron-specific promoters (e.g., the neurofilament promoter; Byme and Ruddle, 1989. Proc. Natl. Acad. Sci. USA 86: 5473-5477), pancreas-specific promoters (Edlund, et al., 1985. Science 230: 912-916), and mammary gland-specific promoters (e.g., milk whey promoter; U.S. Pat. No. 4,873,316 and European Application Publication No. 264,166). Developmentally-regulated promoters are also encompassed, e.g., the murine hox promoters (Kessel and Gruss, 1990. Science 249: 374-379) and the a-fetoprotein promoter (Campes and Tilghman, 1989. Genes Dev. 3: 537-546). With regards to these prokaryotic and eukaryotic vectors, mention is made of U.S. Pat. No. 6,750,059, the contents of which are incorporated by reference herein in their entirety. Other embodiments can utilize viral vectors, with regards to which mention is made of U.S. patent application Ser. No. 13/092,085, the contents of which are incorporated by reference herein in their entirety. Tissue-specific regulatory elements are known in the art and in this regard, mention is made of U.S. Pat. No. 7,776,321, the contents of which are incorporated by reference herein in their entirety. In some embodiments, a regulatory element can be operably linked to one or more elements of an engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid system so as to drive expression of the one or more elements of the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid system described herein.

Vectors may be introduced and propagated in a prokaryote or prokaryotic cell. In some embodiments, a prokaryote is used to amplify copies of a vector to be introduced into a eukaryotic cell or as an intermediate vector in the production of a vector to be introduced into a eukaryotic cell (e.g., amplifying a plasmid as part of a viral vector packaging system). In some embodiments, a prokaryote is used to amplify copies of a vector and express one or more nucleic acids, such as to provide a source of one or more proteins for delivery to a host cell or host organism.

In some embodiments, the vector can be a fusion vector or fusion expression vector. In some embodiments, fusion vectors add a number of amino acids to a protein encoded therein, such as to the amino terminus, carboxy terminus, or both of a recombinant protein. Such fusion vectors can serve one or more purposes, such as: (i) to increase expression of recombinant protein; (ii) to increase the solubility of the recombinant protein; and (iii) to aid in the purification of the recombinant protein by acting as a ligand in affinity purification. In some embodiments, expression of polynucleotides (such as non-coding polynucleotides) and proteins in prokaryotes can be carried out in Escherichia coli with vectors containing constitutive or inducible promoters directing the expression of either fusion or non-fusion polynucleotides and/or proteins. In some embodiments, the fusion expression vector can include a proteolytic cleavage site, which can be introduced at the junction of the fusion vector backbone or other fusion moiety and the recombinant polynucleotide or protein to enable separation of the recombinant polynucleotide or protein from the fusion vector backbone or other fusion moiety subsequent to purification of the fusion polynucleotide or protein. Such enzymes, and their cognate recognition sequences, include Factor Xa, thrombin and enterokinase. Example fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith and Johnson, 1988. Gene 67: 31-40), pMAL (New England Biolabs, Beverly, Mass.) and pRIT5 (Pharmacia, Piscataway, N.J.) that fuse glutathione S-transferase (GST), maltose E binding protein, or protein A, respectively, to the target recombinant protein. Examples of suitable inducible non-fusion E. coli expression vectors include pTrc (Amrann et al., (1988) Gene 69:301-315) and pET 11d (Studier et al., GENE EXPRESSION TECHNOLOGY: METHODS IN ENZYMOLOGY 185, Academic Press, San Diego, Calif. (1990) 60-89).

In some embodiments, one or more vectors driving expression of one or more elements of an engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid system described herein are introduced into a host cell such that expression of the elements of the engineered delivery system described herein direct formation of an engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid system described herein (including but not limited to an engineered gene transfer agent particle, which is described in greater detail elsewhere herein). For example, different elements of the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid system described herein can each be operably linked to separate regulatory elements on separate vectors. RNA(s) of different elements of the engineered delivery system described herein can be delivered to an animal or mammal or cell thereof to produce an animal or mammal or cell thereof that constitutively or inducibly or conditionally expresses different elements of the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid system described herein that incorporates one or more elements of the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid system described herein or contains one or more cells that incorporates and/or expresses one or more elements of the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid system described herein.

In some embodiments, two or more of the elements expressed from the same or different regulatory element(s), can be combined in a single vector, with one or more additional vectors providing any components of the system not included in the first vector. Engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid system polynucleotides that are combined in a single vector may be arranged in any suitable orientation, such as one element located 5′ with respect to (“upstream” of) or 3′ with respect to (“downstream” of) a second element. The coding sequence of one element may be located on the same or opposite strand of the coding sequence of a second element, and oriented in the same or opposite direction. In some embodiments, a single promoter drives expression of a transcript encoding one or more engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid polypeptides, embedded within one or more intron sequences (e.g., each in a different intron, two or more in at least one intron, or all in a single intron). In some embodiments, the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid polynucleotides can be operably linked to and expressed from the same promoter.

The vectors can include additional features that can confer one or more functionalities to the vector, the polynucleotide to be delivered, a virus particle produced there from, or polypeptide expressed thereof. Such features include, but are not limited to, regulatory elements, selectable markers, molecular identifiers (e.g., molecular barcodes), stabilizing elements, and the like. It will be appreciated by those skilled in the art that the design of the expression vector and additional features included can depend on such factors as the choice of the host cell to be transformed, the level of expression desired, etc.

In embodiments, the polynucleotides and/or vectors thereof described herein (such as the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid polynucleotides of the present invention) can include one or more regulatory elements that can be operatively linked to the polynucleotide. The term “regulatory element” is intended to include promoters, enhancers, internal ribosomal entry sites (IRES), and other expression control elements (e.g., transcription termination signals, such as polyadenylation signals and poly-U sequences). Such regulatory elements are described, for example, in Goeddel, GENE EXPRESSION TECHNOLOGY: METHODS IN ENZYMOLOGY 185, Academic Press, San Diego, Calif. (1990). Regulatory elements include those that direct constitutive expression of a nucleotide sequence in many types of host cell and those that direct expression of the nucleotide sequence only in certain host cells (e.g., tissue-specific regulatory sequences). A tissue-specific promoter can direct expression primarily in a desired tissue and/or cells of interest, such as CNS cells and/or particular cell types therein (e.g., neurons and/or supporting cells (e.g., Schwan, astrocytes, glial cells, microglial cells, and/or the like). Regulatory elements may also direct expression in a temporal-dependent manner, such as in a cell-cycle dependent or developmental stage-dependent manner, which may or may not also be tissue or cell-type specific. In some embodiments, a vector comprises one or more pol III promoter (e.g., 1, 2, 3, 4, 5, or more pol III promoters), one or more pol II promoters (e.g., 1, 2, 3, 4, 5, or more pol II promoters), one or more pol I promoters (e.g., 1, 2, 3, 4, 5, or more pol I promoters), or combinations thereof. Examples of pol III promoters include, but are not limited to, U6 and H1 promoters. Examples of pol II promoters include, but are not limited to, the retroviral Rous sarcoma virus (RSV) LTR promoter (optionally with the RSV enhancer), the cytomegalovirus (CMV) promoter (optionally with the CMV enhancer) (see, e.g., Boshart et al, Cell, 41:521-530 (1985)), the SV40 promoter, the dihydrofolate reductase promoter, the J3-actin promoter, the phosphoglycerol kinase (PGK) promoter, and the EF1α promoter. Also encompassed by the term “regulatory element” are enhancer elements, such as WPRE; CMV enhancers; the R-U5′ segment in LTR of HTLV-I (Mol. Cell. Biol., Vol. 8(1), p. 466-472, 1988); SV40 enhancer; and the intron sequence between exons 2 and 3 of rabbit β-globin (Proc. Natl. Acad. Sci. USA., Vol. 78(3), p. 1527-31, 1981).

In some embodiments, the regulatory sequence can be a regulatory sequence described in U.S. Pat. No. 7,776,321, U.S. Pat. Pub. No. 2011/0027239, and PCT publication WO 2011/028929, the contents of which are incorporated by reference herein in their entirety. In some embodiments, the vector can contain a minimal promoter. In some embodiments, the minimal promoter is the Mecp2 promoter, tRNA promoter, or U6. In a further embodiment, the minimal promoter is tissue specific. In some embodiments, the length of the vector polynucleotide the minimal promoters and polynucleotide sequences is less than 4.4 Kb.

To express a polynucleotide, the vector can include one or more transcriptional and/or translational initiation regulatory sequences, e.g., promoters, that direct the transcription of the gene and/or translation of the encoded protein in a cell. In some embodiments a constitutive promoter may be employed. Suitable constitutive promoters for mammalian cells are generally known in the art and include, but are not limited to SV40, CAG, CMV, EF-1α, β-actin, RSV, and PGK. Suitable constitutive promoters for bacterial cells, yeast cells, and fungal cells are generally known in the art, such as a T-7 promoter for bacterial expression and an alcohol dehydrogenase promoter for expression in yeast.

In some embodiments, the regulatory element can be a regulated promoter. “Regulated promoter” refers to promoters that direct gene expression not constitutively, but in a temporally- and/or spatially-regulated manner, and includes tissue-specific, tissue-preferred and inducible promoters. In some embodiments, the regulated promoter is a tissue specific promoter as previously discussed elsewhere herein. Regulated promoters include conditional promoters and inducible promoters. In some embodiments, conditional promoters can be employed to direct expression of a polynucleotide in a specific cell type, under certain environmental conditions, and/or during a specific state of development. Suitable tissue specific promoters can include, but are not limited to, CNS tissue and cell specific promoters.

Suitable neuronal tissue/cell specific promoters include, but are not limited to, GFAP promoter (astrocytes), SYN1 promoter (neurons), and NSE/RU5′ (mature neurons).

Other suitable CNS specific promoters can include, but are not limited to, neuroactive peptide cholecystokinin (CCK) (see e.g., Chhatawl et al. Gene Therapy volume 14, pages 575-583(2007)), a brain specific DNA MiniPromoter (such as any of those identified for brain or pan-neronal expression as in de Leeuw et al. Mol. Therapy. 1(5): 2014. doi:10.1038/mtm.2013.5), myelin basic promoter (MBP) (see e.g., von Jonquieres, G., Mersmann, N., Klugmann, C. B., Harasta, A. E., Lutz, B., Teahan, O., et al. (2013). Glial promoter selectivity following AAV-delivery to the immature brain. PLoS One 8 (6), e65646. doi: 10.1371/journal.pone.0065646), glial fibrillary acid protein (GFAP) for expression in astrocytes (see e.g., Smith-Arica, J. R., Morelli, A. E., Larregina, A. T., Smith, J., Lowenstein, P. R., Castro, M. G. (2000). Cell-type-specific and regulatable transgenesis in the adult brain: adenovirus-encoded combined transcriptional targeting and inducible transgene expression. Mol. Ther. 2 (6), 579-587. doi: 10.1006/mthe.2000.0215 and Lee, Y., Messing, A., Su, M., Brenner, M. (2008). GFAP promoter elements required for region-specific and astrocyte-specific expression. Glia 56 (5), 481-493. doi: 10.1002/glia.20622), human myelin associated glycoprotein promoter (full-length or truncated) (see e.g., von Jonquieres, G., Frohlich, D., Klugmann, C. B., Wen, X., Harasta, A. E., Ramkumar, R., et al. (2016). Recombinant human myelin-associated glycoprotein promoter drives selective AAV-mediated transgene expression in oligodendrocytes. Front. Mol. Neurosci. 9, 13. doi: 10.3389/fnmol.2016.00013), F4/80 promoter (see e.g., Rosario, A. M., Cruz, P. E., Ceballos-Diaz, C., Strickland, M. R., Siemienski, Z., Pardo, M., et al. (2016). Microglia-specific targeting by novel capsid-modified AAV6 vectors. Mol. Ther. Methods Clin. Dev. 3, 16026. doi: 10.1038/mtm.2016.26), phosphate-activated glutaminase (PAG) or the vesicular glutamate transporter (vGLUT) promoter (for about 90% glutamatergic neuron-specific expression) (see e.g., Rasmussen, M., Kong, L., Zhang, G. R., Liu, M., Wang, X., Szabo, G., et al. (2007). Glutamatergic or GABAergic neuron-specific, long-term expression in neocortical neurons from helper virus-free HSV-1 vectors containing the phosphate-activated glutaminase, vesicular glutamate transporter-1, or glutamic acid decarboxylase promoter. Brain Res. 1144, 19-32. doi: 10.1016/j.brainres.2007.01.125), glutamic acid decarboxylase (GAD) promoter (for about 90% GABAergic neuron-specific expression) (see e.g., Rasmussen, M., Kong, L., Zhang, G. R., Liu, M., Wang, X., Szabo, G., et al. (2007). Glutamatergic or GABAergic neuron-specific, long-term expression in neocortical neurons from helper virus-free HSV-1 vectors containing the phosphate-activated glutaminase, vesicular glutamate transporter-1, or glutamic acid decarboxylase promoter. Brain Res. 1144, 19-32. doi: 10.1016/j.brainres.2007.01.125), MeCP2 promoter (see e.g., Gray et al. Hum Gene Ther. 2011 September; 22(9):1143-53. doi: 10.1089/hum.2010.245), and retinoblastoma gene promoter (see e.g., Jiang et al., J. Biol. Chem. 2001. 276, 593-600).

Other tissue and/or cell specific promoters are discussed elsewhere herein and can be generally known in the art and are within the scope of this disclosure.

Inducible/conditional promoters can be positively inducible/conditional promoters (e.g., a promoter that activates transcription of the polynucleotide upon appropriate interaction with an activated activator, or an inducer (compound, environmental condition, or other stimulus) or a negative/conditional inducible promoter (e.g., a promoter that is repressed (e.g., bound by a repressor) until the repressor condition of the promotor is removed (e.g. inducer binds a repressor bound to the promoter stimulating release of the promoter by the repressor or removal of a chemical repressor from the promoter environment).The inducer can be a compound, environmental condition, or other stimulus. Thus, inducible/conditional promoters can be responsive to any suitable stimuli such as chemical, biological, or other molecular agents, temperature, light, and/or pH. Suitable inducible/conditional promoters include, but are not limited to, Tet-On, Tet-Off, Lac promoter, pBad, AlcA, LexA, Hsp70 promoter, Hsp90 promoter, pDawn, XVE/OlexA, GVG, and pOp/LhGR.

Where expression in a plant cell is desired, the components of the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid system described herein are typically placed under control of a plant promoter, i.e., a promoter operable in plant cells. The use of different types of promoters is envisaged. In some embodiments, inclusion of an engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid system vector in a plant can be for AAV vector production purposes.

A constitutive plant promoter is a promoter that is able to express the open reading frame (ORF) that it controls in all or nearly all of the plant tissues during all or nearly all developmental stages of the plant (referred to as “constitutive expression”). One non-limiting example of a constitutive promoter is the cauliflower mosaic virus 35S promoter. Different promoters may direct the expression of a gene in different tissues or cell types, or at different stages of development, or in response to different environmental conditions. In particular embodiments, one or more of the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid system components are expressed under the control of a constitutive promoter, such as the cauliflower mosaic virus 35S promoter issue-preferred promoters can be utilized to target enhanced expression in certain cell types within a particular plant tissue, for instance vascular cells in leaves or roots or in specific cells of the seed. Examples of particular promoters for use in the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid system are found in Kawamata et al., (1997) Plant Cell Physiol 38:792-803; Yamamoto et al., (1997) Plant J 12:255-65; Hire et al., (1992) Plant Mol Biol 20:207-18; Kuster et al., (1995) Plant Mol Biol 29:759-72; and Capana et al., (1994) Plant Mol Biol 25:681-91.

Examples of promoters that are inducible and that can allow for spatiotemporal control of gene editing or gene expression may use a form of energy. The form of energy may include but is not limited to sound energy, electromagnetic radiation, chemical energy and/or thermal energy. Examples of inducible systems include tetracycline inducible promoters (Tet-On or Tet-Off), small molecule two-hybrid transcription activations systems (FKBP, ABA, etc.), or light inducible systems (Phytochrome, LOV domains, or cryptochrome)., such as a Light Inducible Transcriptional Effector (LITE) that direct changes in transcriptional activity in a sequence-specific manner. The components of a light inducible system may include one or more elements of the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid system described herein, a light-responsive cytochrome heterodimer (e.g., from Arabidopsis thaliana), and a transcriptional activation/repression domain. In some embodiments, the vector can include one or more of the inducible DNA binding proteins provided in PCT publication WO 2014/018423 and US Publications, 2015/0291966, 2017/0166903, 2019/0203212, which describe e.g., embodiments of inducible DNA binding proteins and methods of use and can be adapted for use with the present invention.

In some embodiments, transient or inducible expression can be achieved by including, for example, chemical-regulated promotors, i.e., whereby the application of an exogenous chemical induces gene expression. Modulation of gene expression can also be obtained by including a chemical-repressible promoter, where application of the chemical represses gene expression. Chemical-inducible promoters include, but are not limited to, the maize ln2-2 promoter, activated by benzene sulfonamide herbicide safeners (De Veylder et al., (1997) Plant Cell Physiol 38:568-77), the maize GST promoter (GST-ll-27, WO93/01294), activated by hydrophobic electrophilic compounds used as pre-emergent herbicides, and the tobacco PR-1 a promoter (Ono et al., (2004) Biosci Biotechnol Biochem 68:803-7) activated by salicylic acid. Promoters which are regulated by antibiotics, such as tetracycline-inducible and tetracycline-repressible promoters (Gatz et al., (1991) Mol Gen Genet 227:229-37; U.S. Pat. Nos. 5,814,618 and 5,789,156) can also be used herein.

In some embodiments, the vector or system thereof can include one or more elements capable of translocating and/or expressing an engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid polynucleotide to/in a specific cell component or organelle. Such organelles can include, but are not limited to, nucleus, ribosome, endoplasmic reticulum, golgi apparatus, chloroplast, mitochondria, vacuole, lysosome, cytoskeleton, plasma membrane, cell wall, peroxisome, centrioles, etc.

One or more of the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid polynucleotides can be operably linked, fused to, or otherwise modified to include a polynucleotide that encodes or is a selectable marker or tag, which can be a polynucleotide or polypeptide. In some embodiments, the polypeptide encoding a polypeptide selectable marker can be incorporated in the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid system polynucleotide such that the selectable marker polypeptide, when translated, is inserted between two amino acids between the N- and C-terminus of the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid polypeptide or at the N- and/or C-terminus of the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid polypeptide. In some embodiments, the selectable marker or tag is a polynucleotide barcode or unique molecular identifier (UMI).

It will be appreciated that the polynucleotide encoding such selectable markers or tags can be incorporated into a polynucleotide encoding one or more components of the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid system described herein in an appropriate manner to allow expression of the selectable marker or tag. Such techniques and methods are described elsewhere herein and will be instantly appreciated by one of ordinary skill in the art in view of this disclosure. Many such selectable markers and tags are generally known in the art and are intended to be within the scope of this disclosure.

Suitable selectable markers and tags include, but are not limited to, affinity tags, such as chitin binding protein (CBP), maltose binding protein (MBP), glutathione-S-transferase (GST), poly(His) tag; solubilization tags such as thioredoxin (TRX) and poly(NANP), MBP, and GST; chromatography tags such as those consisting of polyanionic amino acids, such as FLAG-tag; epitope tags such as V5-tag, Myc-tag, HA-tag and NE-tag; protein tags that can allow specific enzymatic modification (such as biotinylation by biotin ligase) or chemical modification (such as reaction with FlAsH-EDT2 for fluorescence imaging), DNA and/or RNA segments that contain restriction enzyme or other enzyme cleavage sites; DNA segments that encode products that provide resistance against otherwise toxic compounds including antibiotics, such as, spectinomycin, ampicillin, kanamycin, tetracycline, Basta, neomycin phosphotransferase II (NEO), hygromycin phosphotransferase (HPT)) and the like; DNA and/or RNA segments that encode products that are otherwise lacking in the recipient cell (e.g., tRNA genes, auxotrophic markers); DNA and/or RNA segments that encode products which can be readily identified (e.g., phenotypic markers such as β-galactosidase, GUS; fluorescent proteins such as green fluorescent protein (GFP), cyan (CFP), yellow (YFP), red (RFP), luciferase, and cell surface proteins); polynucleotides that can generate one or more new primer sites for PCR (e.g., the juxtaposition of two DNA sequences not previously juxtaposed), DNA sequences not acted upon or acted upon by a restriction endonuclease or other DNA modifying enzyme, chemical, etc.; epitope tags (e.g., GFP, FLAG- and His-tags), and, DNA sequences that make a molecular barcode or unique molecular identifier (UMI), DNA sequences required for a specific modification (e.g., methylation) that allows its identification. Other suitable markers will be appreciated by those of skill in the art.

Selectable markers and tags can be operably linked to one or more components of the engineered AAV capsid system described herein via suitable linker, such as a glycine or glycine serine linkers as short as GS or GG up to (GGGGG)3 (SEQ ID NO: 315) or (GGGGS)3 (SEQ ID NO: 316). Other suitable linkers are described elsewhere herein.

The vector or vector system can include one or more polynucleotides encoding one or more targeting moieties. In some embodiments, the targeting moiety encoding polynucleotides can be included in the vector or vector system, such as a viral vector system, such that they are expressed within and/or on the virus particle(s) produced such that the virus particles can be targeted to specific cells, tissues, organs, etc. In some embodiments, the targeting moiety encoding polynucleotides can be included in the vector or vector system such that the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid polynucleotide(s) and/or products expressed therefrom include the targeting moiety and can be targeted to specific cells, tissues, organs, etc. In some embodiments, such as non-viral carriers, the targeting moiety can be attached to the carrier (e.g., polymer, lipid, inorganic molecule etc.) and can be capable of targeting the carrier and any attached or associated engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid polynucleotide(s) to specific cells, tissues, organs, etc.

In some embodiments, the polynucleotide encoding one or more features of the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid system can be expressed from a vector or suitable polynucleotide in a cell-free in vitro system. In other words, the polynucleotide can be transcribed and optionally translated in vitro. In vitro transcription/translation systems and appropriate vectors are generally known in the art and commercially available. Generally, in vitro transcription and in vitro translation systems replicate the processes of RNA and protein synthesis, respectively, outside of the cellular environment. Vectors and suitable polynucleotides for in vitro transcription can include T7, SP6, T3, promoter regulatory sequences that can be recognized and acted upon by an appropriate polymerase to transcribe the polynucleotide or vector.

In vitro translation can be stand-alone (e.g., translation of a purified polyribonucleotide) or linked/coupled to transcription. In some embodiments, the cell-free (or in vitro) translation system can include extracts from rabbit reticulocytes, wheat germ, and/or E. coli. The extracts can include various macromolecular components that are needed for translation of exogenous RNA (e.g., 70S or 80S ribosomes, tRNAs, aminoacyl-tRNA, synthetases, initiation, elongation factors, termination factors, etc.). Other components can be included or added during the translation reaction, including but not limited to, amino acids, energy sources (ATP, GTP), energy regenerating systems (creatine phosphate and creatine phosphokinase (eukaryotic systems)) (phosphoenol pyruvate and pyruvate kinase for bacterial systems), and other co-factors (Mg2+, K+, etc.). As previously mentioned, in vitro translation can be based on RNA or DNA starting material. Some translation systems can utilize an RNA template as starting material (e.g., reticulocyte lysates and wheat germ extracts). Some translation systems can utilize a DNA template as a starting material (e.g., E coli-based systems). In these systems transcription and translation are coupled and DNA is first transcribed into RNA, which is subsequently translated. Suitable standard and coupled cell-free translation systems are generally known in the art and are commercially available.

As described elsewhere herein, the polynucleotide encoding one or more embodiments of the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid system described herein can be codon optimized. In some embodiments, one or more polynucleotides contained in a vector (“vector polynucleotides”) described herein that are in addition to an optionally codon optimized polynucleotide encoding embodiments of the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid system described herein can be codon optimized. In general, codon optimization refers to a process of modifying a nucleic acid sequence for enhanced expression in the host cells of interest by replacing at least one codon (e.g., about or more than about 1, 2, 3, 4, 5, 10, 15, 20, 25, 50, or more codons) of the native sequence with codons that are more frequently or most frequently used in the genes of that host cell while maintaining the native amino acid sequence. Various species exhibit particular bias for certain codons of a particular amino acid. Codon bias (differences in codon usage between organisms) often correlates with the efficiency of translation of messenger RNA (mRNA), which is in turn believed to be dependent on, among other things, the properties of the codons being translated and the availability of particular transfer RNA (tRNA) molecules. The predominance of selected tRNAs in a cell is generally a reflection of the codons used most frequently in peptide synthesis. Accordingly, genes can be tailored for optimal gene expression in a given organism based on codon optimization. Codon usage tables are readily available, for example, at the “Codon Usage Database” available at www.kazusa.ojp/codon/and these tables can be adapted in a number of ways. See Nakamura, Y., et al. “Codon usage tabulated from the international DNA sequence databases: status for the year 2000” Nucl. Acids Res. 28:292 (2000). Computer algorithms for codon optimizing a particular sequence for expression in a particular host cell are also available, such as Gene Forge (Aptagen; Jacobus, PA), are also available. In some embodiments, one or more codons (e.g., 1, 2, 3, 4, 5, 10, 15, 20, 25, 50, or more, or all codons) in a sequence encoding a DNA/RNA-targeting Cas protein corresponds to the most frequently used codon for a particular amino acid. As to codon usage in yeast, reference is made to the online Yeast Genome database available at http://www.yeastgenome.org/community/codon_usage.shtml, or Codon selection in yeast, Bennetzen and Hall, J Biol Chem. 1982 Mar. 25; 257(6):3026-31. As to codon usage in plants including algae, reference is made to Codon usage in higher plants, green algae, and cyanobacteria, Campbell and Gowri, Plant Physiol. 1990 January; 92(1):1-11.; as well as Codon usage in plant genes, Murray et al, Nucleic Acids Res. 1989 Jan. 25; 17(2):477-98; or Selection on the codon bias of chloroplast and cyanelle genes in diferent plant and algal lineages, Morton B R, J Mol Evol. 1998 April; 46(4):449-59.

The vector polynucleotide can be codon optimized for expression in a specific cell-type, tissue type, organ type, and/or subject type. In some embodiments, a codon optimized sequence is a sequence optimized for expression in a eukaryote, e.g., humans (i.e., being optimized for expression in a human or human cell), or for another eukaryote, such as another animal (e.g., a mammal or avian) as is described elsewhere herein. Such codon optimized sequences are within the ambit of the ordinary skilled artisan in view of the description herein. In some embodiments, the polynucleotide is codon optimized for a specific cell type. Such cell types can include, but are not limited to, CNS epithelial cells (including but not limited to the cells lining the brain ventricles), nerve cells (nerves, brain cells, spinal column cells, nerve support cells (e.g., astrocytes, glial cells, Schwann cells etc.), connective tissue cells of the CNS (fat and other soft tissue padding cells of the CNS such as the meninges), stem cells and other progenitor cells, CNS immune cells, germ cells, and combinations thereof. Such codon optimized sequences are within the ambit of the ordinary skilled artisan in view of the description herein. In some embodiments, the polynucleotide is codon optimized for a specific tissue type. Such tissue types can include, but are not limited to, CNS tissue and/or cells thereof. Such codon optimized sequences are within the ambit of the ordinary skilled artisan in view of the description herein. In some embodiments, the polynucleotide is codon optimized for a specific organ. Such organs include, but are not limited to, the brain. Such codon optimized sequences are within the ambit of the ordinary skilled artisan in view of the description herein.

In some embodiments, a vector polynucleotide is codon optimized for expression in particular cells, such as prokaryotic or eukaryotic cells. The eukaryotic cells may be those for derived from a particular organism, such as a plant or a mammal, including but not limited to human, or non-human eukaryote or animal or mammal as discussed herein, e.g., mouse, rat, rabbit, dog, livestock, or non-human mammal or primate.

Viral Vecto and/or Cargo Engineering for Reduced Immunogenicity and/or Toxicity

In some embodiments, the viral genome (such as an AAV genome) and/or cargo (e.g., cargo polynucleotide) is engineered to increase delivery and/or expression efficiency or to otherwise optimize delivery and/or expression efficiency so as to reduce immunogenicity and/or toxicity. See also e.g., Rapti and Grimm. of Front Immunol. 2021; 12: 753467, particularly at section 3.2.2.5 therein, and Domenger and Grimm. 2019. Human Molec Gen. 28(R1):R3-R14. It will be appreciated that one or more approaches discussed here and elsewhere herein can be combined.

In some embodiments, the engineered AAV is a self-complementary AAV (scAAV), which can have a favorable genome configuration with respect to efficiency.

In some embodiments, the engineered viral vector, such as an AAV viral vector, is engineered to have a cargo polynucleotide and/or genome that has a reduced number of CpG islands, which, without being bound by theory, can evade the adaptive and innate immune response by reducing TLR9 signaling. See also e.g., Faust et al., J Clin Invest (2013) 123:2994-3001 and Xiang et al., Mol Ther (2020) 28:771-83, the teachings of which can be adapted for use with the present invention.

In some embodiments, the engineered viral vector, such as an AAV viral vector, is engineered to include one or more short oligonucleotides in its genome that are configured to and/or capable of antagonizing TLR9 activation (referred to herein as TLR9i oligonucleotides), which, without being bound by theory can help the engineered viral particle evade TLR9 sensing and thus reduce immunogenicity. See e.g., Chan et al., Sci Transl Med. 2021 Feb. 10; 13(580), the teachings of which can be adapted for use with the present invention. In some embodiments, one or more TLR9i oligonucleotides (e.g., 1, 2, 3, 4, 5 or more) are incorporated into one or both of the inverted terminal repeats (ITRs) of a viral vector, such as an AAV viral vector. In some embodiments, the one or more TLR9i oligonucleotides are incorporated into the 5′ ITR. In some embodiments, the TLR9i oligonucleotides comprise 1 or more ODN repeats (e.g., 1, 2, 3, 4, 5 or more) that are optionally separated from each other via a linker polynucleotide. In some embodiments, the linker(s) is/are AAAAA. In some embodiments the ODN repeat comprises or consists of TAGGG. In some embodiments, the tTLR9i and/or ODN repeat comprises or consists of the sequence TAGGGTTAGGGTTAGGGTTAGGG (SEQ ID NO: 8582) or TTTAGGGTAGGGTAGGGTAGGG (SEQ ID NO: 8583). In some embodiments, the TLR9i oligonucleotides comprise or consist of the sequence TAGGGTAGGGTAGGGTAGGGAAAAATAGGGTAGGGTAGGGTAGG GAAAAATTAGGGTTAGGGTTAGGGTTAGGGAAAAA (SEQ ID NO: 8584). In some embodiments, the TLR9i oligonucleotides comprise or consist of the sequence TAGGGTAGGGTAGGGTAGGGAAAAATAGGGTAGGGTAGGGTAGG GAAAAATTTAGGGTTAGGGTTAGGGTTAGGGAAAAATGCAGCGGTAAGTTCCCA TCCAGGTTTTTTTGCAGCGGTAAGTTCCCATCCAGGTTTTTTGCAGCGGTAAGTTCC CATCCAGGTTTTT (SEQ ID NO: 8585). Other suitable TLR9i oligonucleotides are set forth in e.g., Chan et al., Sci Transl Med. 2021 Feb. 10; 13(580), particularly at Table S1, the teachings of which can be adapted for use with the present invention.

In some embodiments, the AAV vector is engineered to include a synthetic enhancer, promoter, or other cis acting regulatory element that is configured to optimize or otherwise control transcription of the genes they are associated with (e.g., including but not limited to a cargo polynucleotide). In some embodiments, the synthetic enhancer, promoter, or other cis acting regulatory element is positioned in the engineered AAV vector such that it is about 100 to about 1000 base pairs upstream of the gene or polynucleotide that it regulates (e.g., including but not limited to a cargo polynucleotide). In some embodiments, the synthetic enhancer, promoter, or other cis acting regulatory element contains one or more transcription factor binding sites, which are optionally engineered to bind specific transcription factors so as to control cargo expression temporally or spatially. For example, cell-specific transcription factors can be incorporated to spatially control expression. Exemplary spatial and temporal specific regulatory elements that can be incorporated are described in greater detail elsewhere herein. Additionally, promoter strength can be selected to further optimized polynucleotide expression of the AAV vector. Various promoters (strong and weak) are further described elsewhere herein and will be appreciated by one of ordinary skill in the art in view of the description herein. See also, e.g., Domenger and Grimm. 2019. Human Molec Gen. 28(R1):R3-R14, particularly at pages R4-R6, the teachings of which can be adapted for use with the present invention.. The specific combination of regulatory elements included can be used to fine tune and optimize cargo polynucleotide expression from a viral, e.g., AAV, vector or genome.

Other cis-acting elements, such as RNAi molecule binding sites or external stimuli responsive elements, can be incorporated into an engineered viral vector or viral vector genome, such as an AAV genome. By incorporating cell-type specific RNAi molecule binding sites, spatial expression of a cargo polynucleotide can be fine-tuned or optimized. Further, a synthetic or engineered RNAi molecule binding site can be included allowing control in a spatial and/or temporal manner by controlling where and/or when the synthetic or engineered RNAi molecule is present. In some embodiments, the polynucleotide encoding the synthetic RNAi molecule binding can also be incorporated into the viral vector genome such that it regulates a repressor or other regulatory element of the viral vector genome. In some embodiments, the RNAi molecule binding site(s) are incorporated into a viral vector genome within the 3′UTR of a cargo polynucleotide (e.g., a transgene) This is discussed in further detail elsewhere herein. In some embodiments, the viral vector, such as an AAV vector, is engineered to contain a LOV2 domain from Avena sativa that generates a blue light sensitive cargo polynucleotide. Thus, in this way blue light can be used to provide temporal and spatial control of transgene expression. See also e.g., Domenger and Grimm. 2019. Human Molec Gen. 28(R1):R3-R14, particularly at R7-R8 and FIG. 2, the teachings of which can be adapted for use with the present invention..

In some embodiments, the viral vector, e.g., AAV, is engineered to have one or more adverse structural elements deleted. Deleterious structural elements can be identified using a suitable screen strategy such as SMRT sequencing technology to identify vectors with adverse elements. In some embodiments, the adverse structural element is a shRNA, a hairpin sequence, or other secondary structure that mimics an ITR. See also e.g., Domenger and Grimm. 2019. Human Molec Gen. 28(R1):R3-R14, particularly at R9, the teachings of which can be adapted for use with the present invention.

Other exemplary modifications to reduce immunogenicity and/or toxicity are also described elsewhere herein.

Capsid Modifications for Improved Efficacy and/or Reduced Immunogenicity and/or Toxicity

In some embodiments, the polypeptide composition, such as a viral capsid or capsid polypeptide (e.g., AAV capsid or capsid polypeptide) of the present invention is engineered and/or rationally designed or evolved to contained one or more modifications (in addition to the n-mer motifs of the present invention) to modify and/or improve delivery, stability, efficacy, and/or reduce immunogenicity and/or toxicity of the protein composition, such as a viral capsid or capsid polypeptide (e.g., AAV capsid or capsid polypeptide) of the present invention. See e.g., Rapti and Grimm. of Front Immunol. 2021; 12: 753467, particularly at FIG. 2, Table 1 Section 3; Lam et al., J Pharm Sci, 86 (11)(1997), pp. 1250-1255, Le et al., J Control Release, 108 (1) (2005), pp. 161-177, Wonganan et al., Mol Pharm, 9 (7) (2011), pp. 78-92, Yao et al., Molecules, 22 (7) (2017), pp. 1-15, Zhao et al., J Virol, 90 (9) (2016), pp. 4262-4268, Gabriel et al. Hum Gene Ther Methods, 24 (2) (2013), pp. 80-93, Zhang et al., Biomaterials, 80 (2016), pp. 134-145, Mevel et al., Chem Sci, 11 (4) (2020), pp. 1122-1131, the teachings of which can be adapted for use with the present invention.

In some embodiments, the protein compositions, such as capsid protein(s) (e.g., AAV capsid polypeptides) of the present invention are PEGylated, which without being bound by theory, can mask the protein compositions, such as capsid protein(s) (e.g., AAV capsid polypeptides) of the present invention from antibodies. Suitable PEGylation of the protein compositions, such as capsid polypeptide(s) (e.g., AAV capsid polypeptides) of the present invention is described elsewhere herein.

In some embodiments, the protein compositions, such as capsid polypeptide(s) (e.g., AAV capsid polypeptides) of the present invention are engineered to reduce the number of oxidation susceptible residues, such as Met, Tyr, Trp, His, and/or Cys. In some embodiments, the protein compositions, such as capsid polypeptide(s) (e.g., AAV capsid polypeptides) of the present invention are engineered such that they contain one or more silent amino acid mutations (e.g., substitutions) that reduce the number of oxidation susceptible residues, such as Met, Tyr, Trp, His, and/or Cys. Without being bound by theory, such modifications can increase the stability, reduce degradation, increase half-life, and/or increase efficacy of the protein compositions, such as capsid polypeptide(s) (e.g., AAV capsid polypeptides) of the present invention.

In some embodiments, as is also further described herein, the protein compositions, such as capsid polypeptide(s) (e.g., AAV capsid polypeptides) of the present invention are encapsulated in a liposome, exosome, or other delivery vehicle. Without being bound by theory, such an approach can mask the protein compositions, such as capsid polypeptide(s) (e.g., AAV capsid polypeptides) of the present invention from immune components such as antibodies, thus reducing the immunogenicity of the composition.

In some embodiments, as is also further described herein, the protein compositions, such as capsid polypeptide(s) (e.g., AAV capsid polypeptides) of the present invention are cloaked via click labeling the polypeptide (e.g., capsid) to precisely tether oligonucleotides to the surface of the polypeptide composition (e.g., capsid) and associated or encapsulated with a lipid composition, (e.g., lipofectamine). See also e.g., Grimm et al., J Virol (2008) 82:5887-911. doi: 10.1128/JVI.00254-08, the teachings of which can be adapted for use with the present invention.

In some embodiments, the viral vector and/or polypeptide (e.g., capsid polypeptides) are selected, optimized and/or otherwise engineered to reduced immunogenicity. In some embodiments, and as discussed elsewhere herein, the serotype of the viral vector, such as AAV, can be selected to have a reduced immunogenicity in the recipient.

In some embodiments, the capsid polypeptide and/or capsid can be engineered and/or rationally designed or generated under a directed evolution approach to have reduced immunogenicity. In some embodiments, this is in addition or contemporaneous to any modification, engineering, selection, or directed evolution of proteins to have a specific tropism. See e.g., Rapti and Grimm. of Front Immunol. 2021; 12: 753467., particularly at Table 1 and Section 3/FIG. 2, the teachings of which can be adapted for use with the present invention.

As is also described herein, the immunogenicity of a viral capsid, particularly an AAV can be reduced, by one or more detargeting approaches, wherein the capsid or other component of the virial vector are modified to reduce delivery to or transgene/cargo expression in a non-target cell. In some embodiments, the capsid or capsid protein is modified at one or more residues to detarget a non-target cell, which can reduce the immunogenicity and/or toxicity of the viral particles. Exemplary modifications are described in greater detail elsewhere herein.

In some embodiments, the vector is a non-viral vector or carrier. In some embodiments, non-viral vectors can have the advantage(s) of reduced toxicity and/or immunogenicity and/or increased bio-safety as compared to viral vectors The terms of art “Non-viral vectors and carriers” and as used herein in this context refers to molecules and/or compositions that are not based on one or more component of a virus or virus genome (excluding any nucleotide to be delivered and/or expressed by the non-viral vector) that can be capable of attaching to, incorporating, coupling, and/or otherwise interacting with an engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid polynucleotide of the present invention and can be capable of ferrying the polynucleotide to a cell and/or expressing the polynucleotide. It will be appreciated that this does not exclude the inclusion of a virus-based polynucleotide that is to be delivered. For example, if a gRNA to be delivered is directed against a virus component and it is inserted or otherwise coupled to an otherwise non-viral vector or carrier, this would not make said vector a “viral vector”. Non-viral vectors and carriers include naked polynucleotides, chemical-based carriers, polynucleotide (non-viral) based vectors, and particle-based carriers. It will be appreciated that the term “vector” as used in the context of non-viral vectors and carriers refers to polynucleotide vectors and “carriers” used in this context refers to a non-nucleic acid or polynucleotide molecule or composition that be attached to or otherwise interact with a polynucleotide to be delivered, such as an engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid polynucleotide of the present invention.

In some embodiments one or more engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid polynucleotides described elsewhere herein can be included in a naked polynucleotide. The term of art “naked polynucleotide” as used herein refers to polynucleotides that are not associated with another molecule (e.g., proteins, lipids, and/or other molecules) that can often help protect it from environmental factors and/or degradation. As used herein, associated with includes, but is not limited to, linked to, adhered to, adsorbed to, enclosed in, enclosed in or within, mixed with, and the like. Naked polynucleotides that include one or more of the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid polynucleotides described herein can be delivered directly to a host cell and optionally expressed therein. The naked polynucleotides can have any suitable two- and three-dimensional configurations. By way of non-limiting examples, naked polynucleotides can be single-stranded molecules, double stranded molecules, circular molecules (e.g., plasmids and artificial chromosomes), molecules that contain portions that are single stranded and portions that are double stranded (e.g., ribozymes), and the like. In some embodiments, the naked polynucleotide contains only the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid polynucleotide(s) of the present invention. In some embodiments, the naked polynucleotide can contain other nucleic acids and/or polynucleotides in addition to the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid polynucleotide(s) of the present invention. The naked polynucleotides can include one or more elements of a transposon system. Transposons and system thereof are described in greater detail elsewhere herein.

In some embodiments, one or more of the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid polynucleotides can be included in a non-viral polynucleotide vector. Suitable non-viral polynucleotide vectors include, but are not limited to, transposon vectors and vector systems, plasmids, bacterial artificial chromosomes, yeast artificial chromosomes, AR (antibiotic resistance)-free plasmids and miniplasmids, circular covalently closed vectors (e.g., minicircles, minivectors, miniknots,), linear covalently closed vectors (“dumbbell shaped”), MIDGE (minimalistic immunologically defined gene expression) vectors, MiLV (micro-linear vector) vectors, Ministrings, mini-intronic plasmids, PSK systems (post-segregationally killing systems), ORT (operator repressor titration) plasmids, and the like. See e.g., Hardee et al. 2017. Genes. 8(2):65.

In some embodiments, the non-viral polynucleotide vector can have a conditional origin of replication. In some embodiments, the non-viral polynucleotide vector can be an ORT plasmid. In some embodiments, the non-viral polynucleotide vector can have a minimalistic immunologically defined gene expression. In some embodiments, the non-viral polynucleotide vector can have one or more post-segregationally killing system genes. In some embodiments, the non-viral polynucleotide vector is AR-free. In some embodiments, the non-viral polynucleotide vector is a minivector. In some embodiments, the non-viral polynucleotide vector includes a nuclear localization signal. In some embodiments, the non-viral polynucleotide vector can include one or more CpG motifs. In some embodiments, the non-viral polynucleotide vectors can include one or more scaffold/matrix attachment regions (S/MARs). See e.g., Mirkovitch et al. 1984. Cell. 39:223-232, Wong et al. 2015. Adv. Genet. 89:113-152, whose techniques and vectors can be adapted for use in the present invention. S/MARs are AT-rich sequences that play a role in the spatial organization of chromosomes through DNA loop base attachment to the nuclear matrix. S/MARs are often found close to regulatory elements such as promoters, enhancers, and origins of DNA replication. Inclusion of one or S/MARs can facilitate a once-per-cell-cycle replication to maintain the non-viral polynucleotide vector as an episome in daughter cells. In embodiments, the S/MAR sequence is located downstream of an actively transcribed polynucleotide (e.g., one or more engineered AAV capsid polynucleotides of the present invention) included in the non-viral polynucleotide vector. In some embodiments, the S/MAR can be a S/MAR from the beta-interferon gene cluster. See e.g., Verghese et al. 2014. Nucleic Acid Res. 42:e53; Xu et al. 2016. Sci. China Life Sci. 59:1024-1033; Jin et al. 2016. 8:702-711; Koirala et al. 2014. Adv. Exp. Med. Biol. 801:703-709; and Nehlsen et al. 2006. Gene Ther. Mol. Biol. 10:233-244, whose techniques and vectors can be adapted for use in the present invention.

In some embodiments, the non-viral vector is a transposon vector or system thereof. As used herein, “transposon” (also referred to as transposable element) refers to a polynucleotide sequence that is capable of moving form location in a genome to another. There are several classes of transposons. Transposons include retrotransposons and DNA transposons. Retrotransposons require the transcription of the polynucleotide that is moved (or transposed) in order to transpose the polynucleotide to a new genome or polynucleotide. DNA transposons are those that do not require reverse transcription of the polynucleotide that is moved (or transposed) in order to transpose the polynucleotide to a new genome or polynucleotide. In some embodiments, the non-viral polynucleotide vector can be a retrotransposon vector. In some embodiments, the retrotransposon vector includes long terminal repeats. In some embodiments, the retrotransposon vector does not include long terminal repeats. In some embodiments, the non-viral polynucleotide vector can be a DNA transposon vector. DNA transposon vectors can include a polynucleotide sequence encoding a transposase. In some embodiments, the transposon vector is configured as a non-autonomous transposon vector, meaning that the transposition does not occur spontaneously on its own. In some of these embodiments, the transposon vector lacks one or more polynucleotide sequences encoding proteins required for transposition. In some embodiments, the non-autonomous transposon vectors lack one or more Ac elements.

In some embodiments a non-viral polynucleotide transposon vector system can include a first polynucleotide vector that contains the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid polynucleotide(s) of the present invention flanked on the 5′ and 3′ ends by transposon terminal inverted repeats (TIRs) and a second polynucleotide vector that includes a polynucleotide capable of encoding a transposase coupled to a promoter to drive expression of the transposase. When both are expressed in the same cell the transposase can be expressed from the second vector and can transpose the material between the TIRs on the first vector (e.g., the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid polynucleotide(s) of the present invention) and integrate it into one or more positions in the host cell's genome. In some embodiments the transposon vector or system thereof can be configured as a gene trap. In some embodiments, the TIRs can be configured to flank a strong splice acceptor site followed by a reporter and/or other gene (e.g., one or more of the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid polynucleotide(s) of the present invention) and a strong poly A tail. When transposition occurs while using this vector or system thereof, the transposon can insert into an intron of a gene and the inserted reporter or other gene can provoke a mis-splicing process and as a result it in activates the trapped gene.

Any suitable transposon system can be used. Suitable transposon and systems thereof can include, but are not limited to, Sleeping Beauty transposon system (Tc1/mariner superfamily) (see e.g., Ivics et al. 1997. Cell. 91(4): 501-510), piggyBac (piggyBac superfamily) (see e.g., Li et al. 2013 110(25): E2279-E2287 and Yusa et al. 2011. PNAS. 108(4): 1531-1536), Tol2 (superfamily hAT), Frog Prince (Tc1/mariner superfamily) (see e.g., Miskey et al. 2003 Nucleic Acid Res. 31(23):6873-6881) and variants thereof.

In some embodiments the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid polynucleotide(s) can be coupled to a chemical carrier. Chemical carriers that can be suitable for delivery of polynucleotides can be broadly classified into the following classes: (i) inorganic particles, (ii) lipid-based, (iii) polymer-based, and (iv) peptide based. They can be categorized as (1) those that can form condensed complexes with a polynucleotide (such as the engineered targeting moiety, polypeptide, viral (e.g. AAV) capsid polynucleotide(s) of the present invention), (2) those capable of targeting specific cells, (3) those capable of increasing delivery of the polynucleotide (such as the engineered targeting moiety, polypeptide, viral (e.g. AAV) capsid polynucleotide(s) of the present invention) to the nucleus or cytosol of a host cell, (4) those capable of disintegrating from DNA/RNA in the cytosol of a host cell, and (5) those capable of sustained or controlled release. It will be appreciated that any one given chemical carrier can include features from multiple categories. The term “particle” as used herein, refers to any suitable sized particles for delivery of the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid system components described herein. Suitable sizes include macro-, micro-, and nano-sized particles.

In some embodiments, the non-viral carrier can be an inorganic particle. In some embodiments, the inorganic particle, can be a nanoparticle. The inorganic particles can be configured and optimized by varying size, shape, and/or porosity. In some embodiments, the inorganic particles are optimized to escape from the reticuloendothelial system. In some embodiments, the inorganic particles can be optimized to protect an entrapped molecule from degradation. The Suitable inorganic particles that can be used as non-viral carriers in this context can include, but are not limited to, calcium phosphate, silica, metals (e.g., gold, platinum, silver, palladium, rhodium, osmium, iridium, ruthenium, mercury, copper, rhenium, titanium, niobium, tantalum, and combinations thereof), magnetic compounds, particles, and materials, (e.g., supermagnetic iron oxide and magnetite), quantum dots, fullerenes (e.g., carbon nanoparticles, nanotubes, nanostrings, and the like), and combinations thereof. Other suitable inorganic non-viral carriers are discussed elsewhere herein.

In some embodiments, the non-viral carrier can be lipid-based. Suitable lipid-based carriers are also described in greater detail herein. In some embodiments, the lipid-based carrier includes a cationic lipid or an amphiphilic lipid that is capable of binding or otherwise interacting with a negative charge on the polynucleotide to be delivered (e.g., such as an engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid polynucleotide of the present invention). In some embodiments, chemical non-viral carrier systems can include a polynucleotide such as the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid polynucleotide(s) of the present invention) and a lipid (such as a cationic lipid). These are also referred to in the art as lipoplexes. Other embodiments of lipoplexes are described elsewhere herein. In some embodiments, the non-viral lipid-based carrier can be a lipid nano emulsion. Lipid nano emulsions can be formed by the dispersion of an immisicible liquid in another stabilized emulsifying agent and can have particles of about 200 nm that are composed of the lipid, water, and surfactant that can contain the polynucleotide to be delivered (e.g., the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid polynucleotide(s) of the present invention). In some embodiments, the lipid-based non-viral carrier can be a solid lipid particle or nanoparticle.

In some embodiments, the non-viral carrier can be peptide-based. In some embodiments, the peptide-based non-viral carrier can include one or more cationic amino acids. In some embodiments, 35 to 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 99 or 100% of the amino acids are cationic. In some embodiments, peptide carriers can be used in conjunction with other types of carriers (e.g., polymer-based carriers and lipid-based carriers to functionalize these carriers). In some embodiments, the functionalization is targeting a host cell. Suitable polymers that can be included in the polymer-based non-viral carrier can include, but are not limited to, polyethylenimine (PEI), chitosan, poly (DL-lactide) (PLA), poly (DL-Lactide-co-glycoside) (PLGA), dendrimers (see e.g., US Pat. Pub. 2017/0079916 whose techniques and compositions can be adapted for use with the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid polynucleotides of the present invention), polymethacrylate, and combinations thereof.

In some embodiments, the non-viral carrier can be configured to release an engineered delivery system polynucleotide that is associated with or attached to the non-viral carrier in response to an external stimulus, such as pH, temperature, osmolarity, concentration of a specific molecule or composition (e.g., calcium, NaCl, and the like), pressure and the like. In some embodiments, the non-viral carrier can be a particle that is configured includes one or more of the engineered AAV capsid polynucleotides describe herein and an environmental triggering agent response element, and optionally a triggering agent. In some embodiments, the particle can include a polymer that can be selected from the group of polymethacrylates and polyacrylates. In some embodiments, the non-viral particle can include one or more embodiments of the compositions microparticles described in US Pat. Pubs. 20150232883 and 20050123596, whose techniques and compositions can be adapted for use in the present invention.

In some embodiments, the non-viral carrier can be a polymer-based carrier. In some embodiments, the polymer is cationic or is predominantly cationic such that it can interact in a charge-dependent manner with the negatively charged polynucleotide to be delivered (such as the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid polynucleotide(s) of the present invention). Polymer-based systems are described in greater detail elsewhere herein.

In some embodiments, the vector is a viral vector. The term of art “viral vector” and as used herein in this context refers to polynucleotide based vectors that contain one or more elements from or based upon one or more elements of a virus that can be capable of expressing and packaging a polynucleotide, such as an engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid polynucleotide of the present invention, into a virus particle and producing said virus particle when used alone or with one or more other viral vectors (such as in a viral vector system). Viral vectors and systems thereof can be used for producing viral particles for delivery of and/or expression of one or more components of the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid system described herein. The viral vector can be part of a viral vector system involving multiple vectors. In some embodiments, systems incorporating multiple viral vectors can increase the safety of these systems. Suitable viral vectors can include adenoviral-based vectors, adeno associated vectors, helper-dependent adenoviral (HdAd) vectors, hybrid adenoviral vectors, and the like. Other embodiments of viral vectors and viral particles produce therefrom are described elsewhere herein. In some embodiments, the viral vectors are configured to produce replication incompetent viral particles for improved safety of these systems.

In some embodiments, the vector can be an adenoviral vector. In some embodiments, the adenoviral vector can include elements such that the virus particle produced using the vector or system thereof can be serotype 2, 5, or 9. In some embodiments, the polynucleotide to be delivered via the adenoviral particle can be up to about 8 kb. Thus, in some embodiments, an adenoviral vector can include a DNA polynucleotide to be delivered that can range in size from about 0.001 kb to about 8 kb. Adenoviral vectors have been used successfully in several contexts (see e.g., Teramato et al. 2000. Lancet. 355:1911-1912; Lai et al. 2002. DNA Cell. Biol. 21:895-913; Flotte et al., 1996. Hum. Gene. Ther. 7:1145-1159; and Kay et al. 2000. Nat. Genet. 24:257-261. The engineered AAV capsids can be included in an adenoviral vector to produce adenoviral particles containing said engineered AAV capsids.

In some embodiments the vector can be a helper-dependent adenoviral vector or system thereof. These are also referred to in the field as “gutless” or “gutted” vectors and are a modified generation of adenoviral vectors (see e.g., Thrasher et al. 2006. Nature. 443:E5-7). In embodiments of the helper-dependent adenoviral vector system one vector (the helper) can contain all the viral genes required for replication but contains a conditional gene defect in the packaging domain. The second vector of the system can contain only the ends of the viral genome, one or more engineered AAV capsid polynucleotides, and the native packaging recognition signal, which can allow selective packaged release from the cells (see e.g., Cideciyan et al. 2009. N Engl J Med. 361:725-727). Helper-dependent Adenoviral vector systems have been successful for gene delivery in several contexts (see e.g., Simonelli et al. 2010. J Am Soc Gene Ther. 18:643-650; Cideciyan et al. 2009. N Engl J Med. 361:725-727; Crane et al. 2012. Gene Ther. 19(4):443-452; Alba et al. 2005. Gene Ther. 12:18-S27; Croyle et al. 2005. Gene Ther. 12:579-587; Amalfitano et al. 1998. J. Virol. 72:926-933; and Morral et al. 1999. PNAS. 96:12816-12821). The techniques and vectors described in these publications can be adapted for inclusion and delivery of the engineered AAV capsid polynucleotides described herein. In some embodiments, the polynucleotide to be delivered via the viral particle produced from a helper-dependent adenoviral vector or system thereof can be up to about 38 kb. Thus, in some embodiments, an adenoviral vector can include a DNA polynucleotide to be delivered that can range in size from about 0.001 kb to about 37 kb (see e.g., Rosewell et al. 2011. J. Genet. Syndr. Gene Ther. Suppl. 5:001).

In some embodiments, the vector is a hybrid-adenoviral vector or system thereof. Hybrid adenoviral vectors are composed of the high transduction efficiency of a gene-deleted adenoviral vector and the long-term genome-integrating potential of adeno-associated, retroviruses, lentivirus, and transposon based-gene transfer. In some embodiments, such hybrid vector systems can result in stable transduction and limited integration site. See e.g., Balague et al. 2000. Blood. 95:820-828; Morral et al. 1998. Hum. Gene Ther. 9:2709-2716; Kubo and Mitani. 2003. J. Virol. 77(5): 2964-2971; Zhang et al. 2013. PloS One. 8(10) e76771; and Cooney et al. 2015. Mol. Ther. 23(4):667-674), whose techniques and vectors described therein can be modified and adapted for use in the engineered AAV capsid system of the present invention. In some embodiments, a hybrid-adenoviral vector can include one or more features of a retrovirus and/or an adeno-associated virus. In some embodiments the hybrid-adenoviral vector can include one or more features of a spuma retrovirus or foamy virus (FV). See e.g., Ehrhardt et al. 2007. Mol. Ther. 15:146-156 and Liu et al. 2007. Mol. Ther. 15:1834-1841, whose techniques and vectors described therein can be modified and adapted for use in the engineered AAV capsid system of the present invention. Advantages of using one or more features from the FVs in the hybrid-adenoviral vector or system thereof can include the ability of the viral particles produced therefrom to infect a broad range of cells, a large packaging capacity as compared to other retroviruses, and the ability to persist in quiescent (non-dividing) cells. See also e.g., Ehrhardt et al. 2007. Mol. Ther. 156:146-156 and Shuji et al. 2011. Mol. Ther. 19:76-82, whose techniques and vectors described therein can be modified and adapted for use in the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid system of the present invention.

In an embodiment, the engineered vector or system thereof can be an adeno-associated vector (AAV). See, e.g., West et al., Virology 160:38-47 (1987); U.S. Pat. No. 4,797,368; WO 93/24641; Kotin, Human Gene Therapy 5:793-801 (1994); and Muzyczka, J. Clin. Invest. 94:1351 (1994). Although similar to adenoviral vectors in some of their features, AAVs have some deficiency in their replication and/or pathogenicity and thus can be safer that adenoviral vectors. In some embodiments the AAV can integrate into a specific site on chromosome 19 of a human cell with no observable side effects. In some embodiments, the capacity of the AAV vector, system thereof, and/or AAV particles can be up to about 4.7 kb. The AAV vector or system thereof can include one or more engineered capsid polynucleotides described herein.

The AAV vector or system thereof can include one or more regulatory molecules. In some embodiments the regulatory molecules can be promoters, enhancers, repressors and the like, which are described in greater detail elsewhere herein. In some embodiments, the AAV vector or system thereof can include one or more polynucleotides that can encode one or more regulatory proteins. In some embodiments, the one or more regulatory proteins can be selected from Rep78, Rep68, Rep52, Rep40, variants thereof, and combinations thereof. In some embodiments, the promoter can be a tissue specific promoter as previously discussed. In some embodiments, the tissue specific promoter can drive expression of an engineered capsid AAV capsid polynucleotide described herein.

The AAV vector or system thereof can include one or more polynucleotides that can encode one or more capsid polypeptides, such as the engineered AAV capsid polypeptides described elsewhere herein. The engineered capsid polypeptides can be capable of assembling into a protein shell (an engineered capsid) of the AAV virus particle. The engineered capsid can have a cell-, tissue- and/or organ-specific tropism.

In some embodiments, the AAV vector or system thereof can include one or more adenovirus helper factors or polynucleotides that can encode one or more adenovirus helper factors. Such adenovirus helper factors can include, but are not limited, E1A, E1B, E2A, E40RF6, and VA RNAs. In some embodiments, a producing host cell line expresses one or more of the adenovirus helper factors.

The AAV vector or system thereof can be configured to produce AAV particles having a specific serotype. In some embodiments, the serotype can be AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV rh.74, AAV rh.10, AAV12, AAV.DJ, AAV.ie, AAV1.9-3, AAV.Anc80, AAV.Anc80L65, AAV2.7m8, or AAV8BP2 or any combinations thereof. In some embodiments, the AAV can be AAV1, AAV-2, AAV-5, AAV-9 or any combination thereof. One can select the AAV of the AAV with regard to the cells to be targeted; e.g., one can select AAV serotypes 1, 2, 5, 9 or a hybrid capsid AAV-1, AAV-2, AAV-5, AAV-9 or any combination thereof for targeting brain and/or neuronal cells; and one can select AAV-4 for targeting cardiac tissue; and one can select AAV-8 for delivery to the liver. Thus, in some embodiments, an AAV vector or system thereof capable of producing AAV particles capable of targeting the brain and/or neuronal cells can be configured to generate AAV particles having serotypes 1, 2, 5 or a hybrid capsid AAV-1, AAV-2, AAV-5 or any combination thereof. In some embodiments, an AAV vector or system thereof capable of producing AAV particles capable of targeting cardiac tissue can be configured to generate an AAV particle having an AAV-4 serotype. In some embodiments, an AAV vector or system thereof capable of producing AAV particles capable of targeting the liver can be configured to generate an AAV having an AAV-8 serotype. See also Srivastava. 2017. Curr. Opin. Virol. 21:75-80.

It will be appreciated that while the different serotypes can provide some level of cell, tissue, and/or organ specificity, each serotype still is multi-tropic and thus can result in tissue-toxicity if using that serotype to target a tissue that the serotype is less efficient in transducing. Tus, in addition to achieving some tissue targeting capacity via selecting an AAV of a particular serotype, it will be appreciated that the tropism of the AAV serotype can be modified by an engineered AAV capsid described herein. As described elsewhere herein, variants of wild-type AAV of any serotype can be generated via a method described herein and determined to have a particular cell-specific tropism, which can be the same or different as that of the reference wild-type AAV serotype. In some embodiments, the cell, tissue, and/or specificity of the wild-type serotype can be enhanced (e.g., made more selective or specific for a particular cell type that the serotype is already biased towards). For example, wild-type AAV-9 is biased towards muscle and brain in humans (see e.g., Srivastava. 2017. Curr. Opin. Virol. 21:75-80.) By including an engineered AAV capsid and/or capsid polypeptide variant of wild-type AAV-9 as described herein, the bias for e.g., muscle (or other non-CNS tissue or cell) can be reduced or eliminated and/or the CNS tissue or cell specificity increased such that the muscle (or other non-CNS tissue or cell) specificity appears reduced in comparison, thus enhancing the specificity for the CNS tissue or cell as compared to the wild-type AAV-9. As previously mentioned, inclusion of an engineered capsid and/or capsid polypeptide n variant of a wild-type AAV serotype can have a different or more efficient and/or more specific tropism than the wild-type reference AAV serotype. For example, an engineered AAV capsid and/or capsid polypeptide variant of AAV-9 can have specificity for a tissue other than muscle or brain in humans or have heightened tropism for e.g., brain tissue as compared to wild-type AAV9.

In some embodiments, the AAV vector is a hybrid AAV vector or system thereof. Hybrid AAVs are AAVs that include genomes with elements from one serotype that are packaged into a capsid derived from at least one different serotype. For example, if it is the rAAV2/5 that is to be produced, and if the production method is based on the helper-free, transient transfection method discussed above, the 1st plasmid and the 3rd plasmid (the adeno helper plasmid) will be the same as discussed for rAAV2 production. However, the 2nd plasmid, the pRepCap will be different. In this plasmid, called pRep2/Cap5, the Rep gene is still derived from AAV2, while the Cap gene is derived from AAV5. The production scheme is the same as the above-mentioned approach for AAV2 production. The resulting rAAV is called rAAV2/5, in which the genome is based on recombinant AAV2, while the capsid is based on AAV5. It is assumed the cell or tissue-tropism displayed by this AAV2/5 hybrid virus should be the same as that of AAV5. It will be appreciated that wild-type hybrid AAV particles suffer the same specificity issues as with the non-hybrid wild-type serotypes previously discussed.

Advantages achieved by the wild-type based hybrid AAV systems can be combined with the increased and customizable cell-specificity that can be achieved with the engineered AAV capsids can be combined by generating a hybrid AAV that can include an engineered AAV capsid described elsewhere herein. It will be appreciated that hybrid AAVs can contain an engineered AAV capsid containing a genome with elements from a different serotype than the reference wild-type serotype that the engineered AAV capsid is a variant of. For example, a hybrid AAV can be produced that includes an engineered AAV capsid that is a variant of an AAV-9 serotype that is used to package a genome that contains components (e.g., rep elements) from an AAV-2 serotype. As with wild-type based hybrid AAVs previously discussed, the tropism of the resulting AAV particle will be that of the engineered AAV capsid.

A tabulation of certain wild-type AAV serotypes as to these cells can be found in Grimm, D. et al, J. Virol. 82: 5887-5911 (2008) reproduced below as Table 4. Further tropism details can be found in Srivastava. 2017. Curr. Opin. Virol. 21:75-80 as previously discussed.

TABLE 4
Cell LineAAV-1AAV-2AAV-3AAV-4AAV-5AAV-6AAV-8AAV-9
Huh-7131002.50.00.1100.70.0
HEK293251002.50.10.150.70.1
HeLa31002.00.16.710.20.1
HepG2310016.70.31.750.3ND
Hep1A201000.21.00.110.20.0
91117100110.20.1170.1ND
CHO100100141.433350101.0
COS33100333.35.0142.00.5
MeWo10100200.36.7101.00.2
NIH3T3101002.92.90.3100.3ND
A5491410020ND0.5100.50.1
HT118020100100.10.3330.50.1
Monocytes1111100NDND1251429NDND
Immature2500100NDND2222857NDND
DC
Mature DC2222100NDND3333333NDND

In some embodiments, the AAV vector or system thereof is AAV rh.74 or AAV rh.10.

In some embodiments, the AAV vector or system thereof is configured as a “gutless” vector, similar to that described in connection with a retroviral vector. In some embodiments, the “gutless” AAV vector or system thereof can have the cis-acting viral DNA elements involved in genome amplification and packaging in linkage with the heterologous sequences of interest (e.g., the engineered AAV capsid polynucleotide(s)).

The vectors described herein can be constructed using any suitable process or technique. In some embodiments, one or more suitable recombination and/or cloning methods or techniques can be used to the vector(s) described herein. Suitable recombination and/or cloning techniques and/or methods can include, but not limited to, those described in U.S. Application publication No. US 2004-0171156 A1. Other suitable methods and techniques are described elsewhere herein.

Construction of recombinant AAV vectors is described in a number of publications, including U.S. Pat. No. 5,173,414; Tratschin et al., Mol. Cell. Biol. 5:3251-3260 (1985); Tratschin, et al., Mol. Cell. Biol. 4:2072-2081 (1984); Hermonat & Muzyczka, PNAS 81:6466-6470 (1984); and Samulski et al., J. Virol. 63:03822-3828 (1989). Any of the techniques and/or methods can be used and/or adapted for constructing an AAV or other vectors described herein. AAV vectors are discussed elsewhere herein.

In some embodiments, the vector can have one or more insertion sites, such as a restriction endonuclease recognition sequence (also referred to as a “cloning site”). In some embodiments, one or more insertion sites (e.g., about or more than about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more insertion sites) are located upstream and/or downstream of one or more sequence elements of one or more vectors.

Delivery vehicles, vectors, particles, nanoparticles, formulations and components thereof for expression of one or more elements of an engineered AAV capsid system described herein are as used in the foregoing documents, such as WO 2014/093622 (PCT/US2013/074667) and are discussed in greater detail herein.

Virus Particle Production from Viral Vectors

There are two main strategies for producing AAV particles from AAV vectors and systems thereof, such as those described herein, which depend on how the adenovirus helper factors are provided (helper v. helper free). In some embodiments, a method of producing AAV particles from AAV vectors and systems thereof can include adenovirus infection into cell lines that stably harbor AAV replication and capsid encoding polynucleotides along with AAV vector containing the polynucleotide to be packaged and delivered by the resulting AAV particle (e.g., the engineered AAV capsid polynucleotide(s)). In some embodiments, a method of producing AAV particles from AAV vectors and systems thereof can be a “helper free” method, which includes co-transfection of an appropriate producing cell line with three vectors (e.g., plasmid vectors): (1) an AAV vector that contains a polynucleotide of interest (e.g., the engineered AAV capsid polynucleotide(s)) between 2 ITRs; (2) a vector that carries the AAV Rep-Cap encoding polynucleotides; and (helper polynucleotides. One of skill in the art will appreciate various methods and variations thereof that are both helper and -helper free and as well as the different advantages of each system.

The engineered AAV vectors and systems thereof described herein can be produced by any of these methods.

A vector (including non-viral carriers) described herein can be introduced into host cells to thereby produce transcripts, proteins, or peptides, including fusion proteins or peptides encoded by nucleic acids as described herein (e.g., engineered AAV capsid system transcripts, proteins, enzymes, mutant forms thereof, fusion proteins thereof, etc.), and virus particles (such as from viral vectors and systems thereof).

One or more engineered AAV capsid polynucleotides can be delivered using adeno associated virus (AAV), adenovirus or other plasmid or viral vector types as previously described, in particular, using formulations and doses from, for example, U.S. Pat. No. 8,454,972 (formulations, doses for adenovirus), U.S. Pat. No. 8,404,658 (formulations, doses for AAV) and U.S. Pat. No. 5,846,946 (formulations, doses for DNA plasmids) and from clinical trials and publications regarding the clinical trials involving lentivirus, AAV and adenovirus. For examples, for AAV, the route of administration, formulation and dose can be as in U.S. Pat. No. 8,454,972 and as in clinical trials involving AAV. For Adenovirus, the route of administration, formulation and dose can be as in U.S. Pat. No. 8,404,658 and as in clinical trials involving adenovirus.

For plasmid delivery, the route of administration, formulation and dose can be as in U.S. Pat. No. 5,846,946 and as in clinical studies involving plasmids. In some embodiments, doses can be based on or extrapolated to an average 70 kg individual (e.g., a male adult human), and can be adjusted for patients, subjects, mammals of different weight and species. Frequency of administration is within the ambit of the medical or veterinary practitioner (e.g., physician, veterinarian), depending on usual factors including the age, sex, general health, other conditions of the patient or subject and the particular condition or symptoms being addressed. The viral vectors can be injected into or otherwise delivered to the tissue or cell of interest.

In terms of in vivo delivery, AAV is advantageous over other viral vectors for a couple of reasons such as low toxicity (this may be due to the purification method not requiring ultra-centrifugation of cell particles that can activate the immune response) and a low probability of causing insertional mutagenesis because it doesn't integrate into the host genome.

The vector(s) and virus particles described herein can be delivered into a host cell in vitro, in vivo, and or ex vivo. Delivery can occur by any suitable method including, but not limited to, physical methods, chemical methods, and biological methods. Physical delivery methods are those methods that employ physical force to counteract the membrane barrier of the cells to facilitate intracellular delivery of the vector. Suitable physical methods include, but are not limited to, needles (e.g., injections), ballistic polynucleotides (e.g., particle bombardment, micro projectile gene transfer, and gene gun), electroporation, sonoporation, photoporation, magnetofection, hydroporation, and mechanical massage. Chemical methods are those methods that employ a chemical to elicit a change in the cells membrane permeability or other characteristic(s) to facilitate entry of the vector into the cell. For example, the environmental pH can be altered which can elicit a change in the permeability of the cell membrane. Biological methods are those that rely and capitalize on the host cell's biological processes or biological characteristics to facilitate transport of the vector (with or without a carrier) into a cell. For example, the vector and/or its carrier can stimulate an endocytosis or similar process in the cell to facilitate uptake of the vector into the cell.

Delivery of engineered AAV capsid system components (e.g., polynucleotides encoding engineered AAV capsid and/or capsid polypeptides) to cells via particles. The term “particle” as used herein, refers to any suitable sized particles for delivery of the engineered AAV capsid system components described herein. Suitable sizes include macro-, micro-, and nano-sized particles. In some embodiments, any of the of the engineered AAV capsid system components (e.g., polypeptides, polynucleotides, vectors, and combinations thereof described herein) can be attached to, coupled to, integrated with, otherwise associated with one or more particles or component thereof as described herein. The particles described herein can then be administered to a cell or organism by an appropriate route and/or technique. In some embodiments, particle delivery can be selected and be advantageous for delivery of the polynucleotide or vector components. It will be appreciated that in embodiments, particle delivery can also be advantageous for other engineered capsid system molecules and formulations described elsewhere herein.

Also described herein are engineered virus particles (also referred to here and elsewhere herein as “engineered viral particles”) that can contain an engineered viral (e.g., AAV) capsid as described in detail elsewhere herein. Viral particles with an engineered AAV capsid are referred to herein as engineered AAV particles. It will be appreciated that the engineered viral (e.g., AAV) particles can be adenovirus-based particles, helper adenovirus-based particles, AAV-based particles, or hybrid adenovirus-based particles that contain at least one engineered AAV capsid polypeptides as previously described. An engineered AAV capsid is one that that contains one or more engineered AAV capsid polypeptides as are described elsewhere herein. In some embodiments, the engineered AAV particles can include 1-60 engineered AAV capsid polypeptides described herein. In some embodiments, the engineered AAV particles can contain 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, or 60 engineered capsid polypeptides. In some embodiments, the engineered AAV particles can contain 0-59 wild-type AAV capsid polypeptides. In some embodiments, the engineered AAV particles can contain 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, or 59 wild-type AAV capsid polypeptides. The engineered AAV particles can thus include one or more n-mer inserts as is previously described.

The engineered AAV particle can include one or more cargo polynucleotides. Cargo polynucleotides are discussed in greater detail elsewhere herein. Methods of making the engineered AAV particles from viral and non-viral vectors are described elsewhere herein. Formulations containing the engineered virus particles are described elsewhere herein.

The engineered viral (e.g., AAV) capsid polynucleotides, other viral (e.g., AAV) polynucleotide(s), and/or vector polynucleotides can contain one or more cargo polynucleotides. The cargo polynucleotides can encode one or more polypeptides. Exemplary cargos are described in greater detail elsewhere herein. It will be appreciated that when a cargo polypeptide is described that its encoding polynucleotide can be a cargo polynucleotide described in this context. In some embodiments, the one or more cargo polynucleotides can be operably linked to the engineered viral (e.g., AAV) capsid polynucleotide(s) and can be part of the engineered viral (e.g., AAV) genome of the viral (e.g., AAV) system of the present invention. The cargo polynucleotides can be packaged into an engineered viral (e.g., AAV) particle, which can be delivered to, e.g., a cell. In some embodiments, the cargo polynucleotide can be capable of modifying a polynucleotide (e.g., gene or transcript) of a cell to which it is delivered. As used herein, “gene” can refer to a hereditary unit corresponding to a sequence of DNA that occupies a specific location on a chromosome and that contains the genetic instruction for a characteristic(s) or trait(s) in an organism. The term gene can refer to translated and/or untranslated regions of a genome. “Gene” can refer to the specific sequence of DNA that is transcribed into an RNA transcript that can be translated into a polypeptide or be a catalytic RNA molecule, including but not limited to, tRNA, siRNA, piRNA, miRNA, long-non-coding RNA and shRNA. Polynucleotide, gene, transcript, etc. modification includes all genetic engineering techniques including, but not limited to, gene editing as well as conventional recombinational gene modification techniques (e.g., whole or partial gene insertion, deletion, and mutagenesis (e.g., insertional and deletional mutagenesis) techniques.

Described herein are engineered cells that can include one or more of the engineered targeting moieties, polypeptides, viral (e.g., AAV) capsid polynucleotides, polypeptides, vectors, and/or vector systems described in greater detail elsewhere herein. In some embodiments, one or more of the engineered viral (e.g., AAV) capsid polynucleotides can be expressed in the engineered cells. In some embodiments, the engineered cells can be capable of producing engineered viral (e.g., AAV) capsid polypeptides and/or engineered viral (e.g., AAV) capsid particles that are described elsewhere herein. Also described herein are modified or engineered organisms that can include one or more engineered cells described herein. The engineered cells can be engineered to express a cargo molecule (e.g., a cargo polynucleotide) dependently or independently of an engineered viral (e.g., AAV) capsid polynucleotide as described elsewhere herein.

A wide variety of animals, plants, algae, fungi, yeast, etc. and animal, plant, algae, fungus, yeast cell or tissue systems may be engineered to express one or more nucleic acid constructs of the engineered targeting moiety, polypeptide, vector, viral (e.g., AAV) capsid system described herein using various transformation methods mentioned elsewhere herein. This can produce organisms that can produce engineered targeting moiety, polypeptide, vector, viral (e.g., AAV) capsid particles, such as for production purposes, engineered targeting moiety, polypeptide, vector, viral (e.g., AAV) capsid design and/or generation, and/or model organisms. In some embodiments, the polynucleotide(s) encoding one or more components of the engineered targeting moiety, polypeptide, vector, viral (e.g., AAV) capsid system described herein can be stably or transiently incorporated into one or more cells of a plant, animal, algae, fungus, and/or yeast or tissue system. In some embodiments, one or more of engineered targeting moiety, polypeptide, vector, viral (e.g., AAV) capsid system polynucleotides are genomically incorporated into one or more cells of a plant, animal, algae, fungus, and/or yeast or tissue system. Further embodiments of the modified organisms and systems are described elsewhere herein. In some embodiments, one or more components of the engineered targeting moiety, polypeptide, vector, viral (e.g., AAV) capsid system described herein are expressed in one or more cells of the plant, animal, algae, fungus, yeast, or tissue systems.

Described herein are various embodiments of engineered cells that can include one or more of the engineered targeting moiety, polypeptide, vector, viral (e.g., AAV) capsid system polynucleotides, polypeptides, vectors, and/or vector systems described elsewhere herein. In some embodiments, the cells can express one or more of the engineered targeting moiety, polypeptide, vector, viral (e.g., AAV) capsid polynucleotides and can produce one or more engineered targeting moiety, polypeptide, vector, viral (e.g., AAV) capsid particles, which are described in greater detail herein. Such cells are also referred to herein as “producer cells”. It will be appreciated that these engineered cells are different from “modified cells” described elsewhere herein in that the modified cells are not necessarily producer cells (i.e. they do not make engineered viral (e.g., AAV) particles) unless they include one or more of the engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid polynucleotides, engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid vectors or other vectors described herein that render the cells capable of producing an engineered viral (e.g., AAV) capsid particle or other particles described herein. Modified cells can be recipient cells of an engineered viral (e.g., AAV) capsid particles and can, in some embodiments, be modified by the engineered viral (e.g., AAV) capsid particle(s) and/or a cargo polynucleotide delivered to the recipient cell. Modified cells are discussed in greater detail elsewhere herein. The term modification can be used in connection with modification of a cell that is not dependent on being a recipient cell. For example, isolated cells can be modified prior to receiving an engineered targeting moiety, polypeptide, viral (e.g., AAV) capsid molecule.

In an embodiment, the invention provides a non-human eukaryotic organism; for example, a multicellular eukaryotic organism, including a eukaryotic host cell containing one or more components of an engineered delivery system described herein according to any of the described embodiments. In other embodiments, the invention provides a eukaryotic organism; preferably a multicellular eukaryotic organism, comprising a eukaryotic host cell containing one or more components of an engineered delivery system described herein according to any of the described embodiments. In some embodiments, the organism is a host of a virus (e.g., an AAV).

In particular embodiments, the plants, algae, fungi, yeast, etc., cells or parts obtained are transgenic plants, comprising an exogenous DNA sequence incorporated into the genome of all or part of the cells.

The engineered cell can be a prokaryotic cell. The prokaryotic cell can be bacterial cell. The prokaryotic cell can be an archaea cell. The bacterial cell can be any suitable bacterial cell. Suitable bacterial cells can be from the genus Escherichia, Bacillus, Lactobacillus, Rhodococcus, Rodhobacter, Synechococcus, Synechoystis, Pseudomonas, Psedoaltermonas, Stenotrophamonas, and Streptomyces Suitable bacterial cells include, but are not limited to Escherichia coli cells, Caulobacter crescentus cells, Rodhobacter sphaeroides cells, Psedoaltermonas haloplanktis cells. Suitable strains of bacterial include, but are not limited to BL21(DE3), DL21(DE3)-pLysS, BL21 Star-pLysS, BL21-SI, BL21-AI, Tuner, Tuner pLysS, Origami, Origami B pLysS, Rosetta, Rosetta pLysS, Rosetta-gami-pLysS, BL21 CodonPlus, AD494, BL2trxB, HMS174, NovaBlue (DE3), BLR, C41(DE3), C43(DE3), Lemo21 (DE3), Shuffle T7, ArcticExpress and ArticExpress (DE3).

The engineered cell can be a eukaryotic cell. The eukaryotic cells may be those of or derived from a particular organism, such as a plant or a mammal, including but not limited to human, or non-human eukaryote or animal or mammal as herein discussed, e.g., mouse, rat, rabbit, dog, livestock, or non-human mammal or primate. In some embodiments the engineered cell can be a cell line. Examples of cell lines include, but are not limited to, C8161, CCRF-CEM, MOLT, mIMCD-3, NHDF, HeLa-S3, Huh1, Huh4, Huh7, HUVEC, HASMC, HEKn, HEKa, MiaPaCell, Panc1, PC-3, TF1, CTLL-2, CiR, Rat6, CV1, RPTE, A10, T24, J82, A375, ARH-77, Calu1, SW480, SW620, SKOV3, SK-UT, CaCo2, P388D1, SEM-K2, WEHI-231, HB56, TIB55, Jurkat, J45.01, LRMB, Bcl-1, BC-3, IC21, DLD2, Raw264.7, NRK, NRK-52E, MRC5, MEF, Hep G2, HeLa B, HeLa T4, COS, COS-1, COS-6, COS-M6A, BS-C-1 monkey kidney epithelial, BALB/3T3 mouse embryo fibroblast, 3T3 Swiss, 3T3-L1, 132-d5 human fetal fibroblasts; 10.1 mouse fibroblasts, 293-T, 3T3, 721, 9L, A2780, A2780ADR, A2780cis, A172, A20, A253, A431, A-549, ALC, B16, B35, BCP-1 cells, BEAS-2B, bEnd.3, BHK-21, BR 293, BxPC3, C3H-10T1/2, C6/36, Cal-27, CHO, CHO-7, CHO-IR, CHO-K1, CHO-K2, CHO-T, CHO Dhfr −/−, COR-L23, COR-L23/CPR, COR-L23/5010, COR-L23/R23, COS-7, COV-434, CML T1, CMT, CT26, D17, DH82, DU145, DuCaP, EL4, EM2, EM3, EMT6/AR1, EMT6/AR10.0, FM3, H1299, H69, HB54, HB55, HCA2, HEK-293, HeLa, Hepa1c1c7, HL-60, HMEC, HT-29, Jurkat, JY cells, K562 cells, Ku812, KCL22, KG1, KYO1, LNCap, Ma-Mel 1-48, MC-38, MCF-7, MCF-10A, MDA-MB-231, MDA-MB-468, MDA-MB-435, MDCK II, MDCK II, MOR/0.2R, MONO-MAC 6, MTD-1A, MyEnd, NCI-H69/CPR, NCI-H69/LX10, NCI-H69/LX20, NCI-H69/LX4, NIH-3T3, NALM-1, NW-145, OPCN/OPCT cell lines, Peer, PNT-1A/PNT 2, RenCa, RIN-5F, RMA/RMAS, Saos-2 cells, Sf-9, SkBr3, T2, T-47D, T84, THP1 cell line, U373, U87, U937, VCaP, Vero cells, WM39, WT-49, X63, YAC-1, YAR, and transgenic varieties thereof. Cell lines are available from a variety of sources known to those with skill in the art (see, e.g., the American Type Culture Collection (ATCC) (Manassas, Va.)).

In some embodiments, the engineered producer cell is a CNS cell, such as a neuron or supporting cell (e.g., a Schawan cell, astrocyte, glial cells, microglial cell and/or the like), a muscle cell (e.g., cardiac muscle, skeletal muscle, and/or smooth muscle), bone cell, blood cell, immune cell (including but not limited to B cells, macrophages, T-cells, CAR-T cells, and the like), kidney cells, bladder cells, lung cells, heart cells, liver cells, brain cells, neurons, skin cells, stomach cells, neuronal support cells, intestinal cells, epithelial cells, endothelial cells, stem or other progenitor cells, adrenal gland cells, cartilage cells, and combinations thereof.

In some embodiments, the engineered cell can be a fungus cell. As used herein, a “fungal cell” refers to any type of eukaryotic cell within the kingdom of fungi. Phyla within the kingdom of fungi include Ascomycota, Basidiomycota, Blastocladiomycota, Chytridiomycota, Glomeromycota, Microsporidia, and Neocallimastigomycota. Fungal cells may include yeasts, molds, and filamentous fungi. In some embodiments, the fungal cell is a yeast cell.

As used herein, the term “yeast cell” refers to any fungal cell within the phyla Ascomycota and Basidiomycota. Yeast cells may include budding yeast cells, fission yeast cells, and mold cells. Without being limited to these organisms, many types of yeast used in laboratory and industrial settings are part of the phylum Ascomycota. In some embodiments, the yeast cell is an S. cerevisiae, Kluyveromyces marxianus, or Issatchenkia orientalis cell. Other yeast cells may include without limitation Candida spp. (e.g., Candida albicans), Yarrowia spp. (e.g., Yarrowia lipolytica), Pichia spp. (e.g., Pichia pastoris), Kluyveromyces spp. (e.g., Kluyveromyces lactis and Kluyveromyces marxianus), Neurospora spp. (e.g., Neurospora crassa), Fusarium spp. (e.g., Fusarium oxysporum), and Issatchenkia spp. (e.g., Issatchenkia orientalis, a.k.a. Pichia kudriavzevii and Candida acidothermophilum). In some embodiments, the fungal cell is a filamentous fungal cell. As used herein, the term “filamentous fungal cell” refers to any type of fungal cell that grows in filaments, i.e., hyphae or mycelia. Examples of filamentous fungal cells may include without limitation Aspergillus spp. (e.g., Aspergillus niger), Trichoderma spp. (e.g., Trichoderma reesei), Rhizopus spp. (e.g., Rhizopus oryzae), and Mortierella spp. (e.g., Mortierella isabellina).

In some embodiments, the fungal cell is an industrial strain. As used herein, “industrial strain” refers to any strain of fungal cell used in or isolated from an industrial process, e.g., production of a product on a commercial or industrial scale. Industrial strain may refer to a fungal species that is typically used in an industrial process, or it may refer to an isolate of a fungal species that may be also used for non-industrial purposes (e.g., laboratory research). Examples of industrial processes may include fermentation (e.g., in production of food or beverage products), distillation, biofuel production, production of a compound, and production of a polypeptide. Examples of industrial strains can include, without limitation, JAY270 and ATCC4124.

In some embodiments, the fungal cell is a polyploid cell. As used herein, a “polyploid” cell may refer to any cell whose genome is present in more than one copy. A polyploid cell may refer to a type of cell that is naturally found in a polyploid state, or it may refer to a cell that has been induced to exist in a polyploid state (e.g., through specific regulation, alteration, inactivation, activation, or modification of meiosis, cytokinesis, or DNA replication). A polyploid cell may refer to a cell whose entire genome is polyploid, or it may refer to a cell that is polyploid in a particular genomic locus of interest.

In some embodiments, the fungal cell is a diploid cell. As used herein, a “diploid” cell may refer to any cell whose genome is present in two copies. A diploid cell may refer to a type of cell that is naturally found in a diploid state, or it may refer to a cell that has been induced to exist in a diploid state (e.g., through specific regulation, alteration, inactivation, activation, or modification of meiosis, cytokinesis, or DNA replication). For example, the S. cerevisiae strain S228C may be maintained in a haploid or diploid state. A diploid cell may refer to a cell whose entire genome is diploid, or it may refer to a cell that is diploid in a particular genomic locus of interest. In some embodiments, the fungal cell is a haploid cell. As used herein, a “haploid” cell may refer to any cell whose genome is present in one copy. A haploid cell may refer to a type of cell that is naturally found in a haploid state, or it may refer to a cell that has been induced to exist in a haploid state (e.g., through specific regulation, alteration, inactivation, activation, or modification of meiosis, cytokinesis, or DNA replication). For example, the S. cerevisiae strain S228C may be maintained in a haploid or diploid state. A haploid cell may refer to a cell whose entire genome is haploid, or it may refer to a cell that is haploid in a particular genomic locus of interest.

In some embodiments, the engineered cell is a cell obtained from a subject. In some embodiments, the subject is a healthy or non-diseased subject. In some embodiments, the subject is a subject with a desired physiological and/or biological characteristic such that when an engineered targeting moiety, polypeptide, vector, viral (e.g., AAV) capsid particle is produced it can package one or more cargo polynucleotides that can be related to the desired physiological and/or biological characteristic and/or capable of modifying the desired physiological and/or biological characteristic. Thus, the cargo polynucleotides of the produced engineered viral (e.g., AAV) or other particles can be capable of transferring the desired characteristic to a recipient cell. In some embodiments, the cargo polynucleotides are capable of modifying a polynucleotide of the engineered cell such that the engineered cell has a desired physiological and/or biological characteristic.

In some embodiments, a cell transfected with one or more vectors described herein is used to establish a new cell line comprising one or more vector-derived sequences.

The engineered cells can be used to produce engineered targeting moieties, polypeptides, viral (e.g., AAV) capsid polynucleotides, vectors, and/or particles. In some embodiments, the engineered targeting moieties, polypeptides, viral (e.g., AAV) capsid polynucleotides, vectors, and/or particles are produced, harvested, and/or delivered to a subject in need thereof. In some embodiments, the engineered cells are delivered to a subject. Other uses for the engineered cells are described elsewhere herein. In some embodiments, the engineered cells can be included in formulations and/or kits described elsewhere herein.

The engineered cells can be stored short-term or long-term for use at a later time. Suitable storage methods are generally known in the art. Further, methods of restoring the stored cells for use (such as thawing, reconstitution, and otherwise stimulating metabolism in the engineered cell after storage) at a later time are also generally known in the art.

Component(s) of the engineered targeting moieties, polypeptides, viral (e.g., AAV) capsid system, engineered cells, engineered viral (e.g., AAV) particles, and/or combinations thereof can be included in a formulation that can be delivered to a subject or a cell. In some embodiments, the formulation is a pharmaceutical formulation. One or more of the polypeptides, polynucleotides, vectors, cells, and combinations thereof described herein can be provided to a subject in need thereof or a cell alone or as an active ingredient, such as in a pharmaceutical formulation. As such, also described herein are pharmaceutical formulations containing an amount of one or more of the polypeptides, polynucleotides, vectors, cells, or combinations thereof described herein. In some embodiments, the pharmaceutical formulation can contain an effective amount of the one or more of the polypeptides, polynucleotides, vectors, cells, and combinations thereof described herein. The pharmaceutical formulations described herein can be administered to a subject in need thereof or a cell.

In some embodiments, the amount of the one or more of the polypeptides, polynucleotides, vectors, cells, virus particles, nanoparticles, other delivery particles, and combinations thereof described herein contained in the pharmaceutical formulation can range from about 1 μg/kg to about 10 mg/kg based upon the bodyweight of the subject in need thereof or average bodyweight of the specific patient population to which the pharmaceutical formulation can be administered. The amount of the one or more of the polypeptides, polynucleotides, vectors, cells, and combinations thereof described herein in the pharmaceutical formulation can range from about 1 μg to about 10 g, from about 10 nL to about 10 ml. In embodiments where the pharmaceutical formulation contains one or more cells, the amount can range from about 1 cell to 1×102, 1×103, 1×104, 1×105, 1×106, 1×107, 1×108, 1×109, 1×1010 or more cells. In embodiments where the pharmaceutical formulation contains one or more cells, the amount can range from about 1 cell to 1×102, 1×103, 1×104, 1×105, 1×106, 1×107, 1×108, 1×109, 1×1010 or more cells per nL, μL, mL, or L.

In embodiments, were engineered AAV capsid particles are included in the formulation, the formulation can contain 1 to 1×101, 1×102, 1×103, 1×104, 1×105, 1×106, 1×107, 1×108, 1×109, 1×1010, 1×1011, 1×1012, 1×1013, 1×1014, 1×1015, 1×1016, 1×1017, 1×1018, 1×1019, or 1×1020, transducing units (TU)/mL of the engineered AAV capsid particles. In some embodiments, the formulation can be 0.1 to 100 mL in volume and can contain 1 to 1×101, 1×102, 1×103, 1×104, 1×105, 1×106, 1×107, 1×108, 1×109, 1×1010, 1×1011, 1×1012, 1×1013, 1×1014, 1×1015, 1×1016, 1×1017, 1×1018, 1×1019, or 1×1020, transducing units (TU)/mL of the engineered AAV capsid particles.

In embodiments, the pharmaceutical formulation containing an amount of one or more of the polypeptides, polynucleotides, vectors, cells, virus particles, nanoparticles, other delivery particles, and combinations thereof described herein can further include a pharmaceutically acceptable carrier. Suitable pharmaceutically acceptable carriers include, but are not limited to, water, salt solutions, alcohols, gum arabic, vegetable oils, benzyl alcohols, polyethylene glycols, gelatin, carbohydrates such as lactose, amylose or starch, magnesium stearate, talc, silicic acid, viscous paraffin, perfume oil, fatty acid esters, hydroxy methylcellulose, and polyvinyl pyrrolidone, which do not deleteriously react with the active composition.

The pharmaceutical formulations can be sterilized, and if desired, mixed with auxiliary agents, such as lubricants, preservatives, stabilizers, wetting agents, emulsifiers, salts for influencing osmotic pressure, buffers, coloring, flavoring and/or aromatic substances, and the like which do not deleteriously react with the active composition.

In addition to an amount of one or more of the polypeptides, polynucleotides, vectors, cells, engineered viral (e.g., AAV) capsids, viral (e.g., AAV) or other particles, nanoparticles, other delivery particles, and combinations thereof described herein, the pharmaceutical formulation can also include an effective amount of an auxiliary active agent, including but not limited to, polynucleotides, amino acids, peptides, polypeptides, antibodies, aptamers, ribozymes, hormones, immunomodulators, antipyretics, anxiolytics, antipsychotics, analgesics, antispasmodics, anti-inflammatories, anti-histamines, anti-infectives, chemotherapeutics, and combinations thereof.

In embodiments where there is an auxiliary active agent contained in the pharmaceutical formulation in addition to the one or more of the polypeptides, polynucleotides, compositions, vectors, cells, virus particles, nanoparticles, other delivery particles, and combinations thereof described herein, amount, such as an effective amount, of the auxiliary active agent will vary depending on the auxiliary active agent. In some embodiments, the amount of the auxiliary active agent ranges from 0.001 micrograms to about 1 milligram. In other embodiments, the amount of the auxiliary active agent ranges from about 0.01 IU to about 1000 IU. In further embodiments, the amount of the auxiliary active agent ranges from 0.001 mL to about 1 mL. In yet other embodiments, the amount of the auxiliary active agent ranges from about 1% w/w to about 50% w/w of the total pharmaceutical formulation. In additional embodiments, the amount of the auxiliary active agent ranges from about 1% v/v to about 50% v/v of the total pharmaceutical formulation. In still other embodiments, the amount of the auxiliary active agent ranges from about 1% w/v to about 50% w/v of the total pharmaceutical formulation.

In some embodiments, the pharmaceutical formulations described herein may be in a dosage form. The dosage forms can be adapted for administration by any appropriate route. Appropriate routes include, but are not limited to, oral (including buccal or sublingual), rectal, epidural, intracranial, intraocular, inhaled, intranasal, topical (including buccal, sublingual, or transdermal), vaginal, intraurethral, parenteral, intracranial, subcutaneous, intramuscular, intravenous, intraperitoneal, intradermal, intraosseous, intracardiac, intraarticular, intracavemous, intrathecal, intravitreal, intracerebral, gingival, subgingival, intracerebroventricular, intra-arterial, intracarotid, intrathecal, intracisternal, subpial, intracerebroventricular, intraparenchymal, intracranial, subdural, subretinal, subconjunctival, intravitreal, intratympanic, intracochlear, intranasal, and intradermal. Such formulations may be prepared by any method known in the art.

Dosage forms adapted for oral administration can be discrete dosage units such as capsules, pellets or tablets, powders or granules, solutions, or suspensions in aqueous or non-aqueous liquids; edible foams or whips, or in oil-in-water liquid emulsions or water-in-oil liquid emulsions. In some embodiments, the pharmaceutical formulations adapted for oral administration also include one or more agents which flavor, preserve, color, or help disperse the pharmaceutical formulation. Dosage forms prepared for oral administration can also be in the form of a liquid solution that can be delivered as foam, spray, or liquid solution. In some embodiments, the oral dosage form can contain about 1 ng to 1000 g of a pharmaceutical formulation containing a therapeutically effective amount or an appropriate fraction thereof of the targeted effector fusion protein and/or complex thereof or composition containing the one or more of the polypeptides, polynucleotides, vectors, cells, and combinations thereof described herein. The oral dosage form can be administered to a subject in need thereof.

Where appropriate, the dosage forms described herein can be microencapsulated.

The dosage form can also be prepared to prolong or sustain the release of any ingredient. In some embodiments, the one or more of the polypeptides, polynucleotides, vectors, cells, and combinations thereof described herein can be the ingredient whose release is delayed. In other embodiments, the release of an optionally included auxiliary ingredient is delayed. Suitable methods for delaying the release of an ingredient include, but are not limited to, coating or embedding the ingredients in material in polymers, wax, gels, and the like. Delayed release dosage formulations can be prepared as described in standard references such as “Pharmaceutical dosage form tablets,” eds. Liberman et. al. (New York, Marcel Dekker, Inc., 1989), “Remington—The science and practice of pharmacy”, 20th ed., Lippincott Williams & Wilkins, Baltimore, MD, 2000, and “Pharmaceutical dosage forms and drug delivery systems”, 6th Edition, Ansel et al., (Media, PA: Williams and Wilkins, 1995). These references provide information on excipients, materials, equipment, and processes for preparing tablets and capsules and delayed release dosage forms of tablets and pellets, capsules, and granules. The delayed release can be anywhere from about an hour to about 3 months or more.

Examples of suitable coating materials include, but are not limited to, cellulose polymers such as cellulose acetate phthalate, hydroxypropyl cellulose, hydroxypropyl methylcellulose, hydroxypropyl methylcellulose phthalate, and hydroxypropyl methylcellulose acetate succinate; polyvinyl acetate phthalate, acrylic acid polymers and copolymers, and methacrylic resins that are commercially available under the trade name EUDRAGIT® (Roth Pharma, Westerstadt, Germany), zein, shellac, and polysaccharides.

Coatings may be formed with a different ratio of water-soluble polymer, water insoluble polymers, and/or pH dependent polymers, with or without water insoluble/water soluble non-polymeric excipient, to produce the desired release profile. The coating is either performed on the dosage form (matrix or simple) which includes, but is not limited to, tablets (compressed with or without coated beads), capsules (with or without coated beads), beads, particle compositions, “ingredient as is” formulated as, but not limited to, suspension form or as a sprinkle dosage form.

Dosage forms adapted for topical administration can be formulated as ointments, creams, suspensions, lotions, powders, solutions, pastes, gels, sprays, aerosols, or oils. In some embodiments for treatments of the eye or other external tissues, for example the mouth or the skin, the pharmaceutical formulations are applied as a topical ointment or cream. When formulated in an ointment, the one or more of the polypeptides, polynucleotides, vectors, cells, and combinations thereof described herein can be formulated with a paraffinic or water-miscible ointment base. In some embodiments, the active ingredient can be formulated in a cream with an oil-in-water cream base or a water-in-oil base. Dosage forms adapted for topical administration in the mouth include lozenges, pastilles, and mouth washes.

Dosage forms adapted for nasal or inhalation administration include aerosols, solutions, suspension drops, gels, or dry powders. In some embodiments, the one or more of the polypeptides, polynucleotides, vectors, cells, and combinations thereof described herein is contained in a dosage form adapted for inhalation is in a particle-size-reduced form that is obtained or obtainable by micronization. In some embodiments, the particle size of the size reduced (e.g., micronized) compound or salt or solvate thereof, is defined by a D50 value of about 0.5 to about 10 microns as measured by an appropriate method known in the art. Dosage forms adapted for administration by inhalation also include particle dusts or mists. Suitable dosage forms wherein the carrier or excipient is a liquid for administration as a nasal spray or drops include aqueous or oil solutions/suspensions of an active ingredient (e.g., the one or more of the polypeptides, polynucleotides, vectors, cells, and combinations thereof described herein and/or auxiliary active agent), which may be generated by various types of metered dose pressurized aerosols, nebulizers, or insufflators.

In some embodiments, the dosage forms can be aerosol formulations suitable for administration by inhalation. In some of these embodiments, the aerosol formulation can contain a solution or fine suspension of the one or more of the polypeptides, polynucleotides, vectors, cells, and combinations thereof described herein and a pharmaceutically acceptable aqueous or non-aqueous solvent. Aerosol formulations can be presented in single or multi-dose quantities in sterile form in a sealed container. For some of these embodiments, the sealed container is a single dose or multi-dose nasal or an aerosol dispenser fitted with a metering valve (e.g., metered dose inhaler), which is intended for disposal once the contents of the container have been exhausted.

Where the aerosol dosage form is contained in an aerosol dispenser, the dispenser contains a suitable propellant under pressure, such as compressed air, carbon dioxide, or an organic propellant, including but not limited to a hydrofluorocarbon. The aerosol formulation dosage forms in other embodiments are contained in a pump-atomizer. The pressurized aerosol formulation can also contain a solution or a suspension of one or more of the polypeptides, polynucleotides, vectors, cells, and combinations thereof described herein. In further embodiments, the aerosol formulation can also contain co-solvents and/or modifiers incorporated to improve, for example, the stability and/or taste and/or fine particle mass characteristics (amount and/or profile) of the formulation. Administration of the aerosol formulation can be once daily or several times daily, for example 2, 3, 4, or 8 times daily, in which 1, 2, or 3 doses are delivered each time.

For some dosage forms suitable and/or adapted for inhaled administration, the pharmaceutical formulation is a dry powder inhalable formulation. In addition to the one or more of the polypeptides, polynucleotides, vectors, cells, and combinations thereof described herein, an auxiliary active ingredient, and/or pharmaceutically acceptable salt thereof, such a dosage form can contain a powder base such as lactose, glucose, trehalose, manitol, and/or starch. In some of these embodiments, the one or more of the polypeptides, polynucleotides, vectors, cells, and combinations thereof described herein is in a particle-size reduced form. In further embodiments, a performance modifier, such as L-leucine or another amino acid, cellobiose octaacetate, and/or metals salts of stearic acid, such as magnesium or calcium stearate.

In some embodiments, the aerosol dosage forms can be arranged so that each metered dose of aerosol contains a predetermined amount of an active ingredient, such as the one or more of the one or more of the polypeptides, polynucleotides, vectors, cells, and combinations thereof described herein.

Dosage forms adapted for vaginal administration can be presented as pessaries, tampons, creams, gels, pastes, foams, or spray formulations. Dosage forms adapted for rectal administration include suppositories or enemas.

Dosage forms adapted for parenteral administration and/or adapted for any type of injection (e.g., intravenous, intraperitoneal, subcutaneous, intramuscular, intradermal, intraosseous, epidural, intracardiac, intraarticular, intracavemous, gingival, subginigival, intrathecal, intravireal, intracerebral, and intracerebroventricular, and others) can include aqueous and/or non-aqueous sterile injection solutions, which can contain anti-oxidants, buffers, bacteriostats, solutes that render the composition isotonic with the blood of the subject, and aqueous and non-aqueous sterile suspensions, which can include suspending agents and thickening agents. The dosage forms adapted for parenteral administration can be presented in a single-unit dose or multi-unit dose containers, including but not limited to sealed ampoules or vials. The doses can be lyophilized and resuspended in a sterile carrier to reconstitute the dose prior to administration. Extemporaneous injection solutions and suspensions can be prepared in some embodiments, from sterile powders, granules, and tablets.

Dosage forms adapted for ocular administration can include aqueous and/or nonaqueous sterile solutions that can optionally be adapted for injection, and which can optionally contain anti-oxidants, buffers, bacteriostats, solutes that render the composition isotonic with the eye or fluid contained therein or around the eye of the subject, and aqueous and nonaqueous sterile suspensions, which can include suspending agents and thickening agents. Dosage forms for the eye can be adapted for topical administration to the eye, such as drops, suspensions, gels, hydrogels (e.g., contact lenses) and/or the like.

For some embodiments, the dosage form contains a predetermined amount of the one or more of the polypeptides, polynucleotides, vectors, cells, and combinations thereof described herein per unit dose. In some embodiments, the predetermined amount of the Such unit doses may therefore be administered once or more than once a day. Such pharmaceutical formulations may be prepared by any of the methods well known in the art.

In some embodiments, the pharmaceutical formulation and/or dosage form, is adapted for improved delivery and/or efficacy of a viral particle, particularly an AAV. In some embodiments, a viral particle or vector such as an AAV particle or vector, of the present invention is PEGylated. In some embodiments, the PEGlyation can improve the pharmacokinetics and/or pharmacodynamics of the viral particles, particularly AAV particles. In some embodiments, the engineered capsid polypeptides of the present invention, including but not limited to the engineered AAV capsid polypeptides are modified with one or more azide moieties which can then be orthogonally conjugated to one or more polyethylene glycols (PEGs) via click chemistry. In some embodiments, this approach can increase the stability (e.g., by 1-3 or more fold) and/or reduce immune system detection of the viral vectors (e.g., antibody recognition can be reduced by 0.1 to 2 or more fold). In some embodiments, the PEG used for PEGlyation is PEG 2000. PEGylated AAV2 particles via amine functionalities have been shown to protect the virus from neutralization and enable significant levels of gene expression upon re-administration without compromising the patient's immune system. See e.g., Harris and Chess. Le at al. Nat Rev Drug Discov, 2 (3) (2003), pp. 214-221, Brocchini et al., Nat Protoc, 1 (5) (2006), pp. 2241-2252, Gupta et al., J Cell Commun Signal, 13 (3) (2019), pp. 319-330, Pelegri-Oday et al., J. Am. Chem. Soc. 2014, 136, 41, 14323-14332, Le et al., J Control Release, 108 (1) (2005), pp. 161-177, and Lee et al. Biotechnol Bioeng, 92 (1) (2005), pp. 24-34, the teachings of which can be adapted for use with the present invention.

In some embodiments, the polypeptide compositions, viral vectors, viral polypeptides (e.g., capsid polypeptides and/or capsids), and/or viral particles are modified so as to improve transduction, stability, and/or other property of the polypeptide compositions, viral vectors, viral polypeptides, and/or viral particles, (in addition to inclusion of a n-mer motif described herein). In some embodiments, the modification(s) increase the stability and/or efficacy of the viral vectors, viral polypeptides (e.g., capsid polypeptides and/or capsids), and/or viral particles. In some embodiments, the capsid or capsid polypeptides thereof are modified by mutation of one or more serine, threonine, and/or lysine residues such that they are replaced with an alanine or arginine residues. In some embodiments, the modification is inclusion of an azide moiety in a viral capsid or capsid polypeptide of the present invention, such an AAV capsid or capsid polypeptide of the present invention. In some embodiments, the azide is introduced into the VP3 capsid domain. See e.g., Lam et al., J Pharm Sci, 86 (11) (1997), pp. 1250-1255, Le et al., J Control Release, 108 (1) (2005), pp. 161-177, Wonganan et al., Mol Pharm, 9 (7) (2011), pp. 78-92, Yao et al., Molecules, 22 (7) (2017), pp. 1-15, Zhao et al., J Virol, 90 (9) (2016), pp. 4262-4268, Gabriel et al. Hum Gene Ther Methods, 24 (2) (2013), pp. 80-93, Zhang et al., Biomaterials, 80 (2016), pp. 134-145, Mevel et al., Chem Sci, 11 (4) (2020), pp. 1122-1131, the teachings of which can be adapted for use with the present invention.

Peptide oxidation is a major cause of chemical instability and also sometimes linked to physical instability. For example, amino acids such as methionine, cysteine, histidine, tyrosine and tryptophan in peptides are susceptible to oxidation. More specifically viral capsid polypeptides can oxidize upon exposure to light and due to metal ion impurities in the raw materials and excipients, common to pharmaceutical formulations leading to a loss in functionality. In some embodiments, oxidation of the polypeptide compositions viral vectors, viral polypeptides (e.g., capsid polypeptides and/or capsids), and/or viral particles can be decreased and/or prevented by including free amino acids such as methionine and histidine and/or metal ion scavengers such as ethanol, EDTA and DTPA in a pharmaceutical formulation of the polypeptide compositions viral vectors, viral polypeptides (e.g., capsid polypeptides and/or capsids), and/or viral particles of the present invention. See e.g., Wang et al., Int J Pharm, 185 (1999), pp. 129-188, Evans et al., J Pharm Sci, 93 (10) (2004), pp. 2458-2475, Reinauer et al., J Pharm Sci, 109 (1) (2020), pp. 818-829, Kamerzell et al., Adv Drug Deliv Rev, 63 (13) (2011), pp. 1118-1159, Shah et al., J Pharm Sci, 107 (11) (2018), pp. 2789-2803, Shah et al., Int J Pharm, 547 (1-2) (2018), pp. 438-449, Tsai et al., Pharm Res An Off J Am Assoc Pharm Sci, 10 (5) (1993), pp. 649-659, Master et al., J Pharm Sci, 99 (5) (2010), pp. 2386-2398, and Lam et al., J Pharm Sci, 86 (11) (1997), pp. 1250-1255, the teachings of which can be adapted for use with the present invention.

Protein aggregation can cause an immunogenic response to protein compositions, including viral capsid compositions. In some embodiments, aggregation of proteins in a formulation, such as viral particles/vectors/capsids can be reduced by inclusion of one or more surfactants in the formulation. In some embodiments, a pharmaceutical formulation containing a protein composition, viral particle, viral capsid, and/or viral capsid polypeptide (e.g., an AAV capsid or capsid polypeptide) of the present invention contains one or more surfactants. In some embodiments, the surfactant is a nonionic surfactant. In some embodiments, the nonionic surfactant is a polysorbate (e.g., polysorbate 20, polysorbate 80). In some embodiments, the nonionic surfactant is poloxamer 188. Without being bound by theory, inclusion of a surfactant can also protect proteins against surface-induced damaged by competing with the proteins for adsorption sites on surfaces, of e.g., containers and delivery devices. See also e.g., Wang et al., Int J Pharm, 289 (1-2) (2005), pp. 1-30, Rodrigues et al., Pharm Res, 36 (2) (2019), pp. 1-20, Wright, J. F. Mol Ther, 12 (1)(2005), pp. 171-178, and Jones et al., ACS Symp Ser, 675 (1997), pp. 206-222, the teachings of which can be adapted for use with the present invention.

Salt can also affect the protein compositions, viral particles, viral vectors, viral capsids, and/or viral capsid proteins in a formulation. At low concentrations, salts affect electrostatic interactions in proteins. Therefore, this effect could be stabilizing when there are repulsive interactions leading to protein unfolding, or destabilizing when there are stabilizing salt bridges or ion pairs in the protein. At high salt concentrations, electrostatic interactions are saturated; the dominant effect of salt is on solvent properties of the solution. The stabilizing salts increase surface tension at water-protein interface and strengthen hydrophobic interactions by keeping hydrophobic groups away from water molecules, inducing preferential hydration of proteins. The salt effect strongly depends on the salt concentration and solution pH, as pH determines the charged state of ionizable amino acids in protein groups. In some embodiments, the salt composition and amounts are optimized for delivery and efficacy of the protein compositions, viral particles, viral vectors, viral capsids, and/or viral capsid proteins of the present invention.

Buffer and pH can influence conformational and colloidal stabilities of proteins, particularly viral capsid proteins. In some embodiments, the pharmaceutical formulation contains one or more buffers so as to optimize the pH of the formulation. The pH determines the net charge on the protein molecule and the nature of electrostatic interactions. Generally, the higher the net charge of the protein, the lower will be the aggregation propensity due to electrostatic repulsions, and higher will be the colloidal stability. In some embodiments, the pharmaceutical formulation contains a buffer optimized to the protein composition, viral particle, viral capsid, or capsid protein of the present invention such that the pH of the formulation is such that it results in a greater net charge of the protein as compared to an unbuffered formulation. In some embodiments, the buffer results in a pharmaceutical formulation of a protein composition, viral particle, viral capsid, or capsid protein of the present invention that has reduced aggregation and/or increased colloidal stability as compared to the same protein composition, viral particle, viral capsid, or capsid protein of the present invention in a formulation without said buffer. See also e.g., Marshall et al., Biochemistry, 50 (12)(2011), pp. 2061-2071, Kamihira et al., J Biol Chem, 278 (5)(2003), pp. 2859-2865, yun et al., Biophys J, 92 (11) (2007), pp. 4064-4077, Raman et al., Biochemistry, 44 (4) (2005), pp. 1288-1299, Jain and Udgaonkar et al., Biochemistry, 49 (35) (2010), pp. 7615-7624, and Klement et al., J Mol Biol, 373 (5) (2007), pp. 1321-1333, the teachings of which can be adapted for use with the present invention.

Osmolytes are small organic compounds cand can be included in a pharmaceutical formulation of the preset invention to stabilize proteins (e.g., the protein composition, viral particle, viral capsid, or capsid protein of the present invention) against denaturation and aggregation. Proteins in an aqueous solution exists in equilibrium between the folded (F) and unfolded (U) states. Without being bound by theory, stabilization by osmolytes occurs by a preferential exclusion mechanism where osmolytes shift the equilibrium towards the F-state. In some embodiments, a pharmaceutical formulation of the present invention includes one or more osmolytes. In some embodiments, the osmolyte(s) are sucrose, glycine, mannitol, histidine, dextrose, arginine, trehalose, lactose, or any combination thereof. In some embodiments, the osmolyte, such as a sugar (e.g., sucrose) can be used in a culture media used to produce viral particles, such as those of the present invention. In some embodiments, inclusion of the osmolyte in culture media during viral particle production increases viral particle yield by 0.1 to 5 fold or more. In some embodiments, the osmolyte incorporated into such a culture media is sucrose and optionally the concentration of the sucrose is about 0.2M. See also e.g., Deorkar and Thiyagarajan., Bio Pharm Int, 29 (10) (2016), pp. 26-30, Wang, W., Int J Pharm, 185 (1999), pp. 129-188, Barnett et al., J Phys Chem B, 120 (13)(2016), pp. 3318-3330, Amani et al., Protein J, 36 (2) (2017), pp. 147-153, Auton et al., Biophys Chem, 159 (1) (2011), pp. 90-99, Kendrick et al., Proc Natl Acad Sci USA, 94 (22) (1997), pp. 11917-11922, Timasheff, S. N., Proc Natl Acad Sci USA, 99 (15) (2002), pp. 9721-9726, Wlodarczyk et al., Eur J Pharm Biopharm, 131 (2018), pp. 92-98, and Rego et al., bioRxiv. Published online (2018), pp. 1-21, the teachings of which can be adapted for use with the present invention.

In some embodiments, the pH of the formulation is basic pH. Without being bound by theory, a basic pH can reduce disulfide formation and/or exchange, thus improving the stability and/or efficacy of the polypeptide compositions, such as capsid polypeptide(s) (e.g., AAV capsid polypeptide) of the present invention present in the formulation.

In some embodiments, as is also further described herein, the protein compositions, such as capsid polypeptide(s) (e.g., AAV capsid polypeptides) of the present invention can be encapsulated in a liposome, exosome, or other delivery vehicle. Without being bound by theory, such an approach can mask the protein compositions, such as capsid polypeptide(s) (e.g., AAV capsid polypeptides) of the present invention from immune components such as antibodies, thus reducing the immunogenicity of the composition.

Also described herein are kits that contain one or more of the one or more of the polypeptides, polynucleotides, vectors, cells, or other components described herein and combinations thereof and pharmaceutical formulations described herein. In embodiments, one or more of the polypeptides, polynucleotides, vectors, cells, and combinations thereof described herein can be presented as a combination kit. As used herein, the terms “combination kit” or “kit of parts” refers to the compounds, or formulations and additional components that are used to package, screen, test, sell, market, deliver, and/or administer the combination of elements or a single element, such as the active ingredient, contained therein. Such additional components include but are not limited to, packaging, syringes, blister packages, bottles, and the like. The combination kit can contain one or more of the components (e.g., one or more of the one or more of the polypeptides, polynucleotides, vectors, cells, and combinations thereof) or formulation thereof can be provided in a single formulation (e.g., a liquid, lyophilized powder, etc.), or in separate formulations. The separate components or formulations can be contained in a single package or in separate packages within the kit. The kit can also include instructions in a tangible medium of expression that can contain information and/or directions regarding the content of the components and/or formulations contained therein, safety information regarding the content of the components(s) and/or formulation(s) contained therein, information regarding the amounts, dosages, indications for use, screening methods, component design recommendations and/or information, recommended treatment regimen(s) for the components(s) and/or formulations contained therein. As used herein, “tangible medium of expression” refers to a medium that is physically tangible or accessible and is not a mere abstract thought or an unrecorded spoken word. “Tangible medium of expression” includes, but is not limited to, words on a cellulosic or plastic material, or data stored in a suitable computer readable memory form. The data can be stored on a unit device, such as a flash memory drive or CD-ROM or on a server that can be accessed by a user via, e.g., a web interface.

In one embodiment, the invention provides a kit comprising one or more of the components described herein. In some embodiments, the kit comprises a vector system and instructions for using the kit. In some embodiments, the vector system includes a regulatory element operably linked to one or more engineered targeting moiety, polypeptide, viral (e.g., AAV) delivery system polynucleotides, as described elsewhere herein and, optionally, a cargo molecule, which can optionally be operably linked to a regulatory element. The one or more engineered targeting moiety, polypeptide, viral (e.g., AAV) delivery system polynucleotides, can be included on the same or different vectors as a cargo molecule capable of being delivered by the engineered targeting moiety, polypeptide, viral (e.g., AAV) delivery system described herein in embodiments containing a cargo molecule within the kit.

In some embodiments, the kit comprises a vector system and instructions for using the kit. In some embodiments, the vector system comprises (a) a first regulatory element operably linked to a direct repeat sequence and one or more insertion sites for inserting one or more guide sequences up- or downstream (whichever applicable) of the direct repeat sequence, wherein when expressed, the guide sequence directs sequence-specific binding of a Cas9 CRISPR complex to a target sequence in a eukaryotic cell, wherein the Cas9 CRISPR complex comprises a Cas9 enzyme complexed with the guide sequence that is hybridized to the target sequence; and/or (b) a second regulatory element operably linked to an enzyme-coding sequence encoding said Cas9 enzyme comprising a nuclear localization sequence. Where applicable, a tracr sequence may also be provided. In some embodiments, the kit comprises components (a) and (b) located on the same or different vectors of the system. In some embodiments, component (a) further comprises two or more guide sequences operably linked to the first regulatory element, wherein when expressed, each of the two or more guide sequences direct sequence specific binding of a CRISPR complex to a different target sequence in a eukaryotic cell. In some embodiments, the Cas9 enzyme comprises one or more nuclear localization sequences of sufficient strength to drive accumulation of said CRISPR enzyme in a detectable amount in the nucleus of a eukaryotic cell. In some embodiments, the CRISPR enzyme is a type V or VI CRISPR system enzyme. In some embodiments, the CRISPR enzyme is a Cas9 enzyme. In some embodiments, the Cas9 enzyme is derived from Francisella tularensis 1, Francisella tularensis subsp. novicida, Prevotella albensis, Lachnospiraceae bacterium MC2017 1, Butyrivibrio proteoclasticus, Peregrinibacteria bacterium GW2011_GWA2_33_10, Parcubacteria bacterium GW2011_GWC2_44_17, Smithella sp. SCADC, Acidaminococcus sp. BV3L6, Lachnospiraceae bacterium MA2020, Candidatus Methanoplasma termitum, Eubacterium eligens, Moraxella bovoculi 237, Leptospira inadai, Lachnospiraceae bacterium ND2006, Porphyromonas crevioricanis 3, Prevotella disiens, or Porphyromonas macacae Cas9 (e.g., modified to have or be associated with at least one DD), and may include further alteration or mutation of the Cas9, and can be a chimeric Cas9. In some embodiments, the DD-CRISPR enzyme is codon-optimized for expression in a eukaryotic cell. In some embodiments, the DD-CRISPR enzyme directs cleavage of one or two strands at the location of the target sequence. In some embodiments, the DD-CRISPR enzyme lacks or substantially DNA strand cleavage activity (e.g., no more than 5% nuclease activity as compared with a wild-type enzyme or enzyme not having the mutation or alteration that decreases nuclease activity). In some embodiments, the first regulatory element is a polymerase III promoter. In some embodiments, the second regulatory element is a polymerase II promoter. In some embodiments, the guide sequence is at least 16, 17, 18, 19, 20, 25 nucleotides, or between 16-30, or between 16-25, or between 16-20 nucleotides in length.

The compositions containing the CNS-specific targeting moieties described herein (e.g., the engineered targeting moiety system polynucleotides, polypeptides, vector(s), engineered cells, engineered viral (e.g., AAV) capsids, and viral and other particles) can be used generally to package and/or deliver one or more cargo polynucleotides or other cargo types to a recipient cell or cell population (including tissues, organs, and organsims). In some embodiments, delivery, is done in a cell-specific manner based upon the specificity of the targeting moiety(ies). In some embodiments, the cell-specificity is conferred via the n-mer insert(s) included in the targeting moiety as previously discussed. In some embodiments, delivery is done in cell-specific manner based upon the tropism of the engineered viral (e.g., AAV) capsid. In some embodiments, engineered targeting moiety(ies), polypeptides, viral (e.g., AAV) capsids, particles, viral (e.g., AAV) particles, compositions thereof, and/or cells discussed herein can be administered to a subject or a cell, tissue, and/or organ and facilitate the transfer and/or integration of the cargo polynucleotide to the recipient cell. In other embodiments, engineered cells capable of producing engineered targeting moiety(ies), polypeptides, viral (e.g., AAV) capsids, particles, viral (e.g., AAV) particles and/or compositions thereof can be generated from engineered targeting moiety system molecules (e.g., polynucleotides, vectors, and vector systems, etc.). In some embodiments, the engineered targeting moiety(ies), polypeptides, viral (e.g., AAV) capsids, particles, viral (e.g., AAV) particles and/or compositions thereof can be delivered to a subject or a cell, tissue, and/or organ. When delivered to a subject, they engineered delivery system molecule(s) can transform a subject's cell in vivo or ex vivo to produce an engineered cell that can be capable of making an engineered targeting moiety(ies), polypeptides, viral (e.g., AAV) capsids, particles, viral (e.g., AAV) particles and/or compositions thereof, which can be released from the engineered cell and deliver cargo molecule(s) to a recipient cell in vivo or produce personalized engineered polypeptides, viral (e.g., AAV) particles, and/or other particles for reintroduction into the subject from which the recipient cell was obtained. In some embodiments, an engineered cell can be delivered to a subject, where it can release produced engineered targeting moieties, polypeptides, viral (e.g., AAV) particles, and/or other particles such that they can then deliver a cargo (e.g., cargo polynucleotide(s)) to a recipient cell. These general processes can be used in a variety of ways to treat and/or prevent disease or a symptom thereof in a subject, generate model cells, generate modified organisms, provide cell selection and screening assays, in bioproduction, and in other various applications.

In some embodiments, the engineered targeting moieties, polypeptides, viral (e.g., AAV) particles, and/or other particles, polynucleotides, vectors, and systems thereof can be used to generate engineered AAV capsid variant libraries that can be mined for variants with a desired cell-specificity, such as CNS specificity. The description provided herein as supported by the various Examples can demonstrate that one having a desired cell-specificity in mind could utilize the present invention as described herein to obtain a capsid with the desired cell-specificity, such as CNS specificity.

In some embodiments, one or more molecules of the engineered delivery system, engineered targeting moieties, polypeptides, viral (e.g., AAV) particles, and/or other particles, polynucleotides, vectors, systems thereof, engineered cells, and/or formulations thereof described herein can be delivered to a subject in need thereof as a therapy for one or more diseases. In some embodiments, the disease to be treated is a genetic or epigenetic based disease. In some embodiments, the disease to be treated is not a genetic or epigenetic based disease. In some embodiments, one or more molecules of the engineered delivery system, engineered targeting moieties, polypeptides, viral (e.g., AAV) particles, and/or other particles, polynucleotides, vectors, and systems thereof, engineered cells, and/or formulations thereof described herein can be delivered to a subject in need thereof as a treatment or prevention (or as a part of a treatment or prevention) of a disease. It will be appreciated that the specific disease to be treated and/or prevented by delivery of an engineered cell and/or engineered can be dependent on the cargo molecule packaged into an engineered AAV capsid particle.

Generally, the compositions described herein can be used in a therapy for treating or preventing a CNS disease, disorder, or a symptom thereof. It will be appreciated that a CNS disease or disorder refers to any disease or disorder whose pathology involves or affects one or more cell types of the central nervous system. In some embodiments, the CNS disease or disorder is one whose primary pathology involves one or more cell types of the CNS. In some embodiments, one or more other cell types outside of the CNS are involved in the pathology of the CNS disease, such as a muscle cell or a peripheral nervous system cell. In some embodiments, the CNS disease or disorder can be caused by one or more genetic abnormalities. In some embodiments, the CNS disease or disorder is not caused by a genetic abnormality. Non-genetic causes of diseases include infection, cancer, physical trauma and others that will be appreciated by those of skill in the art. It also will be appreciated that gene modification approaches to treating disease can be applied to treat and/or prevent both genetic diseases and non-genetic diseases. For example, in the case of non-genetic diseases, a gene therapy approach can be used to modify the cause of the non-genetic disease (e.g., a cancer or infectious organism) such that the cause is no longer disease causing (e.g., by eliminating or rendering non-functional the cancer cells or infectious organism).

Exemplary CNS diseases and disorders include, without limitation, Friedreich's Ataxia, Dravet Syndrome, Spinocerebellar Ataxia Type 3, Niemann Pick Type C, Huntington's Disease, Pompe Disease, Myotonic Dystrophy Type 1, Gluta Deficiency Syndrome (De Vivo Syndrome), Tay-Sachs, Spinal Muscular Atrophy, Alzheimer's disease, Amyotrophic lateral sclerosis (ALS), Danon disease, Rett Syndrome, Angleman Syndrome, infantile neuronal dystorpy, Gaucher's disease, Krabbe disease, metachromatic leukodystrophy, Salla disease, Farber disease or Spinal Musular Atrophy with progressive myoclonic Epilepsy (also reffered to as Jankovic-Rivera syndrome, Unverricht-Lundborg disease, AADC deficiency, Parkinson's disease, Batten disease, a neuronal ceroid lipofuscinosis disease, giant axonal neuropathy, a mucopolysaccharidosis disease (e.g., Hurler syndrome, MPS III A-D), neurofibromatosis, a spinocerebellar ataxia disease, Sandoff disease, GM2 gangliosidosis, Canavan disease, Cockayne syndrome, a pain disease or disorder, a neuropathy or nerve damage, or any combination thereof. Others are described elsewhere herein and/or will be appreciated by those of ordinary skill in the art in view of the description provided herein.

In some embodiments, the compositions described herein can be used for treating or preventing an eye disease or disorder. It will be appreciated that an eye disease or disorder is a disease or disorder that has a pathology or clinical symptom that involves one or more cells or cell types of the eye, including but not limited to, the optic nerve, rods, cones, retinal cells (e.g., photoreceptors, bipolar cells, ganglion cells, horizontal cells, and amacrine cells), and/or the like. The eye disease or disorder can be of genetic or non-genetic origin. Exemplary eye diseases and disorders include, without limitation, Stargardt disease, a Leber's congenital amaurosis (LCA) (e.g., Leber's congenital amaurosis type 2, LEBER CONGENITALAMAUROSIS (LCA) ANDEARLY-ONSET SEVERE RETINALDYSTROPHY (EOSRD)), Choroideremia, a macular degeneration, diabetic retinopathy, a retinopathy, vitelliform macular dystrophy, a macular dystrophy, Sorsby's fundus dystrophy, cataracts, glaucoma, optic neuropathies, Marfan syndrome, myopia, polypoidal choroidal vasculopathies, retinitis pigmentosa, uveal melanoma, X-linked retinoschisis, pattern dystrophy, achromatopsia, Blue cone monochromatism, Bornholm eye disease, ADGUCA1A-associated COD/CORD, autosomal dominant PRPH2 associated CORD, X-linkedRPGR-associatedCOD/CORD, fundus albipunctatus, Enhanced S-conesyndrome, Bietti crystalline comeoretinaldystorphy, or any combination thereof.

In some embodiments, the compositions described herein can be used for treating or preventing an inner ear disease or disorder. It will be appreciated that an eye disease or disorder is a disease or disorder that has a pathology or clinical symptom that involves one or more cells or cell types of the ear, and more particularly the inner ear, including but not limited to, hair cells, pillar cells, Boettcher's cells, Claudius' cells, spiral ganglion neurons, and Deiters' cells (phalangeal cells). The inner ear disease or disorder can be of genetic or non-genetic origin. Exemplary inner ear disease and disorders include, without limitation, GJB-2 deafness, Jeryell and Lange-Nielsen syndrome, Usher syndrome, Alport syndrome, Branchio-oto-renal syndrome, Waardenburg syndrome, Pendred syndrome, Stickler syndrome, Treacher Collins syndrome, CHARGE syndrome, Norrie disease, Perrault syndrome, Autosomal dominant Nonsyndromic hearing loss, utosomal Recessive Nonsyndromic Hearing Loss, X-linked nonsyndromic hearing loss, an auditory neuropathy, a congenital hearing loss, or any combination thereof.

In some embodiments, the compositions comprising a CNS specific targeting moiety of the present invention and/or cargos that can be delivered by such compositions can be used to treat or prevent pain or a pain disease or disorder in a subject. In some embodiments, a cargo is capable of modulating sensitivity to or pain sensation/perception in a subject. It will be appreciated that depending on the disease or condition, it can be desirable to increase pain sensitivity or perception (e.g., in the case of disease where there is no pain sensitivity) or decrease pain sensitivity, sensation, and/or perception (e.g., neuropathies and others).

In some embodiments, the cargo molecule can treat or prevent a Pain disease or disorder or pain resulting from a disease or disorder. In some embodiments, the pain disease or disorder causes a deleterious insensitivity or lack of sensitivity to pain. In some embodiments, the pain is due to trauma or damage to a tissue and/or nerve(s)/neurons that can be the result of disease (e.g., ischemia, virus, etc.) or external trauma or mechanical pain (e.g., acute injury, surgical wounds and/or amputation, thermal exposure, etc. In some embodiments, the pain disease or disorder involves dysfunction of one or more neurons, ganglions, or other cells of the CNS and/or peripheral nervous system. In some embodiments, the disease or disorder generates inappropriate, hyper-, or other wise deleterious pain negatively impacting quality of life. Exemplary pain diseases or disorders include, without limitation, HSAN-1, HSAN-2, HSAN-3 (familial dysautonomia—pain free phenotype), HSAN-4 (CIPA), mutilated foot, erythermalagia, paroxysmal extreme pain, and other insensitivities to pain, neuropathic pain, other chronic pain, and/or the like. Exemplary targets for genetic modifications for pain modulation include those involved in signal transduction and/or conduction and/or synaptic transmission (TRPV1/2/3/4, P2XR3, TRPM8, TRPA1, P2RX3, P2RY, BDKRB1/2, Htr3A, ACCNs, TRPV4, TRPC/P, ACCN1/2, SCN10A, SCN11A, SCN1,3, 4A, SCN9A, KCNQ, (other K+ channel genes), NR1, 2, GRIA1-4, GRIC1-5, NK1R, CACNA1A-S, CACNA2D1; genes of the microglia (e.g., TLR2/4. P2RX4/7, CCL2, CX3CRN1), genes of the CNS (e.g., BDNF, OPRD1/K1/M1, CNR1, GABRs, TNF, PLA2), genes of the PNS (e.g., IL1/6/12/18, COX-2, NTRK1, NGF, GDNF, TNF, LIF, CCL2, CNR2), genes and/or any one or more of the SNPs set forth in Table 1 of Foulkes and Wood. PLOS Genetics. 2008. https://doi.org/10.1371/journal.pgen.1000086; any one or more genes associated with a heritable pain condition (e.g., SPTLC1, IkbKAP protein gene, CCT4, Nav1.7 gene); ion channel related genes (e.g., (SCN9A, CACNG2, ZSCAN20, SCN11A), Neurotransmission (OPRM1, COMT, PRKCA, SLCA4, MPZ, GCH1), Metabolism (GCH1, TF, CP, TFRC, ACO1, FXN, SLC11A2, B2M, BMP6), Immune Response (HLA-A, HLA-B, HLA-DQB1, HLA-DRB1, IL6, IL1R2, IL10, TNF-α, GFRA2, HMGB1P46), SCN9A (NaV1.7), SCN10A (NaV1.8) and SCN11A (NaV1.9), GAD, or any combination thereof. In some embodiments, the cargo is a glutamic acid decarboxylase (GAD) which can provide GABA to recue pain, such as neuropathic pain. In some embodiments, the pain-associated genes are modified using a CRISPRi approach (e.g., a cargo molecule can contain CRISPRi molecule(s). In some embodiments, the pain-associated genes are modified using a CRISPRi-KRAB approach. See also e.g., Wolfe et al., Pain Medicine, Volume 10, Issue 7, October 2009, Pages 1325-1330, Moreno A M, Glaucilene F C, Alemán F et al. Long-lasting analgesia via targeted in vivoepigenetic repression of Nav1.7. bioRxiv711812 (2019). https://www.biorxiv.org/content/10.1101/71, Foulkes and Wood. PLOS Genetics. 2008. https://doi.org/10.1371/journal.pgen.1000086, the teachings of which can be adapted for use with the present invention.

Genetic diseases that can be treated are discussed in greater detail elsewhere herein (see e.g., discussion on Gene-modification based-therapies below). Other diseases can include, but are not limited to, any of the following: cancer (such as glioblastoma or other brain or CNS cancers), Acubetivacter infections, actinomycosis, African sleeping sickness, AIDS/HIV, ameobiasis, Anaplasmosis, Angiostrongyliasis, Anisakiasis, Anthrax, Acranobacterium haemolyticum infection, Argentine hemorrhagic fever, Ascariasis, Aspergillosis, Astrovirus infection, Babesiosis, Bacterial meningitis, Bacterial pneumonia, Bacterial vaginosis, Bacteroides infection, balantidiasis, Bartonellosis, Baylisascaris infection, BK virus infection, Black Piedra, Blastocytosis, Blastomycosis, Bolivian hemorrhagic fever, Botulism, Brazillian hemmorhagic fever, brucellosis, Bubonic plague, Burkholderia infection, buruli ulcer, calicivirus invention, campylobacteriosis, Candidasis, Capillariasis, Carrion's disease, Cat-scratch disease, cellulitis, Chagas Disease, Chancroid, Chickenpox, Chikungunya, Chlamydia, Chlamydia pneumoniae, Cholera, Chromoblastomycosis, Chytridiomycosis, Clonochiasis, Clostridium difficile colitis, Coccidioidomycosis, Colorado tick fever, rhinovirus/coronavirus invection (common cold), Cretzfeldt-Jakob disease, Crimean-congo hemorrhagic fever, Cryptococcosis, Cryptosporidosis, Cutaneous larva migrans (CLM), cyclosporiasis, cysticercosis, cytomegalovirus infection, Dengue fever, Desmodesmus infection, Dientamoebiasis, Diptheria, Diphylobothriasis, Dracunculiasis, Ebola, Echinococcosis, Ehrlichiosis, Enterobiasis, Enterococcus infection, Enterovirus infection, Epidemic typhus, Erthemia Infectisoum, Exanthem subitum, Fasciolasis, Fasciolopsiasis, fatal familial insomnia, filarisis, Clostridum perfingens infection, Fusobacterium infection, Gas gangrene (clostridial myonecrosis), Geotrichosis, Gerstmann-Straussler-Scheinker syndrome, Giardasis, Glanders, Gnathostomiasis, Gonorrhea, Granuloma inguinales, Group A streptococcal infection, Group B streptococcal infection, Haemophilus influenzae infection, Hand, foot, and mouth disease, hanta virus pulmonary syndrome, heartland virus disease, Helicobacter pylori infection, hemorrhagi fever with renal syndrome, Hendra virus infection, Hepatitis (all groups A, B, C, D, E), hepes simplex, histoplasmosis, hookworm infection, human bocavirus infection, human ewingii erlichosis, Human granulocytic anaplasmosis, human metapneymovirus infection, human monocytic ehrlichosis, human papaloma virus, Hymenolepiasis, Epstein-Barr infection, mononucleosis, influenza, isoporisis, Kawasaki disease, Kingell kingae infection, Kuru, Lasas fever, Leginollosis (Legionnaires's disease and Potomac Fever), Leishmaniasis, Leprosy, Leptospirosis, Listeriosis, Lyme disease, lymphatic filariasis, lymphocytic choriomeningitis, Malaria, Marburg hemorrhagic feaver, measals, Middle East respiratory syndrome, Meliodosis, menigitis, Menigococcal disease, Metagonimiasis, Microsporidosis, Molluscum contagiosum, Monkeypox, Mumps, Murine typhus, Mycoplasma pneumonia, Mycoplasma genitalium infection, Mycetoma, Myiasis, Conjunctivitis, Nipah virus infection, Norovirus, Variant Creutzfeldt-Jakob disease, Nocardosis, Onchocerciasis, Opisthorchiasis, Paracoccidioidomycosis, Paragonimiasis, Pasteurellosis, Pdiculosisi capitis, Pediculosis corpis, Pediculosis pubis, pelvic inflammatory disease, pertussis, plague, pneumococcal infection, pneumocystis pneumonia, pneumonia, poliomyelitis, prevotella infection, primary amoebic menigoencephalitis, progressive multifocal leukoencephalopathy, Psittacosis, Qfever, rabies, relapsing fever, respiratory syncytial virus infection, rhinovirus infection, rickettsial infection, Rickettsialpox, Rift Valley Fever, Rocky Mountain Spotted Fever, Rotavirus infection, Rubella, Salmonellosis, SARS, Scabies, Scarlet fever, Schistosomiais, Sepsis, Shigellosis, Shingles, Smallpox, Sporotrichosisi, Staphlococcol infection (including MRSA), strongyloidiasis, subacute sclerosing panecephalitis, Syphillis, Taeniasis, tetanus, Trichophyton species infection, Tocariasis, Toxoplasmosis, Trachoma, Trichinosis, Trichuriasis, Tuberculosis, Tularemia, Typhoid Fever, Typhus Fever, Ureaplasma urealyticum infection, Valley fever, Venezuelan equine encephalitis, Venezuelan hemorrhagic fever, Vibrio species infection, Viral pneumonia, West Nile Fever, White Piedra, Yersinia pseudotuberculosis, Yersiniosis, Yellow fever, Zeaspora, Zika fever, Zygomycosis and combinationsthereof.

Other diseases and disorders or symptoms thereof that can be treated using embodiments of the present invention include, but are not limited to, endocrine diseases (e.g., Type I and Type II diabetes, gestational diabetes, hypoglycemia. Glucagonoma, Goitre, Hyperthyroidism, hypothyroidism, thyroiditis, thyroid cancer, thyroid hormone resistance, parathyroid gland disorders, Osteoporosis, osteitis deformans, rickets, ostomalacia, hypopituitarism, pituitary tumors, etc.), skin conditions of infections and non-infection origin, eye diseases of infectious or non-infectious origin, gastrointestinal disorders of infectious or non-infectious origin, cardiovascular diseases of infectious or non-infectious origin, brain and neuron diseases of infectious or non-infectious origin, nervous system diseases of infectious or non-infectious origin, muscle diseases of infectious or non-infectious origin, bone diseases of infectious or non-infectious origin, reproductive system diseases of infectious or non-infectious origin, renal system diseases of infectious or non-infectious origin, blood diseases of infectious or non-infectious origin, lymphatic system diseases of infectious or non-infectious origin, immune system diseases of infectious or non-infectious origin, mental-illness of infectious or non-infectious origin and the like.

In some embodiments, the disease to be treated is a CNS or CNS related disease or disorder, such as a genetic CNS disease or disorder. Such CNS or CNS related disease (including genetic CNS disease or disorders) are described in greater detail elsewhere herein.

Other diseases and disorders will be appreciated by those of skill in the art.

Generally speaking, adoptive cell transfer involves the transfer of cells (autologous, allogeneic, and/or xenogeneic) to a subject. The cells may or may not be modified and/or otherwise manipulated prior to delivery to the subject. Manipulation can include genetic modification by one or more gene modifying agents. Exemplary gene modifying agents and systems are described in greater detail elsewhere herein and will be appreciated by those of ordinary skill in the art. Such gene or other modification compositions or systems can be delivered to a cell to be modified for adoptive therapy by one or more of the compositions described herein containing a CNS specific targeting moiety.

In some embodiments, an engineered cell as described herein can be included in an adoptive cell transfer therapy. In some embodiments, an engineered cell as described herein can be delivered to a subject in need thereof. In some embodiments, the cell can be isolated from a subject, manipulated in vitro such that it is capable of generating an engineered AAV capsid particle described herein to produce an engineered cell and delivered back to the subject in an autologous manner or to a different subject in an allogeneic or xenogeneic manner. The cell isolated, manipulated, and/or delivered can be a eukaryotic cell. The cell isolated, manipulated, and/or delivered can be a stem cell. The cell isolated, manipulated, and/or delivered can be a differentiated cell. The cell isolated, manipulated, and/or delivered can be a nervous system cell, such as a central nervous system cell, including but not limited to a neuron, a glial cell, an astrocyte, a Schwann cell, a microglial cell, or other neuron support cell, and/or other brain or CNS cell, or any combination thereof. Other specific cell types will instantly be appreciated by one of ordinary skill in the art.

In some embodiments, the isolated cell can be manipulated such that it becomes an engineered cell as described elsewhere herein (e.g., contain and/or express one or more engineered delivery system molecules or vectors described elsewhere herein). Methods of making such engineered cells are described in greater detail elsewhere herein.

The present invention also contemplates use of the engineered delivery system molecules, vectors, engineered cells, and/or engineered AAV capsid particles described herein to generate a gene drive via delivery of one or more cargo polynucleotides or production of engineered AAV capsid particles with one or more cargo polynucleotides capable of producing a gene drive. In some embodiments, the gene drive can be a Cas-mediated RNA-guided gene drive e.g., Cas- to provide RNA-guided gene drives, for example in systems analogous to gene drives described in PCT Patent Publication WO 2015/105928. Systems of this kind may for example provide methods for altering eukaryotic germline cells, by introducing into the germline cell a nucleic acid sequence encoding an RNA-guided DNA nuclease and one or more guide RNAs. The guide RNAs may be designed to be complementary to one or more target locations on genomic DNA of the germline cell. The nucleic acid sequence encoding the RNA guided DNA nuclease and the nucleic acid sequence encoding the guide RNAs may be provided on constructs between flanking sequences, with promoters arranged such that the germline cell may express the RNA guided DNA nuclease and the guide RNAs, together with any desired cargo-encoding sequences that are also situated between the flanking sequences. The flanking sequences will typically include a sequence which is identical to a corresponding sequence on a selected target chromosome, so that the flanking sequences work with the components encoded by the construct to facilitate insertion of the foreign nucleic acid construct sequences into genomic DNA at a target cut site by mechanisms such as homologous recombination, to render the germline cell homozygous for the foreign nucleic acid sequence. In this way, gene-drive systems are capable of introgressing desired cargo genes throughout a breeding population (Gantz et al., 2015, Highly efficient Cas9-mediated gene drive for population modification of the malaria vector mosquito Anopheles stephensi, PNAS 2015, published ahead of print Nov. 23, 2015, doi:10.1073/pnas.1521077112; Esvelt et al., 2014, Concerning RNA-guided gene drives for the alteration of wild populations eLife 2014; 3:e03401). In select embodiments, target sequences may be selected which have few potential off-target sites in a genome. Targeting multiple sites within a target locus, using multiple guide RNAs, may increase the cutting frequency and hinder the evolution of drive resistant alleles. Truncated guide RNAs may reduce off-target cutting. Paired nickases may be used instead of a single nuclease, to further increase specificity. Gene drive constructs (such as gene drive engineered delivery system constructs) may include cargo sequences encoding transcriptional regulators, for example to activate homologous recombination genes and/or repress non-homologous end-joining. Target sites may be chosen within an essential gene, so that non-homologous end-joining events may cause lethality rather than creating a drive-resistant allele. The gene drive constructs can be engineered to function in a range of hosts at a range of temperatures (Cho et al. 2013, Rapid and Tunable Control of Protein Stability in Caenorhabditis elegans Using a Small Molecule, PLoS ONE 8(8): e72393. doi:10.1371/journal.pone.0072393).

The engineered AAV capsid system molecules, vectors, engineered cells, and/or engineered delivery particles described herein, can be used to deliver cargo polynucleotides and/or otherwise be involved in modifying tissues for transplantation between two different persons (transplantation) or between species (xenotransplantation). Such techniques for generation of transgenic animals are described elsewhere herein. Interspecies transplantation techniques are generally known in the art. For example, RNA-guided DNA nucleases can be delivered using via engineered AAV capsid polynucleotides, vectors, engineered cells, and/or engineered AAV capsid particles described herein and can be used to knockout, knockdown or disrupt selected genes in an organ for transplant (e.g., ex vivo (e.g., after harvest but before transplantation) or in vivo (in donor or recipient)), animal, such as a transgenic pig (such as the human heme oxygenase-1 transgenic pig line), for example by disrupting expression of genes that encode epitopes recognized by the human immune system, i.e., xenoantigen genes. Candidate porcine genes for disruption may for example include α(1,3)-galactosyltransferase and cytidine monophosphate-N-acetylneuraminic acid hydroxylase genes (see PCT Patent Publication WO 2014/066505). In addition, genes encoding endogenous retroviruses may be disrupted, for example the genes encoding all porcine endogenous retroviruses (see Yang et al., 2015, Genome-wide inactivation of porcine endogenous retroviruses (PERVs), Science 27 Nov. 2015: Vol. 350 no. 6264 pp. 1101-1104). In addition, RNA-guided DNA nucleases may be used to target a site for integration of additional genes in xenotransplant donor animals, such as a human CD55 gene to improve protection against hyperacute rejection.

Where it is interspecies transplantation (such as human to human), the engineered AAV capsid system molecules, vectors, engineered cells, and/or engineered delivery particles described herein, can be used to deliver cargo polynucleotides and/or otherwise be involved to modify the tissue to be transplanted. In some embodiments, the modification can include modifying one or more HLA antigens or other tissue type determinants, such that the immunogenic profile is more similar or identical to the recipient's immunogenic profile than to the donor's so as to reduce the occurrence of rejection by the recipient. Relevant tissue type determinants are known in the art (such as those used to determine organ matching) and techniques to determine the immunogenic profile (which is made up of the expression signature of the tissue type determinants) are generally known in the art.

In some embodiments, the donor (such as before harvest) or recipient (after transplantation) can receive one or more of the engineered AAV capsid system molecules, vectors, engineered cells, and/or engineered delivery particles described herein that are capable of modifying the immunogenic profile of the transplanted cells, tissue, and/or organ. In some embodiments, the transplanted cells, tissue, and/or organ can be harvested from the donor and the engineered AAV capsid system molecules, vectors, engineered cells, and/or engineered delivery particles described herein capable of modifying the harvested cells, tissue, and/or organ to be, for example, less immunogenic or be modified to have some specific characteristic when transplanted in the recipient can be delivered to the harvested cells, tissue, and/or organ ex vivo. After delivery the cells, tissue, and/or organs can be transplanted into the donor.

Gene Modification and Treatment of Diseases with Genetic or Epigenetic Embodiments that Affect the CNS, Brain, and/or Neurons, the Eye and/or Inner Ear

The engineered delivery system molecules, vectors, engineered cells, and/or engineered delivery particles described herein (e.g., those with one or more targeting moieties, such as a CNS-specific targeting moiety described herein) can be used to modify genes or other polynucleotides and/or treat diseases of the CNS, brain, and/or neurons, the eye, and/or the inner ear with genetic and/or epigenetic embodiments. As described elsewhere herein the cargo molecule can be a polynucleotide that can be delivered to a cell and, in some embodiments, be integrated into the genome of the cell. In some embodiments, the cargo molecule(s) can be one or more CRISPR-Cas system components. In some embodiments, the CRISPR-Cas components, when delivered by an engineered AAV capsid particles described herein can be optionally expressed in the recipient cell and act to modify the genome of the recipient cell in a sequence specific manner. In some embodiments, the cargo molecules that can be packaged and delivered by the engineered AAV capsid particles described herein can facilitate/mediate genome modification via a method that is not dependent on CRISPR-Cas. Such non-CRISPR-Cas genome modification systems will instantly be appreciated by those of ordinary skill in the art and are also, at least in part, described elsewhere herein. In some embodiments, modification is at a specific target sequence. In other embodiments, modification is at locations that appear to be random throughout the genome.

Exemplary CNS, Brain, and/or Neuronal Disease-Associated Genes

Examples of CNS, brain, and/or neuronal disease-associated genes and polynucleotides that can be modified using the engineered delivery AAV delivery system molecules, vectors, capsids, engineered cells, and/or engineered delivery particles described herein are described below.

In some embodiments, a therapeutic or preventive, such as the engineered AAV capsids and systems thereof as described elsewhere herein, can be delivered to a subject in need thereof or a cell thereof to treat a brain, neuron, neurological, and/or central nervous system disease or disorder (CNS). In some embodiments the brain, neuron, neurological, and/or CNS disease or disorder can be caused, directly or indirectly, by one or mutations in one or more of the following genes as compared to normal or non-pathological variant of the same: in the case of Amyotrophic lateral sclerosis (ALS): SOD1, ALS2, STEX, FUS, TARDBP, VEGF (VEGF-a, VEGF-b, VEGF-c); in the case of Alzheimer's disease: E1, CHIP, UCH, UBB, Tau, LRP, PICALM, Clusterin, PS1, SORL1, CR1, Vldlr, Uba1, Uba3, CHIP28, Aqp1, Uchl1, Uchl3, APP, AAA, CVAP, AD1, APOE, AD2, PSEN2, AD4, STM2, APBB2, FE65L1, NOS3, PLAU, URK, ACE, DCP1, ACE1, MPO, PACIP1, PAXIP1L, PTIP, A2M, BLMH, BMH, PSEN1, AD3); in the case of Autism: Mecp2, BZRAP1, MDGA2, Sema5A, Neurexin 1, GLO1, MECP2, RTT, PPMX, MRX16, MRX79, NLGN3, NLGN4, KIAA1260, AUTSX2; in the case of Fragile X Syndrome: FMR2, FXR1, FXR2, mGLUR5; in the case of Huntington's disease and disease like disorders: HD, IT15, PRNP, PRIP, JPH3, JP3, HDL2, TBP, SCA17); in the case of Parkinson's disease: NR4A2, NURR1, NOT, TINUR, SNCAIP, TBP, SCA17, SNCA, NACP, PARK1, PARK4, DJ1, PARK7, LRRK2, PARK8, PINK1, PARK6, UCHL1, PARK5, SNCA, NACP, PARK1, PARK4, PRKN, PARK2, PDJ, DBH, NDUFV2, PINK1, x-synuclein); in the case of Rett syndrome: MECP2, RTT, PPMX, MRX16, MRX79, CDKL5, STK9, MECP2, RTT, PPMX, MRX16, MRX79, x-Synuclein, DJ-1; in the case of Schizophrenia: Neuregulin1 (Nrg1), Erb4 (receptor for Neuregulin), Complexin1 (Cplx1), Tph1 Tryptophan hydroxylase, Tph2, Tryptophan hydroxylase 2, Neurexin 1, GSK3, GSK3a, GSK3b, 5-HTT (Slc6a4), COMT, DRD (Drd1a), SLC6A3, DAOA, DTNBP1, Dao (Dao1)); in the case of Secretase Related Disorders (APH-1 (alpha and beta), Presenilin (Psen1), nicastrin, (Ncstn), PEN-2, Nos1, Parp1, Nat1, Nat2); in the case of Trinucleotide Repeat Disorders (HTT (Huntington's Dx), SBMA/SMAX1/AR (Kennedy's Dx), FXN/X25 (Friedrich's Ataxia), ATX3 (Machado-Joseph's Dx), ATXN1 and ATXN2 (spinocerebellar ataxias), DMPK (myotonic dystrophy), Atrophin-1 and Atn1 (DRPLA Dx), CBP (Creb-BP—global instability), VLDLR (Alzheimer's), Atxn7, Atxn10); in the case of diseases or disorders associated with or involving aberrant or abnormal axonal guidance signaling in the brain, neurons, and/or CNS: PRKCE; ITGAM; ROCK1; ITGA5; CXCR4; ADAM12; IGF1; RAC1; RAP1A; EIF4E; PRKCZ; NRP1; NTRK2; ARHGEF7; SMO; ROCK2; MAPK1; PGF; RAC2; PTPN11; GNAS; AKT2; PIK3CA; ERBB2; PRKCI; PTK2; CFL1; GNAQ; PIK3CB; CXCL12; PIK3C3; WNT11; PRKD1; GNB2L1; ABL1; MAPK3; ITGA1; KRAS; RHOA; PRKCD; PIK3C2A; ITGB7; GLI2; PXN; VASP; RAF1; FYN; ITGB1; MAP2K2; PAK4; ADAM17; AKT1; PIK3R1; GLI1; WNT5A; ADAM10; MAP2K1; PAK3; ITGB3; CDC42; VEGFA; ITGA2; EPHA8; CRKL; RND1; GSK3B; AKT3; PRKCA; in the case of diseases or disorders associated with or involving aberrant or abnormal actin cytoskeleton signaling in the brain, neurons, and/or CNS: ACTN4; PRKCE; ITGAM; ROCK1; ITGA5; IRAK1; PRKAA2; EIF2AK2; RAC1; INS; ARHGEF7; GRK6; ROCK2; MAPK1; RAC2; PLK1; AKT2; PIK3CA; CDK8; PTK2; CFL1; PIK3CB; MYH9; DIAPH1; PIK3C3; MAPK8; F2R; MAPK3; SLC9A1; ITGA1; KRAS; RHOA; PRKCD; PRKAA1; MAPK9; CDK2; PIM1; PIK3C2A; ITGB7; PPP1CC; PXN; VIL2; RAF1; GSN; DYRK1A; ITGB1; MAP2K2; PAK4; PIP5K1A; PIK3R1; MAP2K1; PAK3; ITGB3; CDC42; APC; ITGA2; TTK; CSNK1A1; CRKL; BRAF; VAV3; SGK; in the case of diseases or disorders associated with or involving Huntington's Disease signaling: PRKCE; IGF1; EP300; RCOR1; PRKCZ; HDAC4; TGM2; MAPK1; CAPNS1; AKT2; EGFR; NCOR2; SP1; CAPN2; PIK3CA; HDAC5; CREB1; PRKCI; HSPA5; REST; GNAQ; PIK3CB; PIK3C3; MAPK8; IGF1R; PRKD1; GNB2L1; BCL2L1; CAPN1; MAPK3; CASP8; HDAC2; HDAC7A; PRKCD; HDAC11; MAPK9; HDAC9; PIK3C2A; HDAC3; TP53; CASP9; CREBBP; AKT1; PIK3R1; PDPK1; CASP1; APAF1; FRAP1; CASP2; JUN; BAX; ATF4; AKT3; PRKCA; CLTC; SGK; HDAC6; CASP3; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal apoptosis regulation and/or signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: PRKCE; ROCK1; BID; IRAK1; PRKAA2; EIF2AK2; BAK1; BIRC4; GRK6; MAPK1; CAPNS1; PLK1; AKT2; IKBKB; CAPN2; CDK8; FAS; NFKB2; BCL2; MAP3K14; MAPK8; BCL2L1; CAPN1; MAPK3; CASP8; KRAS; RELA; PRKCD; PRKAA1; MAPK9; CDK2; PIM1; TP53; TNF; RAF1; IKBKG; RELB; CASP9; DYRK1A; MAP2K2; CHUK; APAF1; MAP2K1; NFKB1; PAK3; LMNA; CASP2; BIRC2; TTK; CSNK1A1; BRAF; BAX; PRKCA; SGK; CASP3; BIRC3; PARP1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal leukocyte extravasation signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: ACTN4; CD44; PRKCE; ITGAM; ROCK1; CXCR4; CYBA; RAC1; RAP1A; PRKCZ; ROCK2; RAC2; PTPN11; MMP14; PIK3CA; PRKCI; PTK2; PIK3CB; CXCL12; PIK3C3; MAPK8; PRKD1; ABL1; MAPK10; CYBB; MAPK13; RHOA; PRKCD; MAPK9; SRC; PIK3C2A; BTK; MAPK14; NOX1; PXN; VIL2; VASP; ITGB1; MAP2K2; CTNND1; PIK3R1; CTNNB1; CLDN1; CDC42; F11R; ITK; CRKL; VAV3; CTTN; PRKCA; MMP1; MMP9; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal integrin signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: ACTN4; ITGAM; ROCK1; ITGA5; RAC1; PTEN; RAP1A; TLN1; ARHGEF7; MAPK1; RAC2; CAPNS1; AKT2; CAPN2; PIK3CA; PTK2; PIK3CB; PIK3C3; MAPK8; CAV1; CAPN1; ABL1; MAPK3; ITGA1; KRAS; RHOA; SRC; PIK3C2A; ITGB7; PPP1CC; ILK; PXN; VASP; RAF1; FYN; ITGB1; MAP2K2; PAK4; AKT1; PIK3R1; TNK2; MAP2K1; PAK3; ITGB3; CDC42; RND3; ITGA2; CRKL; BRAF; GSK3B; AKT3; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal acute phase response signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: IRAK1; SOD2; MYD88; TRAF6; ELK1; MAPK1; PTPN11; AKT2; IKBKB; PIK3CA; FOS; NFKB2; MAP3K14; PIK3CB; MAPK8; RIPK1; MAPK3; IL6ST; KRAS; MAPK13; IL6R; RELA; SOCS1; MAPK9; FTL; NR3C1; TRAF2; SERPINE1; MAPK14; TNF; RAF1; PDK1; IKBKG; RELB; MAP3K7; MAP2K2; AKT1; JAK2; PIK3R1; CHUK; STAT3; MAP2K1; NFKB1; FRAP1; CEBPB; JUN; AKT3; ILIR1; IL6; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal PTEN signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: ITGAM; ITGA5; RAC1; PTEN; PRKCZ; BCL2L11; MAPK1; RAC2; AKT2; EGFR; IKBKB; CBL; PIK3CA; CDKN1B; PTK2; NFKB2; BCL2; PIK3CB; BCL2L1; MAPK3; ITGA1; KRAS; ITGB7; ILK; PDGFRB; INSR; RAF1; IKBKG; CASP9; CDKN1A; ITGB1; MAP2K2; AKT1; PIK3R1; CHUK; PDGFRA; PDPK1; MAP2K1; NFKB1; ITGB3; CDC42; CCND1; GSK3A; ITGA2; GSK3B; AKT3; FOXO1; CASP3; RPS6KB1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal p53 signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: PTEN; EP300; BBC3; PCAF; FASN; BRCA1; GADD45A; BIRC5; AKT2; PIK3CA; CHEK1; TP53INP1; BCL2; PIK3CB; PIK3C3; MAPK8; THBS1; ATR; BCL2L1; E2F1; PMAIP1; CHEK2; TNFRSF10B; TP73; RB1; HDAC9; CDK2; PIK3C2A; MAPK14; TP53; LRDD; CDKN1A; HIPK2; AKT1; PIK3R1; RRM2B; APAF1; CTNNB1; SIRT1; CCND1; PRKDC; ATM; SFN; CDKN2A; JUN; SNAI2; GSK3B; BAX; AKT3; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal aryl hydrocarbon receptor signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: HSPB1; EP300; FASN; TGM2; RXRA; MAPK1; NQO1; NCOR2; SP1; ARNT; CDKN1B; FOS; CHEK1; SMARCA4; NFKB2; MAPK8; ALDH1A1; ATR; E2F1; MAPK3; NRIP1; CHEK2; RELA; TP73; GSTP1; RB1; SRC; CDK2; AHR; NFE2L2; NCOA3; TP53; TNF; CDKN1A; NCOA2; APAF1; NFKB1; CCND1; ATM; ESR1; CDKN2A; MYC; JUN; ESR2; BAX; IL6; CYP1B1; HSP90AA1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal xenobiotic metabolism signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: PRKCE; EP300; PRKCZ; RXRA; MAPK1; NQO1; NCOR2; PIK3CA; ARNT; PRKCI; NFKB2; CAMK2A; PIK3CB; PPP2R1A; PIK3C3; MAPK8; PRKD1; ALDH1AI; MAPK3; NRIP1; KRAS; MAPK13; PRKCD; GSTP1; MAPK9; NOS2A; ABCB1; AHR; PPP2CA; FTL; NFE2L2; PIK3C2A; PPARGC1A; MAPK14; TNF; RAF1; CREBBP; MAP2K2; PIK3R1; PPP2R5C; MAP2K1; NFKB1; KEAP1; PRKCA; EIF2AK3; IL6; CYP1B1; HSP90AA1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal SAPK/JNK signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: PRKCE; IRAK1; PRKAA2; EIF2AK2; RAC1; ELK1; GRK6; MAPK1; GADD45A; RAC2; PLK1; AKT2; PIK3CA; FADD; CDK8; PIK3CB; PIK3C3; MAPK8; RIPK1; GNB2L1; IRS1; MAPK3; MAPK10; DAXX; KRAS; PRKCD; PRKAA1; MAPK9; CDK2; PIM1; PIK3C2A; TRAF2; TP53; LCK; MAP3K7; DYRK1A; MAP2K2; PIK3R1; MAP2K1; PAK3; CDC42; JUN; TTK; CSNK1A1; CRKL; BRAF; SGK; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal PPAr/RXR signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: PRKAA2; EP300; INS; SMAD2; TRAF6; PPARA; FASN; RXRA; MAPK1; SMAD3; GNAS; IKBKB; NCOR2; ABCA1; GNAQ; NFKB2; MAP3K14; STAT5B; MAPK8; IRS1; MAPK3; KRAS; RELA; PRKAA1; PPARGC1A; NCOA3; MAPK14; INSR; RAF1; IKBKG; RELB; MAP3K7; CREBBP; MAP2K2; JAK2; CHUK; MAP2K1; NFKB1; TGFBR1; SMAD4; JUN; IL1R1; PRKCA; IL6; HSP90AA1; ADIPOQ; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal NF-kappaB signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: IRAK1; EIF2AK2; EP300; INS; MYD88; PRKCZ; TRAF6; TBK1; AKT2; EGFR; IKBKB; PIK3CA; BTRC; NFKB2; MAP3K14; PIK3CB; PIK3C3; MAPK8; RIPK1; HDAC2; KRAS; RELA; PIK3C2A; TRAF2; TLR4; PDGFRB; TNF; INSR; LCK; IKBKG; RELB; MAP3K7; CREBBP; AKT1; PIK3R1; CHUK; PDGFRA; NFKB1; TLR2; BCL10; GSK3B; AKT3; TNFAIP3; IL1R1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal neuregulin signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: ERBB4; PRKCE; ITGAM; ITGA5; PTEN; PRKCZ; ELK1; MAPK1; PTPN11; AKT2; EGFR; ERBB2; PRKCI; CDKN1B; STAT5B; PRKD1; MAPK3; ITGA1; KRAS; PRKCD; STAT5A; SRC; ITGB7; RAF1; ITGB1; MAP2K2; ADAM17; AKT1; PIK3R1; PDPK1; MAP2K1; ITGB3; EREG; FRAP1; PSEN1; ITGA2; MYC; NRG1; CRKL; AKT3; PRKCA; HSP90AA1; RPS6KB1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal wnt and beta catenin signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: CD44; EP300; LRP6; DVL3; CSNK1E; GJA1; SMO; AKT2; PIN1; CDH1; BTRC; GNAQ; MARK2; PPP2R1A; WNT11; SRC; DKK1; PPP2CA; SOX6; SFRP2; ILK; LEF1; SOX9; TP53; MAP3K7; CREBBP; TCF7L2; AKT1; PPP2R5C; WNT5A; LRP5; CTNNB1; TGFBR1; CCND1; GSK3A; DVL1; APC; CDKN2A; MYC; CSNK1A1; GSK3B; AKT3; SOX2; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal insulin receptor signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: PTEN; INS; EIF4E; PTPN1; PRKCZ; MAPK1; TSC1; PTPN11; AKT2; CBL; PIK3CA; PRKCI; PIK3CB; PIK3C3; MAPK8; IRS1; MAPK3; TSC2; KRAS; EIF4EBP1; SLC2A4; PIK3C2A; PPP1CC; INSR; RAF1; FYN; MAP2K2; JAK1; AKT1; JAK2; PIK3R1; PDPK1; MAP2K1; GSK3A; FRAP1; CRKL; GSK3B; AKT3; FOXO1; SGK; RPS6KB1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal IL-6 signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: HSPB1; TRAF6; MAPKAPK2; ELK1; MAPK1; PTPN11; IKBKB; FOS; NFKB2; MAP3K14; MAPK8; MAPK3; MAPK10; IL6ST; KRAS; MAPK13; IL6R; RELA; SOCS1; MAPK9; ABCB1; TRAF2; MAPK14; TNF; RAF1; IKBKG; RELB; MAP3K7; MAP2K2; IL8; JAK2; CHUK; STAT3; MAP2K1; NFKB1; CEBPB; JUN; ILIR1; SRF; IL6; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal IGF-1 signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: IGF1; PRKCZ; ELK1; MAPK1; PTPN11; NEDD4; AKT2; PIK3CA; PRKCI; PTK2; FOS; PIK3CB; PIK3C3; MAPK8; IGF1R; IRS1; MAPK3; IGFBP7; KRAS; PIK3C2A; YWHAZ; PXN; RAF1; CASP9; MAP2K2; AKT1; PIK3R1; PDPK1; MAP2K1; IGFBP2; SFN; JUN; CYR61; AKT3; FOXO1; SRF; CTGF; RPS6KB1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal NRF2-mediated oxidative stress response pathway regulation or signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: PRKCE; EP300; SOD2; PRKCZ; MAPK1; SQSTM1; NQO1; PIK3CA; PRKCI; FOS; PIK3CB; PIK3C3; MAPK8; PRKD1; MAPK3; KRAS; PRKCD; GSTP1; MAPK9; FTL; NFE2L2; PIK3C2A; MAPK14; RAF1; MAP3K7; CREBBP; MAP2K2; AKT1; PIK3R1; MAP2K1; PP1B; JUN; KEAP1; GSK3B; ATF4; PRKCA; EIF2AK3; HSP90AA1; PRDX1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal PPAR (e.g. PPAR alpha, PPAR beta, PPAR delta, and/or PPAR gamma) regulation or signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: EP300; INS; TRAF6; PPARA; RXRA; MAPK1; IKBKB; NCOR2; FOS; NFKB2; MAP3K14; STAT5B; MAPK3; NRIP1; KRAS; PPARG; RELA; STAT5A; TRAF2; PPARGC1A; PDGFRB; TNF; INSR; RAF1; IKBKG; RELB; MAP3K7; CREBBP; MAP2K2; CHUK; PDGFRA; MAP2K1; NFKB1; JUN; ILIR1; HSP90AA1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal Fc Epsilon RI regulation or signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: PRKCE; RAC1; PRKCZ; LYN; MAPK1; RAC2; PTPN11; AKT2; PIK3CA; SYK; PRKCI; PIK3CB; PIK3C3; MAPK8; PRKD1; MAPK3; MAPK10; KRAS; MAPK13; PRKCD; MAPK9; PIK3C2A; BTK; MAPK14; TNF; RAF1; FYN; MAP2K2; AKT1; PIK3R1; PDPK1; MAP2K1; AKT3; VAV3; PRKCA; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal G-protein coupled receptor regulation or signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: PRKCE; RAP1A; RGS16; MAPK1; GNAS; AKT2; IKBKB; PIK3CA; CREB1; GNAQ; NFKB2; CAMK2A; PIK3CB; PIK3C3; MAPK3; KRAS; RELA; SRC; PIK3C2A; RAF1; IKBKG; RELB; FYN; MAP2K2; AKT1; PIK3R1; CHUK; PDPK1; STAT3; MAP2K1; NFKB1; BRAF; ATF4; AKT3; PRKCA; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal inositol phosphate metabolism regulation or signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: PRKCE; IRAK1; PRKAA2; EIF2AK2; PTEN; GRK6; MAPK1; PLK1; AKT2; PIK3CA; CDK8; PIK3CB; PIK3C3; MAPK8; MAPK3; PRKCD; PRKAA1; MAPK9; CDK2; PIM1; PIK3C2A; DYRK1A; MAP2K2; PIP5K1A; PIK3R1; MAP2K1; PAK3; ATM; TTK; CSNK1A1; BRAF; SGK; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal PDGF regulation or signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: EIF2AK2; ELK1; ABL2; MAPK1; PIK3CA; FOS; PIK3CB; PIK3C3; MAPK8; CAV1; ABL1; MAPK3; KRAS; SRC; PIK3C2A; PDGFRB; RAF1; MAP2K2; JAK1; JAK2; PIK3R1; PDGFRA; STAT3; SPHK1; MAP2K1; MYC; JUN; CRKL; PRKCA; SRF; STAT1; SPHK2; in the case of diseases or disorders associated with involving aberrant, pathologic, and/or abnormal VEGF regulation or signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: ACTN4; ROCK1; KDR; FLT1; ROCK2; MAPK1; PGF; AKT2; PIK3CA; ARNT; PTK2; BCL2; PIK3CB; PIK3C3; BCL2L1; MAPK3; KRAS; HIF1A; NOS3; PIK3C2A; PXN; RAF1; MAP2K2; ELAVL1; AKT1; PIK3R1; MAP2K1; SFN; VEGFA; AKT3; FOXO1; PRKCA; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal natural killer cell regulation or signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: PRKCE; RAC1; PRKCZ; MAPK1; RAC2; PTPN11; KIR2DL3; AKT2; PIK3CA; SYK; PRKCI; PIK3CB; PIK3C3; PRKD1; MAPK3; KRAS; PRKCD; PTPN6; PIK3C2A; LCK; RAF1; FYN; MAP2K2; PAK4; AKT1; PIK3R1; MAP2K1; PAK3; AKT3; VAV3; PRKCA; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal cell cycle G1/S checkpoint regulation or signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: HDAC4; SMAD3; SUV39H1; HDAC5; CDKN1B; BTRC; ATR; ABL1; E2F1; HDAC2; HDAC7A; RB1; HDAC11; HDAC9; CDK2; E2F2; HDAC3; TP53; CDKN1A; CCND1; E2F4; ATM; RBL2; SMAD4; CDKN2A; MYC; NRG1; GSK3B; RBL1; HDAC6; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal T-cell receptor regulation or signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: RAC1; ELK1; MAPK1; IKBKB; CBL; PIK3CA; FOS; NFKB2; PIK3CB; PIK3C3; MAPK8; MAPK3; KRAS; RELA; PIK3C2A; BTK; LCK; RAF1; IKBKG; RELB; FYN; MAP2K2; PIK3R1; CHUK; MAP2K1; NFKB1; ITK; BCL10; JUN; VAV3; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal death receptor regulation or signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: CRADD; HSPB1; BID; BIRC4; TBK1; IKBKB; FADD; FAS; NFKB2; BCL2; MAP3K14; MAPK8; RIPK1; CASP8; DAXX; TNFRSF10B; RELA; TRAF2; TNF; IKBKG; RELB; CASP9; CHUK; APAF1; NFKB1; CASP2; BIRC2; CASP3; BIRC3; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or FGF regulation or signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: RAC1; FGFR1; MET; MAPKAPK2; MAPK1; PTPN11; AKT2; PIK3CA; CREB1; PIK3CB; PIK3C3; MAPK8; MAPK3; MAPK13; PTPN6; PIK3C2A; MAPK14; RAF1; AKT1; PIK3R1; STAT3; MAP2K1; FGFR4; CRKL; ATF4; AK3; PRKCA; HGF; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or GM-CSF regulation or signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: LYN; ELK1; MAPK1; PTPN11; AKT2; PIK3CA; CAMK2A; STAT5B; PIK3CB; PIK3C3; GNB2L1; BCL2L1; MAPK3; ETS1; KRAS; RUNX1; PIM1; PIK3C2A; RAF1; MAP2K2; AKT1; JAK2; PIK3R1; STAT3; MAP2K1; CCND1; AK3; STAT1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or amyotrophic lateral sclerosis regulation or signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: BID; IGF1; RACI; BIRC4; PGF; CAPNS1; CAPN2; PIK3CA; BCL2; PIK3CB; PIK3C3; BCL2L1; CAPN1; PIK3C2A; TP53; CASP9; PIK3R1; RABSA; CASP1; APAF1; VEGFA; BIRC2; BAX; AKT3; CASP3; BIRC3; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or JAK/Stat regulation or signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: PTPN1; MAPK1; PTPN11; AKT2; PIK3CA; STAT5B; PIK3CB; PIK3C3; MAPK3; KRAS; SOCS1; STAT5A; PTPN6; PIK3C2A; RAF1; CDKN1A; MAP2K2; JAK1; AKT1; JAK2; PIK3R1; STAT3; MAP2K1; FRAP1; AKT3; STAT1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or nicotinate and nicotinamide metabolism regulation or signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: PRKCE; IRAK1; PRKAA2; EIF2AK2; GRK6; MAPK1; PLK1; AKT2; CDK8; MAPK8; MAPK3; PRKCD; PRKAA1; PBEF1; MAPK9; CDK2; PIM1; DYRK1A; MAP2K2; MAP2K1; PAK3; NT5E; TTK; CSNK1A1; BRAF; SGK; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or chemokine signaling regulation or signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: CXCR4; ROCK2; MAPK1; PTK2; FOS; CFL1; GNAQ; CAMK2A; CXCL12; MAPK8; MAPK3; KRAS; MAPK13; RHOA; CCR3; SRC; PPP1CC; MAPK14; NOX1; RAF1; MAP2K2; MAP2K1; JUN; CCL2; PRKCA; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or IL-2 signaling regulation or signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: ELK1; MAPK1; PTPN11; AKT2; PIK3CA; SYK; FOS; STAT5B; PIK3CB; PIK3C3; MAPK8; MAPK3; KRAS; SOCS1; STAT5A; PIK3C2A; LCK; RAFI; MAP2K2; JAK1; AKT1; PIK3R1; MAP2K1; JUN; AKT3; in the case of diseases or disorders associated with or involving synaptic long term depression in the brain, neurons, and/or CNS and/or diseases or disorders thereof: PRKCE; IGF1; PRKCZ; PRDX6; LYN; MAPK1; GNAS; PRKCI; GNAQ; PPP2R1A; IGF1R; PRKD1; MAPK3; KRAS; GRN; PRKCD; NOS3; NOS2A; PPP2CA; YWHAZ; RAF1; MAP2K2; PPP2R5C; MAP2K1; PRKCA; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or estrogen receptor regulation or signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: TAF4B; EP300; CARM1; PCAF; MAPK1; NCOR2; SMARCA4; MAPK3; NRIP1; KRAS; SRC; NR3C1; HDAC3; PPARGC1A; RBM9; NCOA3; RAF1; CREBBP; MAP2K2; NCOA2; MAP2K1; PRKDC; ESR1; ESR2; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or protein ubiquitination pathway activity, regulation, and/or signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: TRAF6; SMURF1; BIRC4; BRCA1; UCHL1; NEDD4; CBL; UBE2I; BTRC; HSPA5; USP7; USP10; FBXW7; USP9X; STUB1; USP22; B2M; BIRC2; PARK2; USP8; USP1; VHL; HSP90AA1; BIRC3; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or IL-10 regulation or signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: TRAF6; CCR1; ELK1; IKBKB; SP1; FOS; NFKB2; MAP3K14; MAPK8; MAPK13; RELA; MAPK14; TNF; IKBKG; RELB; MAP3K7; JAK1; CHUK; STAT3; NFKB1; JUN; IL1R1; IL6; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or Vitamin D receptor (VDR) and/or RXR regulation or signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: PRKCE; EP300; PRKCZ; RXRA; GADD45A; HES1; NCOR2; SP1; PRKCI; CDKN1B; PRKD1; PRKCD; RUNX2; KLF4; YY1; NCOA3; CDKN1A; NCOA2; SPP1; LRP5; CEBPB; FOXO1; PRKCA; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or TGF-beta regulation or signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: EP300; SMAD2; SMURF1; MAPK1; SMAD3; SMAD1; FOS; MAPK8; MAPK3; KRAS; MAPK9; RUNX2; SERPINE1; RAF1; MAP3K7; CREBBP; MAP2K2; MAP2K1; TGFBR1; SMAD4; JUN; SMAD5; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or Toll-like Receptor activity, regulation, and/or signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: IRAK1; EIF2AK2; MYD88; TRAF6; PPARA; ELK1; IKBKB; FOS; NFKB2; MAP3K14; MAPK8; MAPK13; RELA; TLR4; MAPK14; IKBKG; RELB; MAP3K7; CHUK; NFKB1; TLR2; JUN; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or p38 MAPK activity, regulation, and/or signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: HSPB1; IRAK1; TRAF6; MAPKAPK2; ELK1; FADD; FAS; CREB1; DDIT3; RPS6KA4; DAXX; MAPK13; TRAF2; MAPK14; TNF; MAP3K7; TGFBR1; MYC; ATF4; IL1R1; SRF; STAT1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or neurotrophin/TRK activity, regulation, and/or signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: NTRK2; MAPK1; PTPN11; PIK3CA; CREB1; FOS; PIK3CB; PIK3C3; MAPK8; MAPK3; KRAS; PIK3C2A; RAF1; MAP2K2; AKT1; PIK3R1; PDPK1; MAP2K1; CDC42; JUN; ATF4; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or FXR and/or RXR activity, regulation, and/or signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: INS; PPARA; FASN; RXRA; AKT2; SDC1; MAPK8; APOB; MAPK10; PPARG; MTTP; MAPK9; PPARGC1A; TNF; CREBBP; AKT1; SREBF1; FGFR4; AKT3; FOXO1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or synaptic long term potentiation in the brain, neurons, and/or CNS and/or diseases or disorders thereof: PRKCE; RAP1A; EP300; PRKCZ; MAPK1; CREB1; PRKCI; GNAQ; CAMK2A; PRKD1; MAPK3; KRAS; PRKCD; PPP1CC; RAF1; CREBBP; MAP2K2; MAP2K1; ATF4; PRKCA; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or calcium regulation and/or signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: RAP1A; EP300; HDAC4; MAPK1; HDAC5; CREB1; CAMK2A; MYH9; MAPK3; HDAC2; HDAC7A; HDAC11; HDAC9; HDAC3; CREBBP; CALR; CAMKK2; ATF4; HDAC6; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or EGF or EGFR regulation and/or signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: ELK1; MAPK1; EGFR; PIK3CA; FOS; PIK3CB; PIK3C3; MAPK8; MAPK3; PIK3C2A; RAF1; JAK1; PIK3R1; STAT3; MAP2K1; JUN; PRKCA; SRF; STAT1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or LPS/IL-1 mediated inhibition of RXR function, regulation and/or signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: IRAK1; MYD88; TRAF6; PPARA; RXRA; ABCA1; MAPK8; ALDH1A1; GSTP1; MAPK9; ABCB1; TRAF2; TLR4; TNF; MAP3K7; NR1H2; SREBF1; JUN; IL1R1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or LXR/RXR function, regulation and/or signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: FASN; RXRA; NCOR2; ABCA1; NFKB2; IRF3; RELA; NOS2A; TLR4; TNF; RELB; LDLR; NR1H2; NFKB1; SREBF1; IL1R1; CCL2; IL6; MMP9; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or amyloid processing in the brain, neurons, and/or CNS and/or diseases or disorders thereof: PRKCE; CSNK1E; MAPK1; CAPNS1; AKT2; CAPN2; CAPN1; MAPK3; MAPK13; MAPT; MAPK14; AKT1; PSEN1; CSNK1A1; GSK3B; AKT3; APP; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal IL-4 activity, signaling, and/or regulation in the brain, neurons, and/or CNS and/or diseases or disorders thereof: AKT2; PIK3CA; PIK3CB; PIK3C3; IRS1; KRAS; SOCS1; PTPN6; NR3C1; PIK3C2A; JAK1; AKT1; JAK2; PIK3R1; FRAP1; AKT3; RPS6KB1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal cell cycle: G2/M DNA damage checkpoint regulation activity, signaling, and/or regulation in the brain, neurons, and/or CNS and/or diseases or disorders thereof: EP300; PCAF; BRCA1; GADD45A; PLK1; BTRC; CHEK1; ATR; CHEK2; YWHAZ; TP53; CDKN1A; PRKDC; ATM; SFN; CDKN2A; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal purine metabolism signaling thereof, and/or regulation thereof in the brain, neurons, and/or CNS and/or diseases or disorders thereof: NME2; SMARCA4; MYH9; RRM2; ADAR; EIF2AK4; PKM2; ENTPD1; RAD51; RRM2B; TJP2; RAD51C; NT5E; POLD1; NME1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal cAMP-mediated signaling and/or regulation in the brain, neurons, and/or CNS and/or diseases or disorders thereof: RAP1A; MAPK1; GNAS; CREB1; CAMK2A; MAPK3; SRC; RAF1; MAP2K2; STAT3; MAP2K1; BRAF; ATF4; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal mitochondrial function in the brain, neurons, and/or CNS and/or diseases or disorders thereof: SOD2; MAPK8; CASP8; MAPK10; MAPK9; CASP9; PARK7; PSEN1; PARK2; APP; CASP3; AIF; CytC; SMAC (Diablo); Aifm-1; Aifm-2; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal notch signaling and/or regulation in the brain, neurons, and/or CNS and/or diseases or disorders thereof: HES1; JAG1; NUMB; NOTCH4; ADAM17; NOTCH2; PSEN1; NOTCH3; NOTCH1; DLL4; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal endoplasmic reticulum stress pathway activity, signaling, and/or regulation in the brain, neurons, and/or CNS and/or diseases or disorders thereof: HSPA5; MAPK8; XBP1; TRAF2; ATF6; CASP9; ATF4; EIF2AK3; CASP3; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal pyrimidine metabolism, signaling thereof, and/or regulation thereof in the brain, neurons, and/or CNS and/or diseases or disorders thereof: NME2; AICDA; RRM2; EIF2AK4; ENTPD1; RRM2B; NT5E; POLD1; NME1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal Parkinson's signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: UCHL1; MAPK8; MAPK13; MAPK14; CASP9; PARK7; PARK2; CASP3; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal Glycolysis/Gluconeogenesis activity, signaling, and/or regulation in the brain, neurons, and/or CNS and/or diseases or disorders thereof: HK2; GCK; GPI; ALDH1A1; PKM2; LDHA; HK1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal interferon activity, signaling, and/or regulation in the brain, neurons, and/or CNS and/or diseases or disorders thereof: IRF1; SOCS1; JAK1; JAK2; IFITM1; STAT1; IFIT3; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal sonic the hedgehog activity, signaling, and/or regulation in the brain, neurons, and/or CNS and/or diseases or disorders thereof: ARRB2; SMO; GLI2; DYRK1A; GLI1; GSK3B; DYRK1B; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal glycerophospholipid metabolism, signaling thereof, and/or regulation thereof in the brain, neurons, and/or CNS and/or diseases or disorders thereof: PLD1; GRN; GPAM; YWHAZ; SPHK1; SPHK2; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal phospholipid degradation, signaling thereof, and/or regulation thereof in the brain, neurons, and/or CNS and/or diseases or disorders thereof: PRDX6; PLD1; GRN; YWHAZ; SPHK1; SPHK2; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal tryptophan metabolism, signaling thereof, and/or regulation thereof in the brain, neurons, and/or CNS and/or diseases or disorders thereof: SIAH2; PRMT5; NEDD4; ALDH1A1; CYPIB1; SIAH1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal lysine degradation, signaling thereof, and/or regulation thereof in the brain, neurons, and/or CNS and/or diseases or disorders thereof: SUV39H1; EHMT2; NSD1; SETD7; PPP2R5C; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal nucleotide excision repair pathway activity, signaling thereof, and/or regulation thereof in the brain, neurons, and/or CNS and/or diseases or disorders thereof: ERCC5; ERCC4; XPA; XPC; ERCC1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal nucleotide starch and sucrose metabolism, signaling thereof, and/or regulation thereof in the brain, neurons, and/or CNS and/or diseases or disorders thereof: UCHL1; HK2; GCK; GPI; HK1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal aminosugars metabolism, signaling thereof, and/or regulation thereof in the brain, neurons, and/or CNS and/or diseases or disorders thereof: NQO1; HK2; GCK; HK1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal arachidonic acid metabolism, signaling thereof, and/or regulation thereof in the brain, neurons, and/or CNS and/or diseases or disorders thereof: PRDX6; GRN; YWHAZ; CYP1B1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal circadian rhythm signaling and/or regulation in the brain, neurons, and/or CNS and/or diseases or disorders thereof: CSNK1E; CREB1; ATF4; NR1D1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or coagulation system activity signaling and/or regulation in the brain, neurons, and/or CNS and/or diseases or disorders thereof: BDKRB1; F2R; SERPINE1; F3; a PAR (e.g. PAR1, PAR2, etc.) PLC, aPC; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal dopamine receptor signaling and/or regulation in the brain, neurons, and/or CNS and/or diseases or disorders thereof: PPP2R1A; PPP2CA; PPP1CC; PPP2R5C; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal Glutathione Metabolism signaling and/or regulation in the brain, neurons, and/or CNS and/or diseases or disorders thereof: IDH2; GSTP1; ANPEP; IDH1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal Glycerolipid Metabolism signaling and/or regulation in the brain, neurons, and/or CNS and/or diseases or disorders thereof: ALDH1A1; GPAM; SPHK1; SPHK2; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal Linoleic Acid Metabolism signaling and/or regulation in the brain, neurons, and/or CNS and/or diseases or disorders thereof: PRDX6; GRN; YWHAZ; CYP1B1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal Methionine Metabolism signaling and/or regulation in the brain, neurons, and/or CNS and/or diseases or disorders thereof: DNMT1; DNMT3B; AHCY; DNMT3A; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal Pyruvate Metabolism signaling and/or regulation in the brain, neurons, and/or CNS and/or diseases or disorders thereof: GLO1; ALDH1A1; PKM2; LDHA; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal Arginine and Proline Metabolism signaling and/or regulation in the brain, neurons, and/or CNS and/or diseases or disorders thereof: ALDH1A1; NOS3; NOS2A; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal Eicosanoid signaling and/or regulation in the brain, neurons, and/or CNS and/or diseases or disorders thereof: PRDX6; GRN; YWHAZ; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal fructose and mannose metabolism signaling and/or regulation in the brain, neurons, and/or CNS and/or diseases or disorders thereof: HK2; GCK; HK1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal antigen presentation pathway activity, signaling and/or regulation in the brain, neurons, and/or CNS and/or diseases or disorders thereof: CALR; B2M; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal steroid biosynthesis in the brain, neurons, and/or CNS and/or diseases or disorders thereof: NQO1; DHCR7; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal butanoate metabolism in the brain, neurons, and/or CNS and/or diseases or disorders thereof: ALDH1A1; NLGN1; in the case of diseases or disorders associated with or involving an aberrant, pathologic, and/or abnormal citrate cycle in the brain, neurons, and/or CNS and/or diseases or disorders thereof: IDH2; IDH1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal fatty acid metabolism in the brain, neurons, and/or CNS and/or diseases or disorders thereof: ALDH1A1; CYP1B1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal Glycerophospholipid metabolism in the brain, neurons, and/or CNS and/or diseases or disorders thereof: PRDX6; CHKA; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal histidine metabolism in the brain, neurons, and/or CNS and/or diseases or disorders thereof: PRMT5; ALDH1A1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal inositol metabolism in the brain, neurons, and/or CNS and/or diseases or disorders thereof: ERO1L; APEX1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal Phenylalanine metabolism in the brain, neurons, and/or CNS and/or diseases or disorders thereof: PRDX6; PRDX1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal Seleno amino acid metabolism in the brain, neurons, and/or CNS and/or diseases or disorders thereof: PRMT5; AHCY; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal Sphingolipid metabolism in the brain, neurons, and/or CNS and/or diseases or disorders thereof: SPHK1; SPHK2; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal Aminophosphonate metabolism in the brain, neurons, and/or CNS and/or diseases or disorders thereof: PRMT5; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal androgen and/or estrogen metabolism in the brain, neurons, and/or CNS and/or diseases or disorders thereof: PRMT5; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal Ascorbate and Aldarate metabolism in the brain, neurons, and/or CNS and/or diseases or disorders thereof: ALDH1A1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal Cysteine Metabolism in the brain, neurons, and/or CNS and/or diseases or disorders thereof: LDHA; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal fatty acid biosynthesis in the brain, neurons, and/or CNS and/or diseases or disorders thereof: FASN; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal glutamate receptor signaling in the brain, neurons, and/or CNS and/or diseases or disorders thereof: GNB2L1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal Pentose Phosphate pathway in the brain, neurons, and/or CNS and/or diseases or disorders thereof: GPI; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal retinol metabolism in the brain, neurons, and/or CNS and/or diseases or disorders thereof: ALDH1A1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal Pentose and Glucuronate interconversions in the brain, neurons, and/or CNS and/or diseases or disorders thereof: UCHL1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal Riboflavin Metabolism in the brain, neurons, and/or CNS and/or diseases or disorders thereof: TYR; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal Tyrosine Metabolism in the brain, neurons, and/or CNS and/or diseases or disorders thereof: PRMT5, TYR; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal Ubiquinone biosynthesis in the brain, neurons, and/or CNS and/or diseases or disorders thereof: PRMT5; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal Valine, leucine and isoleucine degradation in the brain, neurons, and/or CNS and/or diseases or disorders thereof: ALDH1A1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal glycine, serine, and threonine metabolism in the brain, neurons, and/or CNS and/or diseases or disorders thereof: CHKA; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal lysine degradation in the brain, neurons, and/or CNS and/or diseases or disorders thereof: ALDH1A1; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal pain or pain signaling or pain signal generation in the brain, neurons, and/or CNS and/or diseases or disorders thereof: TRPM7; TRPC5; TRPC6; TRPC1; Cnr1; cnr2; Grk2; Trpa1; Pomc; Cgrp; Crf; Pka; Era; Nr2b; TRPM5; Prkaca; Prkacb; Prkar1a; Prkar2a; in the case of diseases or disorders associated with or involving aberrant, pathologic, and/or abnormal brain, neuron, and/or CNS development and/or diseases or disorders thereof: BMP-4; Chordin (Chrd); Noggin (Nog); WNT (Wnt2; Wnt2b; Wnt3a; Wnt4; Wnt5a; Wnt6; Wnt7b; Wnt8b; Wnt9a; Wnt9b; Wnt10a; Wnt10b; Wnt16); beta-catenin; Dkk-1; Frizzled related proteins; Otx-2; Gbx2; FGF-8; Reelin; Dab1; unc-86 (Pou4f1 or Bm3a); Numb; Reln; in the case of diseases or disorders associated with or involving prion disorders of or in the brain, neuron, and/or CNS and/or diseases or disorders thereof: Prp; in the case of substance or activity additions involving activities of the brain, neuron, and/or CNS: Prkce (alcohol); Drd2; Drd4; ABAT (alcohol); GRIA2; Grm5; Grin1; Htr1b; Grin2a; Drd3; Pdyn; Gria1 (alcohol); in the case of diseases or disorders associated with or involving PI3K/AKT signaling and/or regulation thereof in the brain, neuron, and/or CNS and/or diseases or disorders thereof: PRKCE; ITGAM; ITGA5; IRAK1; PRKAA2; EIF2AK2; PTEN; EIF4E; PRKCZ; GRK6; MAPK1; TSC1; PLK1; AKT2; IKBKB; PIK3CA; CDK8; CDKN1B; NFKB2; BCL2; PIK3CB; PPP2R1A; MAPK8; BCL2L1; MAPK3; TSC2; ITGA1; KRAS; EIF4EBP1; RELA; PRKCD; NOS3; PRKAA1; MAPK9; CDK2; PPP2CA; PIM1; ITGB7; YWHAZ; ILK; TP53; RAF1; IKBKG; RELB; DYRK1A; CDKN1A; ITGB1; MAP2K2; JAK1; AKT1; JAK2; PIK3R1; CHUK; PDPK1; PPP2R5C; CTNNB1; MAP2K1; NFKB1; PAK3; ITGB3; CCND1; GSK3A; FRAP1; SFN; ITGA2; ITK; CSNK1A1; BRAF; GSK3B; AKT3; FOXO1; SGK; HSP90AA1; RPS6KB1; in the case of diseases or disorders associated with or involving ERK/MAPK signaling and/or regulation thereof in the brain, neuron, and/or CNS and/or diseases or disorders thereof: PRKCE; ITGAM; ITGA5; HSPB1; IRAK1; PRKAA2; EIF2AK2; RAC1; RAP1A; TLN1; EIF4E; ELK1; GRK6; MAPK1; RAC2; PLK1; AKT2; PIK3CA; CDK8; CREB1; PRKCI; PTK2; FOS; RPS6KA4; PIK3CB; PPP2R1A; PIK3C3; MAPK8; MAPK3; ITGA1; ETS1; KRAS; MYCN; EIF4EBP1; PPARG; PRKCD; PRKAA1; MAPK9; SRC; CDK2; PPP2CA; PIM1; PIK3C2A; ITGB7; YWHAZ; PPP1CC; KSR1; PXN; RAF1; FYN; DYRK1A; ITGB1; MAP2K2; PAK4; PIK3R1; STAT3; PPP2R5C; MAP2K1; PAK3; ITGB3; ESR1; ITGA2; MYC; TTK; CSNK1A1; CRKL; BRAF; ATF4; PRKCA; SRF; STAT1; SGK; in the case of diseases or disorders associated with or involving glucocorticoid receptor signaling and/or regulation thereof in the brain, neuron, and/or CNS and/or diseases or disorders thereof: RACI; TAF4B; EP300; SMAD2; TRAF6; PCAF; ELK1; MAPK1; SMAD3; AKT2; IKBKB; NCOR2; UBE2I; PIK3CA; CREB1; FOS; HSPA5; NFKB2; BCL2; MAP3K14; STAT5B; PIK3CB; PIK3C3; MAPK8; BCL2L1; MAPK3; TSC22D3; MAPK10; NRIP1; KRAS; MAPK13; RELA; STAT5A; MAPK9; NOS2A; PBX1; NR3C1; PIK3C2A; CDKN1C; TRAF2; SERPINE1; NCOA3; MAPK14; TNF; RAF1; IKBKG; MAP3K7; CREBBP; CDKN1A; MAP2K2; JAK1; IL8; NCOA2; AKT1; JAK2; PIK3R1; CHUK; STAT3; MAP2K1; NFKB1; TGFBR1; ESR1; SMAD4; CEBPB; JUN; AR; AKT3; CCL2; MMP1; STAT1; IL6; HSP90AA1; in the case of diseases or disorders associated with or involving ephrin receptor signaling and/or regulation thereof in the brain, neuron, and/or CNS and/or diseases or disorders thereof: PRKCE; ITGAM; ROCK1; ITGA5; CXCR4; IRAK1; PRKAA2; EIF2AK2; RAC1; RAP1A; GRK6; ROCK2; MAPK1; PGF; RAC2; PTPN11; GNAS; PLK1; AKT2; DOK1; CDK8; CREB1; PTK2; CFL1; GNAQ; MAP3K14; CXCL12; MAPK8; GNB2L1; ABL1; MAPK3; ITGA1; KRAS; RHOA; PRKCD; PRKAA1; MAPK9; SRC; CDK2; PIM1; ITGB7; PXN; RAF1; FYN; DYRK1A; ITGB1; MAP2K2; PAK4; AKT1; JAK2; STAT3; ADAM10; MAP2K1; PAK3; ITGB3; CDC42; VEGFA; ITGA2; EPHA8; TTK; CSNK1A1; CRKL; BRAF; PTPN13; ATF4; AKT3; SGK; in the case of diseases or disorders associated with or involving B cell receptor signaling and/or regulation thereof in the brain, neuron, and/or CNS and/or diseases or disorders thereof: RAC1; PTEN; LYN; ELK1; MAPK1; RAC2; PTPN11; AKT2; IKBKB; PIK3CA; CREB1; SYK; NFKB2; CAMK2A; MAP3K14; PIK3CB; PIK3C3; MAPK8; BCL2L1; ABL1; MAPK3; ETS1; KRAS; MAPK13; RELA; PTPN6; MAPK9; EGR1; PIK3C2A; BTK; MAPK14; RAF1; IKBKG; RELB; MAP3K7; MAP2K2; AKT1; PIK3R1; CHUK; MAP2K1; NFKB1; CDC42; GSK3A; FRAP1; BCL6; BCL10; JUN; GSK3B; ATF4; AKT3; VAV3; RPS6KB1; in the case of Infantile neuroaxonal dystroph: PLA2G6; in the case of Gaucher's disease: GBA; in the case of Krabbe disease: GALC; in the case of metachromatic leukodystrophy: ARSA and/or PRSP, isoform specific Saposin B replacement; in the case of Salla disease: SLC17A5; in the case of Farber disease or spinal muscular atrophy with progressive myoclonic epilepsy (also referred to as Jankovic-Rivera syndrome): ASAH1; in the case of Unverricht-Lundborg disease: CSTB; in the case of AADC deficiency: AADC; in the case of autosomal recessive forms of Parkinson's disease: PRKN, and others; in the case of Batten disease: CLN3; in the case of giant axonal neuropathy: GAN; in the case of mucopolysacchariodosis diseases (including MOS1H (Hurler syndrome), MPSII (Hunter syndrome), MPS III A-D: IDUA, IDS, SGSH, NAGLU, HGSNAT, GNS; in the case of Sandhoff disease (HEXB); in the case of GM2 gangliosidosis, AB variant: GM2A; in the case of Canavan disease: ASPA; in the case of cockayne syndrome: CSA or CSB; in the case of neurofibromatosis: NF1 or NF2; or any combination thereof.

Examples of eye disease-associated genes and polynucleotides that can be modified using the engineered delivery AAV delivery system molecules, vectors, capsids, engineered cells, and/or engineered delivery particles described herein are described below. The compositions described herein can be delivered to one or both eyes to treat or prevent an eye disease, disorder or symptom thereof.

The compositions described herein can be used to correct ocular defects that arise from several genetic mutations further described in Genetic Diseases of the Eye, Second Edition, edited by Elias I. Traboulsi, Oxford University Press, 2012.

In some embodiments, the condition to be treated or targeted is an eye disorder. In some embodiments, the eye disorder may include glaucoma. In some embodiments, the eye disorder includes a retinal degenerative disease. In some embodiments, the retinal degenerative disease is selected from Stargardt disease, Bardet-Biedl Syndrome, Best disease, Blue Cone Monochromacy, Choroidermia, Cone-rod dystrophy, Congenital Stationary Night Blindness, Enhanced S-Cone Syndrome, Juvenile X-Linked Retinoschisis, Leber Congenital Amaurosis, Malattia Leventinesse, Norrie Disease or X-linked Familial Exudative Vitreoretinopathy, Pattern Dystrophy, Sorsby Dystrophy, Usher Syndrome, Retinitis Pigmentosa, Achromatopsia or Macular dystrophies or degeneration, Retinitis Pigmentosa, Achromatopsia, and age related macular degeneration. In some embodiments, the retinal degenerative disease is Leber Congenital Amaurosis (LCA) or Retinitis Pigmentosa. Other exemplary eye diseases are described in greater detail elsewhere herein.

In the case of macular degeneration and/or diabetic retinopathy, the gene target can be VEGF, where the gene expression or gene product of VEGF is reduced or eliminated in the eye, particularly the retina, and particularly when applied subretinally or via another ocular administration route.

In the case of Best disease, the gene or gene product target can be RDS or VMD2, where knockdown/reduction or elimination of the gene expression or gene product can provide a therapeutic or otherwise beneficial effect, particularly when applied subretinally or via another ocular administration route.

In the case of Sorsby's fundus dystrophy, the gene or gene product target can be TIMP3, where knockdown/reduction or elimination of the gene expression or gene product can provide a therapeutic or otherwise beneficial effect, particularly when applied subretinally or via another ocular administration route.

In the case of Stargardt disease, the gene or gene product target can be ABCA4, where knockdown/reduction or elimination of the gene expression or gene product can provide a therapeutic or otherwise beneficial effect, particularly when applied subretinally or via another ocular administration route.

In the case of Leber's congenital amaurosis type 2, the gene or gene product target can be RPE65, where knockdown/reduction or elimination of the gene expression or gene product can provide a therapeutic or otherwise beneficial effect, particularly when applied subretinally or via another ocular administration route.

In the case of Choroideremia, the gene or gene product target can be CHM, where knockdown/reduction or elimination of the gene expression or gene product can provide a therapeutic or otherwise beneficial effect, particularly when applied subretinally or via another ocular administration route.

Other exemplary eye diseases and/or disorders and genetic targets for treatment or prevention are shown in the Tables below and in Genes and Genetics in Eye Diseases: A Genomic Medicine Approach for Investigating Hereditary and Inflammatory Ocular Disorders. International Journal of Ophthalmology, 2018 and Inherited Retinal Diseases: Therapeutics, Clinical Trials and End Points—A Review. Clinical & Experimental Ophthalmology, 2021, 49, 270-288, and the Herediary Ocular Disease Database—available at PG-6T disorders.eyes.arizona.edu/for-patients/handout-list.

DiseaseGene/variant
AMDNOS2A, CFH, CF, C2, C3, CFB, HTRA1/LOC, MMP-9,
TIMP-3, SLC16A8, etc.
CataractGEMIN4, CYP51A1, RIC1, TAPT1, TAF1A, WDR87, APE1,
MIP, Cx50/GJA3 & 8, CRYAA, CRYBB2, PRX, POLR3B,
XRCC1, ZNF350, EPHA2, etc.
GlaucomaCALM2, MPP-7, Optineurin, LOX1, CYP1B1, CAV1/2,
MYOC, PITX2, FOXC1, PAX6, CYP1B1, LTBP2, etc.
Inherited optic neuropathiesComplex I or ND genes, OPA1, RPE65, etc.
Marfan syndromeFBN1, TGFBR2, MTHFR, MTR, MTRR, etc.
MyopiaHGF, C-MET, UMODL1, MMP-1/2, PAX6, CBS, MTHFR,
IGF-1, UHRF1BP1L, PTPRR, PPFIA2, P4HA2, etc.
Polypoidal choroidalC2, C3, CFH, SERPING1, PEDF, ARMS2-HTRA1, FGD6,
vasculopathiesABCG1, LOC387715, CETP, etc.
Retinitis pigmentosa (RCD)RPGR, PRPF3, HK1, AGBL5, etc.
Stargardt's diseaseABC1, ABCA4, CRB1, etc.
Uveal melanomaPTEN, BAP1, GNAQ, GNA11, DDEF1, SF3B1, EIF1AX,
CDKN2A, p14ARF, HERC2/OCA2, etc.
DiseaseGene/variant
Best diseaseBEST1
X-Linked retinoschisisRS1
Pattern dystrophyPRPH2
Sorsby fundus dystrophyTIMP3
AchromatopsiaCNGB3, CNGA3, GNAT2, ATF6, PDE6H, PDE6C
Blu cone monochromatismOPN1LW, OPN1MW,
Bornholm eye diseaseOPN1LW, OPN1MW
ADGUCA1A-associatedGUCA1A
COD/CORD
ADGUCY2D-associatedGUCY2D
COD/CORD
Autosomal dominantPRPH2-PRPH2
associated CORD
Autosomal recessiveABCA4-ABCA4
associated COD/CORD
X-linkedRPGR-RPGR
associatedCOD/CORD
Fundus Albipunctatus (FA)RDH5, RLBP1, RPE65
RCD (retinitis pigmentosa)MERTK, MYO7A, USH2A, PDE6B, RLBP1, RHO, RP2
Enhanced S-conesyndromeNR2E3
(ESCS)
Bietti crystallineCYP4V2
corneoretinaldystrophy (BCD)
LEBERGUCY2D, CEP290, RPE65, AIPL1
CONGENITALAMAUROSIS
(LCA) ANDEARLY-ONSET
SEVERE
RETINALDYSTROPHY
(EOSRD)
Choroideremia (CHM)CHM

Examples of ear, particularly inner ear, disease-associated genes and polynucleotides that can be modified using the engineered delivery AAV delivery system molecules, vectors, capsids, engineered cells, and/or engineered delivery particles described herein are described below. The compositions described herein can be delivered to one or both ears, particularly to the inner ear, to treat or prevent an ear disease, disorder or symptom thereof, particularly an inner ear disease, disorder, or symptom thereof.

In certain example embodiments, the inner ear disease or disorder is GJB-2 deafness, Jeryell and Lange-Nielsen syndrome, Usher syndrome, Alport syndrome, Branchio-oto-renal syndrome, Waardenburg syndrome, Pendred syndrome, Stickler syndrome, Treacher Collins syndrome, CHARGE syndrome, Norrie disease, Perrault syndrome, Autosomal dominant Nonsyndromic hearing loss, utosomal Recessive Nonsyndromic Hearing Loss, X-linked nonsyndromic hearing loss, an auditory neuropathy, a congenital hearing loss, or any combination thereof.

In the case of GJB-2 deafness, the GJB-2 gene can be replaced. Genes associated with CHARGE syndrome: SFMA3E, CHD7. Genes associated with Norrie Disease: NDP. Genes associated with Pendred Syndrome: FOMO1, KCNJ10. Genes associated with Perrault syndrome: HSD17B4, HARS2, CLPP*, LARS2, TWNK ERAL1.

Genes associated with Autosomal Dominant Nonsyndromic Hearing Loss may comprise: DIAPH1, KCNQ4, GJB3, IFNLR1, GJB2, GJB6, MYH14, CEACAM16, GSDME/DFNA5, WFS1, LMX1A, TECTA, COCH, EYA4, MYO7A, COL11A2, POU4F3, MYH9, ACTG1, MYO6, SIX1, SLC17A8, REST, GRHL2, NLRP3, TMC1, COL11A1, CRYM, P2RX2, CCDC50, MIRN96, TJP2, TNC, SMAC/DIABLO. TBC1D24, CD164, OSBPL2, HOMER2, KITLG, MCM2, PTPRQ, DMXL2, MYO3A, PDE1C, TRRAP, PLS1, ATP2B2, SCD5, SLC12A2, MAP1B, RIPOR2/FAM65B. Genes associated with Autosomal Recessive Nonsyndromic Hearing Loss may comprise: GJB2, MYO7A, MYO15A, SLC26A4, TMIE, TMC1, TMPRSS3, OTOF, CDH23, GIPC3, STRC, USHIC, OTOG, TECTA, OTOA, PCDH15, RDX, GRXCR1, GAB1, TRIOBP, CLDN14, MYO3A, WHRN, CDC14A, ESRRB, ESPN, MYO6, HGF, ILDR1, ADCY1, CIB2, MARVELD2, BDP1, COL11A2, PDZD7, PJVK, SLC22A4, SLC26A5, LRTOMT/COMT2, DCDC2, LHFPL5, S1PR2, PNPT1, BSND, MSRB3, SYNE4, LOXID1, TPRN, GPSM2, PTPRQ, OTOGL, TBC1D24, ELMOD3, KARS, SERPINB6, CABP2, NARS2, MET, TSPEAR, TMEM132E, PPIP5K2, GRXCR2, EPS8, CLIC5, FAM65B/RIPOR2, EPS8L2, ROR1, WBP2, ESRP1, MPZL2, CEACAM16, GRAP, SPNS2, CLDN9, CLRN2, GAS2. Genes associated X-Linked Nonsyndromic Hearing Loss PRPS1, POU3F4, SMPX, AIFM1, COL4A6. Genes associated with Auditory Neuropathy: DIAPH3.

Other exemplary diseases and associated target gene or gene products for treatment or prevention are shown in the table below and further described in Congenital Hearing Loss. Nature Reviews Disease Primers, 2017, 3.

TABLE 5
SyndromeProteins involved (coding genes)
Jervell andPotassium voltage-gated channel subfamily E member 1
Lange-Nielsen(KCNE1) and potassium voltage-gated channel subfamily KQT
member 1 (KCNQ1)
UsherUsher syndrome type 1: Unconventional myosin-VIIa
(MYO7A), harmonin (USH1C), cadherin-23 (CDH23),
protocadherin-15 (PCDH15), Usher syndrome type-1G protein
(USH1G) and calcium and integrin-binding family member 2
(CIB2)
Usher syndrome type 2: usherin (USH2A), adhesion G protein-
coupled receptor V1 (ADGRV1) and whirlin (WHRN)
Usher syndrome type 3: clarin-1 (CLRN1)
AlportCollagen alpha-3(IV) chain (COL4A3), collagen alpha-4(IV)
chain (COL4A4) and collagen alpha-5(IV) chain (COL4A5)
Branchio-Eyes absent homolog 1 (EYA1), homeobox protein SIX1
oto-renal(SIX1) and homeobox protein SIX5 (SIX5)
WaardenburgPaired box protein Pax-3 (PAX3), microphthalmia-associated
transcription factor (MITF, endothelin-3 (EDN3), endothelin B
receptor (EDNRB), zinc finger protein SNAI2 (SNAI2) and
transcription factor SOX-10 (SOX10)
PendredPendrin (SLC26A4)
SticklerCollagen alpha-1151 chain (COL2A1), collagen alpha-1(IX)
chain (COL9A1), collagen alpha-2(IX) chain (COL9A2),
collagen alpha-1(XI) chain (COL11A1) and collagen alpha-
2(XI) chain (COL11A2)
TreacherTreacle protein (TCOF1), DNA-directed RNA polymerases I
Collinsand III subunit RPAC1 (POLR1C) and DNA-directed RNA
polymerases I and III subunit RPAC2 (POLR1D)

It will be appreciated that in any case where the gene is defective, a gene replacement strategy, gene editing or other approach can be appropriate.

Thus, also described herein are methods of inducing one or more mutations in a eukaryotic or prokaryotic cell (in vitro, i.e., in an isolated eukaryotic cell) as herein discussed comprising delivering to cell a vector as described herein. The mutation(s) can include the introduction, deletion, or substitution of one or more nucleotides at a target sequence of cell(s). In some embodiments, the mutations can include the introduction, deletion, or substitution of 1-75 nucleotides at each target sequence of said cell(s). The mutations can include the introduction, deletion, or substitution of 1, 5, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 50, or 75 nucleotides at each target sequence. The mutations can include the introduction, deletion, or substitution of 5, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 50, or 75 nucleotides at each target sequence of said cell(s). The mutations include the introduction, deletion, or substitution of 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 50, or 75 nucleotides at each target sequence of said cell(s). The mutations can include the introduction, deletion, or substitution of 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 50, or 75 nucleotides at each target sequence of said cell(s). The mutations can include the introduction, deletion, or substitution of 40, 45, 50, 75, 100, 200, 300, 400 or 500 nucleotides at each target sequence of said cell(s). The mutations can include the introduction, deletion, or substitution of 500, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700, 1800, 1900, 2000, 2100, 2200, 2300, 2400, 2500, 2600, 2700, 2800, 2900, 3000, 3100, 3200, 3300, 3400, 3500, 3600, 3700, 3800, 3900, 4000, 4100, 4200, 4300, 4400, 4500, 4600, 4700, 4800, 4900, 5000, 5100, 5200, 5300, 5400, 5500, 5600, 5700, 5800, 5900, 6000, 6100, 6200, 6300, 6400, 6500, 6600, 6700, 6800, 6900, 7000, 7100, 7200, 7300, 7400, 7500, 7600, 7700, 7800, 7900, 8000, 8100, 8200, 8300, 8400, 8500, 8600, 8700, 8800, 8900, 9000, 9100, 9200, 9300, 9400, 9500, 9600, 9700, 9800, or 9900 to 10000 nucleotides at each target sequence of said cell(s).

In some embodiments, the modifications can include the introduction, deletion, or substitution of nucleotides at each target sequence of said cell(s) via nucleic acid components (e.g., guide(s) RNA(s) or sgRNA(s)), such as those mediated by a CRISPR-Cas system.

In some embodiments, the modifications can include the introduction, deletion, or substitution of nucleotides at a target or random sequence of said cell(s) via a non CRISPR-Cas system or technique. Such techniques are discussed elsewhere herein, such as where engineered cells and methods of generating the engineered cells and organisms are discussed.

For minimization of toxicity and off-target effect when using a CRISPR-Cas system, it may be important to control the concentration of Cas mRNA and guide RNA delivered. Optimal concentrations of Cas mRNA and guide RNA can be determined by testing different concentrations in a cellular or non-human eukaryote animal model and using deep sequencing the analyze the extent of modification at potential off-target genomic loci. Alternatively, to minimize the level of toxicity and off-target effect, Cas nickase mRNA (for example S. pyogenes Cas9-like with the D10A mutation) can be delivered with a pair of guide RNAs targeting a site of interest. Guide sequences and strategies to minimize toxicity and off-target effects can be as in WO 2014/093622 (PCT/US2013/074667); or, via mutation as herein.

Typically, in the context of an endogenous CRISPR system, formation of a CRISPR complex (comprising a guide sequence hybridized to a target sequence and complexed with one or more Cas proteins) results in cleavage of one or both strands in or near (e.g., within 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 50, or more base pairs from) the target sequence. Without wishing to be bound by theory, a tracr sequence, which may comprise or consist of all or a portion of a wild-type tracr sequence (e.g., about or more than about 20, 26, 32, 45, 48, 54, 63, 67, 85, or more nucleotides of a wild-type tracr sequence), may also form part of a CRISPR complex, such as by hybridization along at least a portion of the tracr sequence to all or a portion of a tracr mate sequence that is operably linked to a guide sequence.

In one embodiment, the invention provides a method of modifying a target polynucleotide in a eukaryotic cell. In some embodiments, the method includes delivering an engineered targeting moiety, polypeptide, polynucleotide, vector, vector system, particle, viral (e.g., AAV) particle, cell, or any combination thereof described herein having a CRISPR-Cas molecule as a cargo molecule to a subject and/or cell. The CRISPR-Cas system molecule(s) delivered can complex to bind to the target polynucleotide, e.g., to effect cleavage of said target polynucleotide, thereby modifying the target polynucleotide, wherein the CRISPR complex comprises a CRISPR enzyme complexed with a guide sequence hybridized to a target sequence within said target polynucleotide, wherein said guide sequence can be linked to a tracr mate sequence which in turn hybridizes to a tracr sequence. In some embodiments, said cleavage comprises cleaving one or two strands at the location of the target sequence by said CRISPR enzyme. In some embodiments, said cleavage results in decreased transcription of a target gene. In some embodiments, the method further comprises repairing said cleaved target polynucleotide by homologous recombination with an exogenous template polynucleotide, wherein said repair results in a mutation comprising an insertion, deletion, or substitution of one or more nucleotides of said target polynucleotide. In some embodiments, said mutation results in one or more amino acid changes in a protein expressed from a gene comprising the target sequence. In some embodiments, the method further comprises delivering one or more vectors to said eukaryotic cell, wherein one or more vectors comprise the CRISPR enzyme and one or more vectors drive expression of one or more of: the guide sequence linked to the tracr mate sequence, and the tracr sequence. In some embodiments, said CRISPR enzyme drive expression of one or more of: the guide sequence linked to the tracr mate sequence, and the tracr sequence. In some embodiments such CRISPR enzyme are delivered to the eukaryotic cell in a subject. In some embodiments, said modifying takes place in said eukaryotic cell in a cell culture. In some embodiments, the method further comprises isolating said eukaryotic cell from a subject prior to said modifying. In some embodiments, the method further comprises returning said eukaryotic cell and/or cells derived therefrom to said subject. In some embodiments, the isolated cells can be returned to the subject after delivery of one or more engineered targeting moieties, polypeptides, polynucleotides, vectors, vector systems, particles, viral (e.g., AAV) particles, cells, or any combination thereof described herein to the isolated cell. In some embodiments, the isolated cells can be returned to the subject after delivering one or more molecules of the engineered delivery system described herein to the isolated cell, thus making the isolated cells engineered cells as previously described.

The targeting moieties, polypeptides, polynucleotides, vectors, vector systems, particles, viral (e.g., AAV) particles, cells, or any combination thereof described herein described herein can be used in a screening assay and/or cell selection assay. The engineered targeting moieties, polypeptides, polynucleotides, vectors, vector systems, particles, viral (e.g., AAV) particles, cells, or any combination thereof described herein can be delivered to a subject and/or cell. In some embodiments, the cell is a eukaryotic cell. The cell can be in vitro, ex vivo, in situ, or in vivo. The targeting moieties, polypeptides, polynucleotides, vectors, vector systems, particles, viral (e.g., AAV) particles, cells, or any combination thereof described herein can introduce an exogenous molecule or compound, such as a cargo, to subject or cell to which they are delivered. The presence of an exogenous molecule or compound can be detected which can allow for identification of a cell and/or attribute thereof. In some embodiments, the delivered molecules or particles can impart a gene or other nucleotide modification (e.g., mutations, gene or polynucleotide insertion and/or deletion, etc.). In some embodiments the nucleotide modification can be detected in a cell by sequencing. In some embodiments, the nucleotide modification can result in a physiological and/or biological modification to the cell that results in a detectable phenotypic change in the cell, which can allow for detection, identification, and/or selection of the cell. In some embodiments, the phenotypic change can be cell death, such as embodiments where binding of a CRISPR complex to a target polynucleotide results in cell death. Embodiments of the invention allow for selection of specific cells without requiring a selection marker or a two-step process that may include a counter-selection system. The cell(s) may be prokaryotic or eukaryotic cells.

In one embodiment the invention provides for a method of selecting one or more cell(s) by introducing one or more mutations in a gene in the one or more cell (s), the method comprising: introducing one or more vectors, which can include one or more engineered delivery system molecules or vectors described elsewhere herein, into the cell (s), wherein the one or more vectors can include a CRISPR enzyme and/or drive expression of one or more of: a guide sequence linked to a tracr mate sequence, a tracr sequence, and an editing template; or other polynucleotide to be inserted into the cell and/or genome thereof; wherein, for example that which is being expressed is within and expressed in vivo by the CRISPR enzyme and/or the editing template, when included, comprises the one or more mutations that abolish CRISPR enzyme cleavage; allowing homologous recombination of the editing template with the target polynucleotide in the cell(s) to be selected; allowing a CRISPR complex to bind to a target polynucleotide to effect cleavage of the target polynucleotide within said gene, wherein the CRISPR complex comprises the CRISPR enzyme complexed with (1) the guide sequence that is hybridized to the target sequence within the target polynucleotide, and (2) the tracr mate sequence that is hybridized to the tracr sequence, wherein binding of the CRISPR complex to the target polynucleotide induces cell death, thereby allowing one or more cell(s) in which one or more mutations have been introduced to be selected. In a preferred embodiment, the CRISPR enzyme is a Cas protein. In another embodiment of the invention the cell to be selected may be a eukaryotic cell.

The screening methods involving the engineered targeting moieties, polypeptides, polynucleotides, vectors, vector systems, particles, viral (e.g., AAV) particles, cells, or any combination thereof described herein, including but not limited to those that deliver one more CRISPR-Cas system molecules to cell, can be used in detection methods such as fluorescence in situ hybridization (FISH). In some embodiments, one or more components of an engineered CRISPR-Cas system that includes a catalytically inactive Cas protein, can be delivered by engineered targeting moieties, polypeptides, polynucleotides, vectors, vector systems, particles, viral (e.g., AAV) particles, cells, or any combination thereof described herein to a cell and used in a FISH method. The CRISPR-Cas system can include an inactivated Cas protein (dCas) (e.g., a dCas9), which lacks the ability to produce DNA double-strand breaks may be fused with a marker, such as fluorescent protein, such as the enhanced green fluorescent protein (eEGFP) and co-expressed with small guide RNAs to target pericentric, centric and teleomeric repeats in vivo. The dCas system can be used to visualize both repetitive sequences and individual genes in the human genome. Such new applications of labelled dCas, dCas CRISPR-Cas systems, engineered targeting moieties, polypeptides, polynucleotides, vectors, vector systems, particles, viral (e.g., AAV) particles, cells, or any combination thereof described herein can be used in imaging cells and studying the functional nuclear architecture, especially in cases with a small nucleus volume or complex 3-D structures. (Chen B, Gilbert L A, Cimini B A, Schnitzbauer J, Zhang W, Li G W, Park J, Blackbum E H, Weissman J S, Qi L S, Huang B. 2013. Dynamic imaging of genomic loci in living human cells by an optimized CRISPR/Cas system. Cell 155(7):1479-91. doi: 10.1016/j.cell.2013.12.001., the teachings of which can be applied and/or adapted to the CRISPR systems described herein. A similar approach involving a polynucleotide fused to a marker (e.g., a fluorescent marker) can be delivered to a cell via engineered targeting moieties, polypeptides, polynucleotides, vectors, vector systems, particles, viral (e.g., AAV) particles, cells, or any combination thereof described herein and integrated into the genome of the cell and/or otherwise interact with a region of the genome of a cell for FISH analysis.

Similar approaches for studying other cell organelles and other cell structures can be accomplished by delivering to the cell (e.g., via an engineered targeting moieties, polypeptides, polynucleotides, vectors, vector systems, particles, viral (e.g., AAV) particles, cells, or any combination thereof described herein) one or more molecules fused to a marker (such as a fluorescent marker), wherein the molecules fused to the marker are capable of targeting one or more cell structures. By analyzing the presence of the markers, one can identify and/or image specific cell structures.

In some embodiments, the engineered targeting moieties, polypeptides, polynucleotides, vectors, vector systems, particles, viral (e.g., AAV) particles, cells, or any combination thereof described herein can be used in a screening assay inside or outside of a cell. In some embodiments, the screening assay can include delivering a CRISPR-Cas cargo molecule(s) via engineered targeting moieties, polypeptides, polynucleotides, vectors, vector systems, particles, viral (e.g., AAV) particles, cells, or any combination thereof described herein.

Use of the present system in screening is also provided by the present invention, e.g., gain of function screens. Cells which are artificially forced to overexpress a gene are able to down regulate the gene over time (re-establishing equilibrium) e.g., by negative feedback loops. By the time the screen starts, the unregulated gene might be reduced again. Other screening assays are discussed elsewhere herein.

In an embodiment, the invention provides a cell from or of an in vitro method of delivery, wherein the method comprises contacting the delivery system with a cell, optionally a eukaryotic cell, whereby there is delivery into the cell of constituents of the delivery system, and optionally obtaining data or results from the contacting, and transmitting the data or results.

In an embodiment, the invention provides a cell from or of an in vitro method of delivery, wherein the method comprises contacting the delivery system with a cell, optionally a eukaryotic cell, whereby there is delivery into the cell of constituents of the delivery system, and optionally obtaining data or results from the contacting, and transmitting the data or results; and wherein the cell product is altered compared to the cell not contacted with the delivery system, for example altered from that which would have been wild type of the cell but for the contacting. In an embodiment, the cell product is non-human or animal. In some embodiments, the cell product is human.

In some embodiments, a host cell is transiently or non-transiently transfected with one or more vectors described herein. In some embodiments, a cell is transfected as it naturally occurs in a subject optionally to be reintroduced therein. In some embodiments, a cell that is transfected is taken from a subject. In some embodiments, the cell obtained from or is derived from cells taken from a subject, such as a cell line. Delivery mechanisms and techniques of the targeting moieties, polypeptides, polynucleotides, vectors, vector systems, particles, viral (e.g., AAV) particles, cells, or any combination thereof described herein.

In some embodiments, it is envisaged to introduce one or more of the engineered targeting moieties, polypeptides, polynucleotides, vectors, vector systems, particles, viral (e.g., AAV) particles, cells, or any combination thereof described herein directly to the host cell. For instance, the engineered AAV capsid system molecule(s) can be delivered together with one or more cargo molecules to be packaged into an engineered AAV particle.

In some embodiments, the invention provides a method of expressing an engineered delivery molecule and cargo molecule to be packaged in an engineered viral (e.g., AAV) particle in a cell that can include the step of introducing the vector according any of the vector delivery systems disclosed herein.

Described in certain example embodiments herein are assays and methods for screening and identifying cell and tissue surface receptors that facilitate transduction by one or more of the CNS specific targeting moieties of the present invention. In some embodiments, such a method can be based upon an RNAi, CRISPR activation (CRISPRa), CRISPR inhibition (CRISPRi) or CRISPR knockdown or knockout approach. In some embodiments, such a method can be based upon a small molecule library screening.

In some embodiments, the method includes contacting one or more cells with a CRISPRa, CRISPRi, or CRISPRkd/ko system or component thereof thereby increasing or decreasing expression of genes to which the system is targeted and transducing the one or more cells with a composition comprising a targeting moiety effective to target a CNS cell of the present invention, and detecting, quantifying, or otherwise measuring transduction efficiency of the composition a targeting moiety effective to target a CNS cell of the present invention to determine or otherwise identify genes, pathways, programs, receptors, and/or the like involved with or that mediates transduction of the compositions comprising a targeting moiety effective to target a CNS cell of the present invention and/or are capable of enhancing and/or reducing transduction by one or more of the compositions comprising a targeting moiety effective to target a CNS cell of the present invention. In some embodiments, the CRISPRa, CRISPRi, CRISPRkd/ko system comprises a dCas, such as a dCas9, dCas12, or other inactive Cas which are described in greater detail elsewhere herein. In some embodiments the CRSIPRi system comprises a dCas12 General principles of CRISPRa, CRISPRi, and CRISPRko/kd screens are known in the art. See also e.g., Chong et al., Trends Cell Biol. 2020 August; 30(8):619-627; Ramkumar et al., Blood Adv. 2020 Jul. 14; 4(13):2899-2911; Semesta et al., PLoS Genet. 2020 Oct. 14; 16(10); Kampamann et al., ACS Chem Biol. 2018 Feb. 16; 13(2):406-416; Sanson et al., Nat Commun. 2018 Dec. 21; 9(1):5416; Gilbert et al., Cell. 2014 Oct. 23; 159(3):647-61; Tian et al., Neuron. 2019 Oct. 23; 104(2):239-255.e12; Tian et al., Nat Neurosci. 2021 July; 24(7):1020-1034; Kampmann et al., Nat Rev Neurol. 2020 September; 16(9):465-480; Schuster et al., Trends Biotechnol. 2019 January; 37(1):38-55; Dominguez et al., Nat Rev Mol Cell Biol. 2016 January; 17(1):5-15; Dudek et al., Mol Ther. 2020 Feb. 5; 28(2):367-381; Chow and Chen. Trends Cancer. 2018 May; 4(5):349-358, Hanna and Doench. Nat Biotechnol. 2020 July; 38(7):813-823, Qi et al., Cell. 152(5):1173-1183 (2013); the teachings of which can be adapted for use with the present invention.

In some embodiments, the method includes contacting one or more cells with one or more small molecules, such as a small molecule or chemical library in which the small molecules contained in the library have known effects on particular cell surface molecules and/or receptors, optionally those known to be involved with viral, and more particularly AAV, transduction, and transducing composition a targeting moiety effective to target a CNS cell of the present invention and detecting, quantifying, or otherwise measuring transduction efficiency of the composition a targeting moiety effective to target a CNS cell of the present invention to determine or otherwise identify cell surface molecules and/or receptors and/or the like involved with or that mediates transduction of the compositions comprising a targeting moiety effective to target a CNS cell of the present invention and/or are capable of enhancing and/or reducing transduction by one or more of the compositions comprising a targeting moiety effective to target a CNS cell of the present invention.

The screening can be carried out using any suitable low or high throughput approaches, examples of which are provided elsewhere herein and are generally known in the art. In some embodiments, the screening can be done in vitro or ex vivo using cells, cell populations, organoids, tissue explants, and/or the like. In some embodiments, the screening can be done in vivo, such as via animal models, including, but not limited to mouse and non-human primates.

In some embodiments, the compositions comprising a targeting moiety effective to target a CNS cell of the present invention contain a cargo molecule that is a reporter molecule to facilitate transduction detection, quantification and measurement. Exemplary reporter cargo molecules are described in greater detail elsewhere herein.

In some embodiments, the method further includes directed evolution of viral, such as AAV, capsids based on genes, pathways, programs, cell-surface receptors and/or the like identified in a screen previously described so as to further evolve n-mer motifs to enhance transduction efficacy of the CNS targeting moieties.

The invention is further described in the following examples, which do not limit the scope of the invention described in the claims. Further embodiments are illustrated in the following Examples which are given for illustrative purposes only and are not intended to limit the scope of the invention.

FIG. 1 demonstrates the adeno-associated virus (AAV) transduction mechanism, which results in production of mRNA. As is demonstrated in FIG. 1, functional transduction of a cell by an AAV particle can result in the production of an mRNA strand. Non-functional transduction would not produce such a product despite the viral genome being detectable using a DNA-based assay. Thus, mRNA-based detection assays to detect transduction by e.g., an AAV can be more stringent and provide feedback as to the functionality of a virus particle that is able to functionally transduce a cell. FIG. 2 shows a graph that can demonstrate that mRNA-based selection of AAV variants can be more stringent than DNA-based selection. The virus library was expressed under the control of a CMV promoter.

FIGS. 3A-3B show graphs that can demonstrate a correlation between the virus library and vector genome DNA (FIG. 3A) and mRNA (FIG. 3B) in the liver. FIGS. 4A-4F show graphs that can demonstrate capsid variants expressed at the mRNA level identified in different tissues.

FIGS. 5A-5C show graphs that can demonstrate capsid mRNA expression in different tissues under the control of cell-type specific promoters (as noted on x-axis). CMV was included as an exemplary constitutive promoter. CK8 is a muscle-specific promoter. MHCK7 is a muscle-specific promoter. hSyn is a neuron specific promoter.

Generally, an AAV capsid library can be generated by expressing engineered capsid vectors each containing an engineered AAV capsid polynucleotide previously described in an appropriate AAV producer cell line. See e.g., FIG. 8. This can generate an AAV capsid library that can contain one more desired cell specific engineered AAV capsid variant. FIG. 7 shows a schematic demonstrating embodiments of generating an AAV capsid variant library, particularly insertion of a random n-mer (n=e.g., 3-25 or 3-15 amino acids) into a wild-type AAV, e.g., AAV9. In this example, random 7-mers were inserted between aa588-589 of variable region VIII of AAV9 viral protein and used to form the viral genome containing vectors with one variant per vector. As shown in FIG. 8, the capsid variant vector library was used to generate AAV particles where each capsid variant encapsulated its coding sequence as the vector genome. FIG. 9 shows vector maps of representative AAV capsid plasmid library vectors (see e.g., FIG. 8) that can be used in an AAV vector system to generate an AAV capsid variant library. The library can be generated with the capsid variant polynucleotide under the control of a tissue specific promoter or constitutive promoter. The library was also made with capsid variant polynucleotide that included a polyadenylation signal.

As shown in FIG. 6A the AAV capsid library can be administered to various non-human animals for a first round of mRNA-based selection. As shown in FIG. 1, the transduction process by AAVs and related vectors can result in the production of an mRNA molecule that is reflective of the genome of the virus that transduced the cell. As is at least demonstrated in the Examples herein, mRNA based selection can be more specific and effective to determine a virus particle capable of functionally transducing a cell because it is based on the functional product produced as opposed to just detecting the presence of a virus particle in the cell by measuring the presence of viral DNA.

As is further shown in FIG. 6A, after first-round administration, one or more engineered AAV virus particles having a desired capsid variant can then be used to form a filtered AAV capsid library. Desirable AAV virus particles can be identified by measuring the mRNA expression of the capsid variants and determining which variants are highly expressed in the desired cell type(s) as compared to non-desired cells type(s). Those that are highly expressed in the desired cell, tissue, and/or organ type are the desired AAV capsid variant particles. In some embodiments, the AAV capsid variant encoding polynucleotide is under control of a tissue-specific promoter that has selective activity in the desired cell, tissue, or organ.

The engineered AAV capsid variant particles identified from the first round can then be administered to various non-human animals. In some embodiments, the animals used in the second round of selection and identification are not the same as those animals used for first round selection and identification. Similar to round 1, after administration the top expressing variants in the desired cell, tissue, and/or organ type(s) can be identified by measuring viral mRNA expression in the cells. The top variants identified after round two can then be optionally barcoded and optionally pooled. In some embodiments, top variants from the second round can then be administered to a non-human primate to identify the top cell-specific variant(s), particularly if the end use for the top variant is in humans. Administration at each round can be systemic. As further shown in FIG. 6B after the second round of selection, a third round of selection, which can optionally include benchmarking against known, control, and/or standard (e.g., benchmark) variants can be performed.

FIG. 10 shows a graph that can demonstrate the viral titer (calculated as AAV9 vector genome/15 cm dish) produced by libraries generated using different promoters. As demonstrated in FIG. 10, virus titer was not affected significantly be the use of different promoters.

CNS n-mer inserts were generated as described elsewhere herein and then screened for transduction efficiency in various strains of mice (C57BL/6J and BALB/cJ). Table 1 shows the top motifs based on CNS transduction. As previously discussed, each n-mer insert's transduction efficacy in CNS cells was tested with both AQ and DG as the aa587 and aa588 (the two amino acids in the AAV immediately preceding the n-mer insert. Some exemplary n-mer inserts that stood out when preceded by AQ are KTVGTVY (SEQ ID NO: 3), RSVGSVY (SEQ ID NO: 4), RYLGDAS (SEQ ID NO: 5), WVLPSGG (SEQ ID NO: 6), VTVGSIY (SEQ ID NO: 7), VRGSSIL (SEQ ID NO: 8), RHHGDAA (SEQ ID NO: 9), VIQAMKL (SEQ ID NO: 10), LTYGMAQ (SEQ ID NO: 11), LRIGLSQ (SEQ ID NO: 12), GDYSMIV (SEQ ID NO: 13), VNYSVAL (SEQ ID NO: 14), RHIADAS (SEQ ID NO: 15), RYLGDAT (SEQ ID NO: 16), QRVGFAQ (SEQ ID NO; 17), QIAHGYST (SEQ ID NO: 18), WTLESGH (SEQ ID NO: 19), and GENSARW (SEQ ID NO: 20).

Some exemplary n-mer inserts that stood out when preceded by DG are ASNPGRW (SEQ ID NO: 22), WTLESGH (SEQ ID NO: 23), REQKKLW (SEQ ID NO: 24), ERLLVQL (SEQ ID NO: 25), RMQRTLY (SEQ ID NO: 26), and REQQKLW (SEQ ID NO: 21). Engineered AAVs including a CNS n-mer of Table 1 demonstrated the ability to specifically transduce CNS cells in both strains of mice, which is in contrast to the commonly used in the art CNS AAV. Without being bound by theory, this observation can demonstrate that the engineered AAVs containing an CNS-specific n-mer insert described herein can operate through a different receptor on the surface of CNS cells than the conventional AAV used in the art to achieve CNS specificity. Given that n-mer inserts preceded by AQ with top scores did not necessarily perform the same when preceded by DG can suggest that the 3D structure of the capsid conferred by the n-mer and its interaction with endogenous AAV amino acids can influence the ability of the engineered AAV capsid to transduce a cell and thus, without being bound by theory, can play a role in contributing to the cell-type specificity of the engineered capsids.

CNS n-mer inserts were generated as described elsewhere herein and then screened for transduction efficiency in non-human primates. Tables 2-3 show the top n-mer inserts. A general motif was observed across the very top hits (Table 3). The motif observed was P-motif having the formula amino acid sequence PX1QGTX2R, (SEQ ID NO: 317) wherein X1 and X2 are each selected from any amino acid. Exemplary n-mer insert variants containing a P-motif are shown in Table 3.

As shown in FIGS. 6A-6B shows a general schematic for selecting CNS specific capsid, which includes a benchmarking round which evaluates the performance of selected capsids against currently used capsids for, e.g., delivery to the CNS. Table 67 shows the selected capsids used in the benchmarking round of selection. Four variants developed using selection in mice and 8 using selection in NHPs were used for benchmarking. For benchmarking here, capsid variant specific barcodes were included with each variant. Viral particles for each capsid variant were produced individually and viral particles were then pooled. Such barcoding and pooling methodology is described in greater detail elsewhere herein and applied in this context. Pooled viral particles were then injected systemically (via I.V. administration) to the periphery of different mouse strains (C57BL/6J (“(C57”) and BALB/c (“BALBc”)) and non-human primates (Macaques) so that the ability for the capsid variants to cross the blood brain barrier in different species could be evaluated. Included in the benchmarking were both engineered capsid variants from mouse and non-human primate selection (rounds 1 and/or 2) and currently used capsid variants (AAV-CAP-B10, AAV-CAP-B22, and AAV-PhP.22). mRNA and DNA corresponding to the capsid variants in various tissues were then examined to determine the CNS, strain, and species specificity of the capsid variants.

TABLE 6
Capsid variantInsert sequenceSEQ ID NO:
Mouse variant 1RSVGSVY318
Mouse variant 2KTVGTVY319
Mouse variant 3WVLPSGG320
Mouse variant 4(DG)REQQKLW321
NHP variant 1PTQGTVR322
NHP variant 2PSQGTLR323
NHP variant 3PTQGTLR324
NHP variant 4RVDPSGL325
NHP variant 5VVSDYTV326
NHP variant 6TDALTTK327
NHP variant 7STIPTMK328
NHP variant 8PTQGTFR329

FIGS. 11A-11P show results from benchmarking the top selected capsids out of the second round of selection. In agreement with the literature, the AAV-CAP-B10, AAV-CAP22, and AAV-PhP.22 capsids demonstrated a species and strain preference, and importantly did not appear to perform well in non-human primates. Indeed, the NHP capsid variants developed using the methods described and benchmarked herein were successfully delivered to and expressed in one or more CNS tissues. Further, several NHP capsid variants tested here showed increased delivery to the CNS as compared to the capsid variants currently known and alleged to target the CNS and cross the blood brain barrier (AAV-CAP-B10, AAV-CAP22, and AAV-PhP.22). Further, most of the NHP variants were not observed to have strong liver delivery or expression (see e.g., FIGS. 11O and 11P). Expression in the dorsal root ganglion can lead to significant toxicity. Several NHP variants showed reduced or negligible delivery and/or expression in the dorsal root ganglion (DRG) (see e.g., FIG. 11N).

Directed capsid evolution and benchmarking is previously described in e.g., Examples 1-6. This Example demonstrates optimized capsid inserts specific for CNS in NHPs. Briefly, for these selections a library was screened with a fixed RGD motif (XXXRGDXXXX, where X is any amino acid), as well as a library containing a fixed P-family motif (XXXPXQGTXR (SEQ ID No: 1), where X is any amino acid) in non-human primates and identified the variants that were specific for only the CNS in NHPs. Table 7 provides the resulting top n-mer inserts and/or P motifs specific for CNS.

TABLE 7
Optimized CNS Capsid n-mer inserts and/or
P-motifs
Capsid Insert VariantSEQ ID NO:
EVGPTQGTVR332
DYEPSQGTMR333
SVSPGQGTYR334
AVTPIQGTIR335
ENVPMQGTVR336
AASPPQGTMR337
DQRPGQGTIR338
NVSPQQGTMR339
IPIPNQGTIR340
STVPAQGTMR341
STVPSQGTVR342
SPIPSQGTLR343
TTMPSQGTIR344
SVMPAQGTLR345
SIVPVQGTVR346
SVTPSQGTLR347
AVGPSQGTIR348
SINPSQGTIR349
AINPTQGTLR350
ALLPNQGTVR351
ASMPQQGTIR352
SNAPAQGTMR353
LNVPVQGTVR354
QVTPTQGTVR355
LVSPAQGTMR356
AVTPSQGTIR357
VAGPSQGTLR358
EKLPSQGTLR359
SISPLQGTVR360
ESRPLQGTYR361
NANPGQGTVR362
IPLPSQGTVR363
MPMPNQGTVR364
ETRPDQGTVR365
TEKPMQGTER366
GDDPLQGTSR367
SISPGQGTLR368
EMNPLQGTVR369
SAEPGQGTTR370
MNVPSQGTDR371
ITSPTQGTNR372
MLEPTQGTPR373
LEPPTQGTGR374
QNEPRQGTDR375
SMTPVQGTVR376
SRAPDQGTIR377
NTQPIQGTTR378
LSVPLQGTIR379
SEAPGQGTVR380
GREPGQGTYR381
IASPVQGTPR382
SAIPPQGTSR383
GMLPEQGTPR384
WDDPHQGTMR385
AIGPGQGTMR386
ASVPQQGTVR387
SVQPGQGTYR388
NAGPSQGTLR389
MKVPEQGTMR390
TTIPEQGTYR391
SAIPGQGTTR392
DNGPRQGTLR393
TLPPVQGTMR394
NTSPMQGTQR395
ETSPSQGTYR396
SATPAQGTVR397
MATPMQGTFR398
MNVPTQGTVR399
SVLPEQGTMR400
STTPIQGTMR401
NSDPQQGTVR402
SNTPLQGTTR403
SDAPQQGTLR404
SNAPIQGTMR405
NANPGQGTMR406
ESMPVQGTHR407
VERPLQGTMR408
SVSPTQGTMR409
VVAPLQGTDR410
SVTPLQGTIR411
VPNPVQGTPR412
GIWPGQGTGR413
LPTPIQGTLR414
ANEPRQGTVR415
TAFPTQGTMR416
SSAPNQGTMR417
MESPVQGTTR418
CTAPGQGTDR419
VTNPTQGTYR420
SNAPIQGTFR421
QSTPGQGTLR422
LVKPPQGTDR423
AAGPMQGTNR424
SSSPNQGTFR425
QESPLQGTVR426
AASPTQGTLR427
FTAPDQGTGR428
DNVPNQGTIR429
YSMPTQGTVR430
SSIPGQGTAR431
VTIPAQGTIR432
AHMPSQGTDR433
YVTPPQGTLR434
DGNPAQGTGR435
LQNPSQGTSR436
LQGPVQGTLR437
STNPAQGTLR438
LPTPIQGTMR439
SVAPTQGTVR440
QPSPMQGTVR441
MTQPSQGTIR442
SAEPNQGTTR443
TDTPSQGTVR444
LQQPLQGTTR445
NTHPAQGTVR446
VSAPMQGTMR447
SEKPAQGTYR448
TSLPTQGTLR449
SERPVQGTFR450
VLEPSQGTSR451
ANAPIQGTIR452
NVSPIQGTMR453
SVLPEQGTMR454
NDRPLQGTMR455
VVPPGQGTLR456
EPSPNQGTSR457
VLLPSQGTVR458
GSFPQQGTLR459
NTIPVQGTQR460
AASPPQGTLR461
TAVPSQGTHR462
YESPVQGTVR463
AALPSQGTLR464
SRIPDQGTIR465
MPRPDQGTMR466
ISTPTQGTLR467
VDIPMQGTLR468
NMLPTQGTIR469
TVLPGQGTIR470
TTDPVQGTVR471
SVTPVQGTSR472
FPLPSQGTVR473
EMAPNQGTSR474
VDRPSQGTMR475
NTEPPQGTDR476
MAMPPQGTLR477
ATLPSQGTLR478
LAIPPQGTSR479
TSGPVQGTFR480
SSGPGQGTDR481
MVTPGQGTMR482
EGTPVQGTTR483
MPIPSQGTPR484
NPLPTQGTSR485
SHFPPQGTNR486
AVTPTQGTIR487
ESGPSQGTSR488
MTVPSQGTFR489
EAYPTQGTIR490
SSTPAQGTFR491
DSRPLQGTIR492
DNAPLQGTNR493
QPIPPQGTMR494
ASSPTQGTER495
NVSPSQGTVR496
IHLPAQGTVR497
SSVPAQGTQR498
TTGPNQGTLR499
ATGPTQGTLR500
AATPGQGTYR501
AAVPTQGTVR502
EGKPEQGTTR503
SIAPTQGTIR504
DVRPSQGTIR505
MLRPEQGTDR506
TVSPTQGTTR507
KERPEQGTMR508
DSSPNQGTYR509
SLAPMQGTTR510
ELHPTQGTSR511
ETGPMQGTVR512
VLAPVQGTQR513
KEAPDQGTGR514
ASEPSQGTQR515
IYGPNQGTLR516
TERPVQGTFR517
GATPLQGTLR518
LAGPMQGTIR519
EVRPIQGTVR520
MANPIQGTVR521
TREPQQGTFR522
MKDPIQGTYR523
LSEPPQGTLR524
LMEPRQGTVR525
VASPMQGTSR526
QTFPNQGTMR527
MNRPTQGTER528
EVPPSQGTLR529
VTGPPQGTYR530
SHVPAQGTMR531
MVMPVQGTVR532
SDKPVQGTMR533
SVAPTQGTIR534
GTTPDQGTMR535
GSEPNQGTYR536
GPMPIQGTLR537
NQMPMQGTAR538
GNTPVQGTVR539
SANPLQGTIR540
MSFPSQGTHR541
NPDPIQGTIR542
EMNPVQGTNR543
TVLPNQGTVR544
VIQPVQGTVR545
QTFPEQGTMR546
VLTPSQGTTR547
NNGPMQGTVR548
VETPNQGTHR549
SLVPNQGTVR550
DSAPHQGTYR551
DHGPSQGTSR552
AAMPGQGTVR553
ISSPGQGTDR554
NVSPSQGTLR555
SSIPIQGTSR556
GSVPGQGTTR557
MREPSQGTSR558
TDQPSQGTVR559
MMQPVQGTSR560
SFQPGQGTLR561
MNAPSQGTTR562
NEVPTQGTAR563
VSGPEQGTSR564
GYEPAQGTMR565
SLIPDQGTIR566
DYGPSQGTVR567
TELPMQGTVR568
SYMPLQGTVR569
DYKPNQGTVR570
DTKPNQGTVR571
MNTPAQGTLR572
VMNPEQGTAR573
EILPGQGTLR574
MPSPAQGTIR575
NVIPEQGTNR576
QMEPHQGTTR577
FVVPDQGTNR578
ENNPGQGTTR579
NWKPEQGTDR580
SVSPNQGTIR581
DPSPLQGTDR582

This Example compares the transduction and vector genome distribution of the top hit (EVGPTQGTVR (SEQ ID NO: 332, Table 7) from the screen discussed in Example 8 and AAV9.

FIGS. 12A-12C show a comparison of transduction between the EVGPTQGTVR (SEQ ID NO: 332, Table 7) capsid insert variant with AAV9 in NHP tissues. FIG. 12A shows transgene expression from the engineered EVG capsid containing the EVGPTQGTVR (SEQ ID NO: 332) capsid insert variant compared to AAV9 in the primate cerebrum. FIG. 12B shows transgene expression from the engineered EVG capsid containing the EVGPTQGTVR (SEQ ID NO: 332) capsid insert variant compared to AAV9 in the primate nervous system. FIG. 12C shows transgene expression from the engineered EVG capsid containing the EVGPTQGTVR (SEQ ID NO: 332) capsid insert variant compared to AAV9 in various primate muscles and organs.

FIGS. 13A-13C show a comparison of the vector genome biodistribution between the EVGPTQGTVR (SEQ ID NO: 332) capsid insert variant with AAV9 in NHP tissues. FIG. 13A shows the vector genome biodistribution from the engineered EVG capsid containing the EVGPTQGTVR (SEQ ID NO: 332) capsid insert variant compared to AAV9 in the primate cerebrum. FIG. 13B shows the vector genome biodistribution from the engineered EVG capsid containing the EVGPTQGTVR (SEQ ID NO: 332) capsid insert variant compared to AAV9 in the primate nervous system. FIG. 13C shows the vector genome biodistribution from the engineered EVG capsid containing the EVGPTQGTVR (SEQ ID NO: 332) capsid insert variant compared to AAV9 in various primate muscles and organs.

Recombinant adeno-associated virus (rAAV) vectors are the vehicle of choice for gene therapy applications in the central nervous system (CNS) due to their low immunogenicity and ability to facilitate long-term gene expression in both dividing and non-dividing cells.1-6 Clinical and preclinical studies of rAAV-based therapies with naturally occurring AAV serotypes have shown promise in the treatment of a variety of CNS disorders.1,5,7-11 However, the efficacy of rAAVs in transducing the CNS has been limited by the protective effect of the blood-brain barrier (BBB) and the broad tissue tropism of naturally occurring AAV serotypes, which together result in inefficient transduction of target cell populations in the CNS.1,2,12 Direct administration of rAAVs into the CNS, such as via intrathecal, intracisternal, or intraparenchymal injection, is a commonly employed strategy to bypass the BBB.1,2,5,13 However, these delivery routes generally do not result in widespread and uniform transduction of the CNS and can be associated with considerable surgical risk.2,13

The discovery that the AAV9 serotype can cross the BBB has introduced the possibility of utilizing noninvasive systemic administration of rAAVs via the vascular system to facilitate widespread transduction across the CNS.1,2,13,14 Intravenous (IV) infusion has been employed in a number of clinical trials of CNS-targeted rAAV therapies1,11 and is the administration route of choice for an FDA-approved treatment for spinal muscular atrophy.7 However, systemic administration of naturally occurring AAV serotypes is complicated by sequestration of viral particles in the liver and the protective effect of the BBB, both of which limit rAAV bioavailability in the CNS.1,2,12,15,16 Achieving therapeutic efficacy in the CNS with systemic administration of rAAVs therefore requires large doses, sometimes exceeding 1E+14 vector genomes per kilogram body mass (vg/kg).1,2,5,13 In addition to posing significant manufacturing challenges, high dose rAAV therapy compounds the safety risk associated with an immune response in the liver, a phenomenon that has been observed in both clinical and preclinical studies.1,2,15,17-20

Engineering AAV capsids that display both enhanced transduction of the CNS and reduced transduction in peripheral organs following systemic administration will facilitate the development of CNS-targeted therapies with improved safety and efficacy at a reduced dose. Previous studies have successfully applied directed evolution techniques to generate novel AAV capsids with CNS-tropic properties in vivo,21-25 though translating these findings from mouse models to nonhuman primates (NHPs) has proved challenging and has complicated efforts to develop capsids with therapeutic potential in humans. A directed evolution strategy performed in Cre-transgenic C57BL/6J mice using the CREATE (Cre recombination-based AAV targeted evolution) method yielded potent CNS-tropic variants such as PHP.B and PHP.eB.21,22 However, the CNS-tropic properties of these variants translate poorly even to other mouse strains, and studies assessing intravenous administration of PHP.B in marmosets found that it failed to outperform AAV9 in CNS transduction.26-29 These findings cast doubt on the applicability of such vectors to human gene therapy and highlight the need to evaluate novel capsids in NHPs.

Recent studies have attempted to find less strain-specific CNS-tropic capsids using a multiplexed CREATE strategy in which directed evolution is performed across multiple mouse strains.23 Two PHP.eB-related variants identified in these efforts, AAV.CAP-B10 and AAV.CAP-B22, were later found to have improved CNS transduction in the marmoset brain compared to AAV9.25 Though variants demonstrating efficacy in marmosets likely hold greater therapeutic potential than those only capable of transducing the mouse brain, marmosets are much smaller and more evolutionarily distant from humans than are other common NHP models such as macaques. Given that positive results for certain engineered rAAVs in mice do not necessarily translate to NHPs25,26,29 and the extensibility of transduction data in marmosets to other NHPs is unknown, it is of utmost importance to assess the performance of novel rAAVs in appropriate animal models in order to identify candidate vectors for human gene therapy applications.

The unrealized potential of systemically administered rAAVs with CNS-tropic engineered capsids combined with the challenges in translating these capsids to NHPs serve as motivation for this work. In contrast to previous attempts to identify engineered capsids with therapeutic potential in the CNS, which typically involve selecting CNS-tropic capsids in mice, in this Example Applicant used an mRNA-based directed evolution strategy in both mice and cynomolgus macaques. This Example identifies capsids that (i) retain CNS-tropic behavior across multiple animal models such that their properties may be conserved throughout the lineage; and (ii) have CNS-tropic behavior in NHP models that are the closest practical evolutionary neighbors to humans. In both cases, Applicant seeks to identify capsids with the highest degree of translational and therapeutic potential in humans.

In Vivo mRNA-Based Selection for CNS-Tropic AAVs.

Applicant developed AAV vectors with CNS-tropic properties in mice using the previously described in vivo directed evolution strategy DELIVER (directed evolution of AAV capsids leveraging in vivo expression of transgene RNA).30 As the success of capsid variants in DELIVER is based on transgene mRNA expression, it preferentially selects for variants that are able to transcribe in addition to deliver genetic cargo. Applicant first generated AAV9-based capsid libraries with a random 7-mer peptide inserted in the VR-VIII hypervariable region between residues Q588 and A589, a location known to permit exposure of the peptide on the capsid surface.31,32 The capsid library construct was flanked by inverted terminal repeats (ITRs), thereby eliciting self-packaging of the cap gene; that is, each capsid variant encodes its own coding sequence as a transgene. To introduce selective pressure favoring capsid variants that preferentially transduce neurons, we placed the transgene under the control of the neuron-specific human synapsin 1 promoter (hSyn) (FIG. 14A).

Applicant performed two rounds of in vivo selection in parallel in C57BL6J and BALB/cJ mice and cynomolgus macaques using expression of transgene mRNA as the selection criteria. The first round of selection included a starting library of capsids with random 7-mer inserts. To create a library for our second round of selection, Applicant identified the top 30,000 most enriched capsid variants in the brain, drawing 10,000 high-scoring variants from mice and 20,000 from macaques. Applicant next introduced a synonymous codon control where each of the 30,000 top peptides were encoded both by their experimentally recovered DNA sequence and by a synonymous DNA codon sequence (FIG. 14B). Applicant also generated a complementary library where the two residues upstream of the 7-mer peptide insert were changed from AQ to DG, given that this is a modification thought to be responsible for the enhanced CNS-tropic properties of the engineered variant PHP.eB in mice.22

For the second round of selection in mice, we injected both the AQ and DG second-round libraries into separate sets of C57BL/6J and BALB/cJ mice. The identities of the most successful variants in mice differed depending on the prefix to the 7-mer insert (FIG. 15A-15B and Table 8). Of the variants with the wild-type AQ prefix, the four most enriched DNA sequences averaged across both mouse strains—corresponding to two pairs of synonymous peptide sequences—encoded two highly similar variants. These two variants; AQRSVGSVY (SEQ ID NO: 8587) and AQKTVGTVY (SEQ ID NO: 8588); are henceforth referred to as MDV1A and MDV1B (Mouse Double Valine), respectively (FIG. 15A), and are predicted to have similar but distinct secondary structure in the VR-VIII loop region (FIG. 15C). Of the variants with the modified PHP.eB-like DG prefix, the two most enriched DNA sequences are synonymous and encode the same peptide, DGREQQKLW (SEQ ID NO: 8589) (FIG. 17B). Applicant also recovered the sequences encoding PHP.B and PHP.eB from C57BL/6J but not BALB/cJ mice (FIG. 15B and Table 9), confirming previous findings that the CNS-tropic properties of PHP.B and PHP.eB are limited to C57BL/6J mice26-29 and further validating the DELIVER selection strategy.

Applicant chose MDV1A for further characterization in mice based on its superior performance in both the C57BL/6J and BALB/cJ strains. Applicant injected adult C57BL/6J and BALB/cJ mice of both sexes with 1E+12 vg of AAV9- or MDV1A-CMV-EGFP. Two weeks after administration of the vector, Applicant assessed vector genome delivery and transgene expression in the brain and spinal cord. MDV1A significantly outperformed AAV9 in both transgene delivery and expression in the brain of all groups of mice, demonstrating between a 25-fold and 160-fold improvement in transgene expression in the brain of male BALB/cJ and female C57BL/6J mice, respectively (FIG. 15D-15E). In the spinal cord, MDV1A significantly outperformed AAV9 in transgene delivery and expression in three of the four groups of mice, with between a 43-fold and 99-fold improvement in transgene expression in male BALB/cJ and female BALB/cJ mice, respectively (FIG. 15D-15E). In the spinal cord of female C57BL6J mice, there was a 24-fold performance difference between the two vectors that did not reach the threshold of statistical significance due to high variability in the data (FIG. 15D-15E). Immunostaining of sagittal mouse brain sections revealed greater EGFP expression from MDV1A than AAV9, with relatively uniform distribution of EGFP throughout the brain (FIG. 15F).

In order to select for variants with CNS-tropic activity in primates, Applicant also performed a second round of selection in three cynomolgus macaques. Applicant used the same AQ library as in the second round of selection in mice, which included variants identified in the first round in both mice and macaques. Applicant found that the variants most enriched in the macaque brain differed greatly from those identified in mice (FIG. 18A-18B). Both with and without correcting for the synonymous DNA codons, the ten most enriched variants across the entire macaque CNS were dominated by a motif typified by a proline in position 1, the string QGT in positions 3-5, and an arginine in position 7. Applicant also identified variants enriched in specific regions such as the cerebellum and spinal cord, though unlike in the CNS-wide results, these tissue-specific analyses did not converge on a single dominant motif (FIG. 19A-19B).

Applicant sought to more systematically identify sets of common motifs by performing k-medoids clustering on the top 1000 macaque variants using a dissimilarity metric based on pairwise substitution scores between 7-mer peptides. The cluster represented by the medoid sequence PTQGTLR (SEQ ID NO: 206) contained 19 variants, including 9 ranked in the top 100 sequences and 6 ranked in the top 10 (FIG. 16C). Many variants in this cluster—including most of the highest-performing variants—are broadly described by the motif PX1QGTX2R (SEQ ID NO: 317), where X1 is a polar uncharged residue and X2 is a nonpolar residue. Applicant defines the canonical Proline Arginine Loop (PAL) family of variants based on this motif, though more divergent PAL-like variants within the same cluster may share structural and functional properties with the canonical PAL variants. Computational modeling of the VR-VIII loop with the 7-mer insert predicted that canonical PAL variants share a nearly identical backbone conformation. However, even single-residue deviations from this core motif, such as the introduction of a proline at the third position in the sixth-ranked sequence PTPGTLR (SEQ ID NO: 4593), may considerably alter the backbone conformation (FIG. 16D). k-medoids identified a number of additional clusters containing high-performing variants with conserved structural properties (FIG. 19C), but many variants were sorted into singleton clusters or small clusters with only two or three variants (Table 9).

TABLE 8
Mouse AQ
SEQSEQ
IDIDCombinedBALB/cJC57BL/6J
RankPeptideNO:Encoding SequenceNO:scorescorescore
1RSVGSVY583CGGAGTGTTGGGAGTGTGTAT584460002400022000
2KTVGTVY585AAGACTGTGGGTACTGTTTAT586459802400021980
3KTVGTVY587AAAACCGTCGGCACAGTGTAC588451812400021181
4RSVGSVY589CGATCCGTCGGAAGCGTTTAC590449942200022994
5RYLGDAS591CGTTACTTAGGAGACGCCTCT592405922224518347
6WVLPSGG593TGGGTGCTACCATCTGGCGGC594375972009417504
7WVLPSGG595TGGGTTCTGCCTAGTGGTGGG596341401843315707
8VTVGSIY597GTAACAGTGGGCAGCATCTAC598329681981813150
9VRGSSIL599GTGCGTGGGTCGTCGATTCTT600323301945812872
10RYLGDAS601CGGTATTTGGGGGATGCTTCG602323291992112408
11VIQAMKL603GTGATTCAGGCTATGAAGTTG604321271783714290
12LTYGMAQ605CTGACTTATGGTATGGCTCAG606319561315218805
13LRIGLSQ607CTTCGGATTGGGCTGTCGCAG608317101328718423
14RYSGDAS609CGGTACTCAGGAGACGCTTCT61031198240007198
15RHHGDAA611CGGCATCATGGTGATGCGGCG612304061650113905
16VNYSVAL613GTGAACTACAGTGTCGCTCTA614299691940810561
17RHIADAS615CGTCATATTGCTGATGCTAGT616295541931010244
18RYLGDAT617CGGTATTTGGGGGATGCTACG618295271676112767
19QRVGFAQ619CAACGAGTCGGGTTCGCACAA62029454920320251
20RYSGDSV621AGGTACTCAGGCGACTCAGTC622289601767511286
21RHIADAS623AGACACATAGCGGACGCGTCG624285951526613329
22IAHGYST625ATTGCTCATGGGTATTCGACT626282161636411851
23VNYSVAL627GTTAATTATTCGGTGGCGCTT62827811195868225
24WTLESGH629TGGACCTTAGAAAGCGGGCAC630274711005517416
25RYSGDSV631CGTTATTCGGGGGATTCGGTT632274101491612493
26WTLESGH633TGGACTCTGGAGTCTGGTCAT634271941118916004
27VTVGSIY635GTTACTGTTGGGTCTATTTAT636270691697110098
28RYLGDAT637CGATACCTAGGTGACGCAACC638264941531211182
29RYSGDAS639AGGTATTCGGGTGATGCGAGT64025873203515522
30GDYSMIV641GGAGACTACTCTATGATAGTC642248471147513372
31GENSARW643GGGGAAAACTCTGCCAGATGG644244471336911078
32LAVGQKW645TTGGCGGTGGGGCAGAAGTGG64624445154049040
33SLDKPFK647AGTTTGGATAAGCCTTTTAAG64824000024000
34TLAVPFK649ACTTTGGCGGTGCCTTTTAAG65024000024000
35SLDKPFK651AGCTTAGACAAACCATTCAAA65224000024000
36RHHGDAA653AGACACCACGGGGACGCCGCA654239981380810190
37VKLGYSQ655GTGAAGCTTGGGTATTCGCAG656239121191411998
38GDYSMIV657GGGGATTATTCGATGATTGTG658236211246511156
39VRGSSIL659GTAAGAGGTTCCAGCATCCTA66023580162947286
40EAGSARW661GAAGCAGGTTCCGCTCGATGG66223477825715220
41EAGSARW663GAGGCGGGGAGTGCGCGGTGG664226921013312559
42QRVGFAQ665CAGAGGGTGGGTTTTGCGCAG66622222800314219
43WAISDGY667TGGGCAATCTCTGACGGCTAC66821970762514345
44LTYGMAQ669CTTACGTACGGGATGGCACAA67021785710314682
45RGPGLSQ671CGAGGCCCAGGCCTTAGCCAA67221738148816857
46SVSKPFL673AGTGTGAGTAAGCCTTTTTTG67421352021352
47LRIGLSQ675CTACGCATAGGCCTAAGCCAA67619761502314738
48RGPGLSQ677CGGGGTCCTGGGCTGTCTCAG67819667123077360
49IAHGYST679ATCGCCCACGGATACAGCACA68019631130476584
50RYVGESS681AGGTATGTGGGGGAGTCTTCG68219630891710713
51VIQAMKL683GTTATCCAAGCGATGAAACTA68419442120147428
52GPTMLFK685GGCCCAACAATGTTATTCAAA68619036019036
53GENSARW687GGTGAGAATAGTGCTCGGTGG68816736624410493
54LAVGQKW689CTCGCTGTCGGACAAAAATGG6901588693406546
55FTLTTPK691TTTACGTTGACGACGCCTAAG6921560725415352
56EDLLRLR693GAGGATCTTTTGCGTCTTAGG6941492094365484
57LNYSVSL695CTGAACTACAGTGTATCCCTA69614608109333675
58WAISDGY697TGGGCGATTAGTGATGGGTAT6981434243869955
59GPTMLFK699GGTCCGACGATGTTGTTTAAG70014140014140
60RYVGESS701AGATACGTAGGTGAAAGTTCT7021318575915594
61PIIEHAV703CCGATTATTGAGCATGCGGTG70412837239410443
62SLSTPFR705TCTCTTTCTACGCCTTTTCGT70612297012297
63VKLGYSQ707GTTAAATTGGGTTACTCCCAA7081117060815089
64LNYSVSL709TTGAATTATTCTGTGAGTTTG7101113272433889
65LGTYELD711CTTGGGACTTATGAGCTTGAT71211116011116
66PIIEHAV713CCCATAATAGAACACGCAGTA7141110020009100
67RYISDSA715CGATACATAAGTGACTCCGCT7161078820848703
68WSTSSGF717TGGAGTACATCATCGGGATTC718106146289985
69WSLGSGH719TGGTCACTAGGAAGTGGTCAC7201049812359263
70WSQSSGY721TGGAGTCAGTCTAGTGGTTAT722102587289529
71EDLLRLR723GAAGACTTGCTGAGACTGCGA724975464263328
72TEKLPFR725ACTGAGAAGCTGCCTTTTCGG726950109501
73IMLGYST727ATTATGTTGGGGTATTCGACT72894046858718
74SLSTPFR729AGCCTCAGCACCCCCTTCCGC730922209222
75ASNPGRW731GCGAGTAACCCTGGAAGGTGG732917324356738
76WSLGSGH733TGGTCGTTGGGTTCTGGGCAT734876108761
77NLIKPFL735AACCTTATAAAACCGTTCCTC736870708707
78HVENWHI737CATGTGGAGAATTGGCATATT73886806348046
79WSQSSGY739TGGTCCCAAAGCTCTGGGTAC74085605907970
80WSTSSGF741TGGTCGACTAGTAGTGGTTTT742826808268
81GKSPGVW743GGCAAATCCCCTGGAGTATGG74476857236448
82TEKLPFR745ACGGAAAAACTTCCGTTCAGG746724007240
83SLVTSST747TCGCTTGTTACTTCTAGTACG748678227824000
84LLYGYSS749CTTCTGTATGGTTATTCGAGT75067793346444
85VAGSSIL751GTTGCGGGTTCGTCGATTCTG752652006520
86GLNERVA753GGTCTGAATGAGCGTGTGGCG754641020004410
87LGTYELD755TTAGGCACATACGAATTGGAC756639406394
88KNRRHSV757AAAAACCGTCGGCACAGTGTA75863126190122
89YTLSQGW759TATACTTTGTCGCAGGGTTGG760631019124398
90SANPVVT761TCTGCGAATCCGGTTGTGACG762593747781159
91VAGSSIL763GTAGCTGGGAGCAGCATCTTG76456098944716
92NLIKPFL765AATTTGATTAAGCCTTTTCTT766559705597
93GGTSSGH767GGGGGTACGAGTAGTGGTCAT768546240921370
94VLESNPR769GTGCTTGAGTCGAATCCGCGG770539926172782
95WADSKDQ771TGGGCTGATAGTAAGGATCAG772537429862388
96VDHGGVV773GTGGATCATGGTGGTGTGGTT774534240001342
97ASTDSKT775GCTAGTACTGATTCTAAGACG776528552850
98KGASVTL777AAGGGGGCTAGTGTTACGCTT778523640001236
99SNVALTG779AGCAACGTTGCACTGACCGGC780523520003235
100VIASNEH781GTGATTGCTTCTAATGAGCAT782510935641545
101MSVGQSW783ATGTCGGTTGGGCAGTCGTGG784509810464052
102LSNGQGP785TTGAGTAATGGTCAGGGTCCT78650714076996
103PVTDSKM787CCAGTAACTGACTCAAAAATG788506816463422
104AADSSGR789GCGGCGGATAGTTCTGGGCGG79049954520475
105ANSHTNS791GCAAACAGTCACACCAACTCT792493535421393
106VGANAVA793GTTGGTGCTAATGCTGTTGCT794487233931479
107HVENWHI795CACGTTGAAAACTGGCACATC796486504864
108KVDQSLA797AAAGTAGACCAATCACTTGCA798479647960
109SGEALRL799TCAGGTGAAGCACTACGGCTA80047889713817
110VPSSTER801GTTCCGAGTTCTACTGAGCGG802478226642118
111VVQVNGR803GTCGTGCAAGTGAACGGACGC80447384415323
112ASNPGRW805GCTTCGAATCCGGGTCGGTGG806472936781051
113HNGQVGV807CACAACGGACAAGTGGGAGTC808470527361968
114SLAITER809AGTTTGGCGATTACTGAGCGG81046644000664
115SSAGTSA811AGTTCGGCGGGTACTTCGGCG812463220002632
116SVDNRDS813AGTGTGGACAACAGAGACAGT814461426142000
117VGQTTTL815GTTGGCCAAACCACAACATTG81646124428184
118GQVQMTS817GGTCAGGTGCAGATGACTTCT81845723903669
119GMHVVQA819GGAATGCACGTCGTCCAAGCA820455404554
120VGVPLGR821GTTGGGGTTCCCCTAGGGAGA822454804548
121KESTLST823AAGGAGTCTACTCTGAGTACG824454115792962
122RETVGST825CGCGAAACCGTAGGCAGTACT82645375374000
123GNGSTSL827GGTAACGGAAGCACATCGCTA82845114384073
124VDSTISI829GGTAACGGAAGCACATCGCTA83044974391106
125ASTATIR831GCGTCTACTGCTACTATTCGG83244844205279
126RSHDSET833AGGAGTCATGATTCTGAGACT834447304473
127FGSQMGA835TTTGGGTCTCAGATGGGGGCG83644544303151
128SVTDVKL837TCTGTTACTGATGTTAAGCTT838444924492000
129SHVSDSK839AGCCACGTATCCGACTCCAAA840444604446
130KLAPDGT841AAACTTGCTCCCGACGGAACG84244294000429
131YLVGYQM843TACCTAGTGGGGTACCAAATG84444198993520
132QDKSTYK845CAGGATAAGTCGACGTATAAG846439624951901
133PGGESRG847CCCGGAGGCGAAAGCCGAGGC848439520002395
134LTGSVQL849CTTACGGGTTCGGTTCAGCTT850438618512535
135SGETLRL851AGTGGCGAAACCCTACGTCTC85243854000385
136PSVSTLS853CCTAGTGTTTCTACGCTTAGT854438023802000
137PGTVNTH855CCTGGTACTGTTAATACGCAT85643543544000
138PSQGMTS857CCATCCCAAGGAATGACATCC85843404163177
139AGVLNTL859GCGGGGGTGCTGAATACTTTG860432920002329
140KNHGVDP861AAAAACCACGGAGTAGACCCC86243154017298
141GLMTNAK863GGGCTGATGACGAATGCGAAG864430320002303
142MNGVHVL865ATGAACGGAGTCCACGTACTT866430118712430
143TDVHSTS867ACGGATGTGCATTCGACTTCG868429620002296
144IMLGYST869ATCATGCTGGGTTACTCTACG87042682843984
145PAEHYQA871CCGGCTGAGCATTATCAGGCT872425942590
146GQTLAES873GGGCAAACATTAGCGGAATCG87442459563289
147SSVQGIL875TCGAGTGTTCAGGGGATTCTG876423222322000
148ALSQIEV877GCGCTGTCCCAAATAGAAGTC87842306863544
149VTTVTPV879GTCACGACTGTGACCCCCGTT880422316222601
150ATVTGAD881GCTACGGTGACCGGAGCAGAC882420742070
151VSGGDYS883GTCTCTGGAGGCGACTACTCA884420226391563
152KSQSEDV885AAGAGTCAGTCTGAGGATGTG886420220002202
153LLYGYSS887CTCCTCTACGGATACTCTTCA888419804198
154MVQSGLT889ATGGTTCAGTCGGGGTTGACG890419820002198
155EQYLGSP891GAGCAGTATCTGGGTTCTCCG892419620002196
156SGANLSN893TCGGGGGCTAACCTCTCGAAC89441813761420
157ALVQNGV895GCTTTGGTGCAGAATGGTGTT896416527241441
158SDQQKVW897AGTGATCAGCAGAAGGTTTGG89841653760405
159NGSDTPK899AACGGAAGCGACACACCGAAA90041634493714
160SDLTSYV901TCCGACCTCACCAGTTACGTT902416025531607
161GAADRQI903GGTGCGGCTGATAGGCAGATT904414304143
162AVGHVSG905GCCGTAGGACACGTTTCTGGT906414320002143
163RYISDSA907AGGTATATTTCGGATTCTGCG90841423503639
164AQGTISR909GCGCAGGGGACGATTTCGCGT910413625501586
165MANMLSD911ATGGCGAACATGTTATCTGAC91241322363896
166VMRDKDE913GTGATGCGTGATAAGGATGAG914412620002126
167GGKGEGP915GGTGGGAAGGGTGAGGGTCCG916410110323069
168SDVVVTH917TCTGATGTTGTTGTTACTCAT91840883762326
169GVLTTVT919GGGGTTCTTACTACGGTGACT920408604086
170NDGPREQ921AATGATGGTCCGAGGGAGCAG92240433837206
171PTQGVSM923CCAACCCAAGGAGTTTCGATG924403517352300
172NMGVVQL925AACATGGGCGTCGTGCAATTG926403204032
173AGGGDPR927GCGGGGGGTGGGGATCCGAGG928403127801250
174AGVVNAL929GCGGGGGTGGTGAATGCTTTG930401518702145
175ANPVGNV931GCGAATCCTGTTGGGAATGTT932400024831517
176VQGTQTG933GTGCAGGGTACGCAGACTGGT934400020002000
177SPGFSIA935TCTCCTGGATTCAGCATCGCT936400020002000
178AIERLTV937GCAATCGAAAGACTAACCGTT938400020002000
179TDKQNAF939ACGGACAAACAAAACGCATTC940400020002000
180LADSKDR941CTGGCAGACTCGAAAGACAGG942400020002000
181VHSTGEW943GTACACAGCACAGGCGAATGG944400020002000
182LHNALAV945CTGCATAATGCTCTGGCTGTT946400040000
183IGLDPKA947ATTGGTTTGGATCCGAAGGCG94839823549433
184VMASTGP949GTTATGGCTTCGACTGGTCCT95039803654326
185SVPGTVS951AGTGTGCCGGGGACTGTGTCT952397028041166
186MLSNGQV953ATGCTGTCTAATGGGCAGGTT954396539650
187ASTATLR955GCGTCTACTGCTACTCTTCGG956396320821880
188PGEHYQG957CCGGGTGAGCATTATCAGGGT958396319632000
189QVTDNKT959CAGGTGACTGATAATAAGACT960395722011756
190NGLQVSI961AACGGACTACAAGTGTCTATC962395639560
191SHPGNEL963AGCCACCCCGGCAACGAACTC96439523764188
192VSLNGGH965GTGTCGCTTAATGGGGGGCAT966395019502000
193MVASSID967ATGGTGGCTTCATCCATAGAC968393820001938
194MGVNTTI969ATGGGGGTGAATACGACTATT97039363388548
195MNGGHLM971ATGAATGGGGGTCATCTTATG97239343406528
196LGSDGRT973CTTGGCTCAGACGGCCGAACC974393113842547
197RVDTPQL975CGAGTCGACACACCACAATTG976392615662360
198TEQAKLS977ACTGAACAAGCCAAACTATCT978392519252000
199IGTNSTY979ATTGGTACGAATAGTACGTAT9803903384756
200LSESANR981TTGTCGGAGAGTGCGAATCGT982389128101082
201GQSSNQH983GGGCAGTCGTCTAATCAGCAT984385514782377
202MRSEQTT985ATGCGTAGTGAGCAGACGACG986385420001854
203AGISTQT987GCGGGGATTAGTACTCAGACG988383517832052
204TGESNVG989ACTGGGGAGAGTAATGTTGGT990382020001820
205ALANVSN991GCGCTTGCCAACGTTTCCAAC992380003800
206SFGMVVD993TCGTTCGGCATGGTAGTCGAC994380003800
207TMSHAEL995ACCATGTCGCACGCAGAATTA996379217922000
208LSNMVSA997TTGAGTAATATGGTGAGTGCT99837784503328
209HGTLVSR999CATGGGACTTTGGTGTCTCGG1000376317632000
210VAVTGAI1001GTTGCGGTGACTGGTGCTATT1002375619521803
211IMVDAHA1003ATTATGGTTGATGCTCATGCG1004375337530
212TPTLPFI1005ACGCCTACGTTGCCTTTTATT1006375103751
213GIAGLGI1007GGTATTGCTGGGCTTGGGATT1008375014382312
214SSLPDKT1009TCATCCCTACCGGACAAAACC1010374603746
215QSQTALR1011CAATCCCAAACAGCATTGCGA1012374423621382
216AVGNELL1013GCCGTTGGCAACGAACTGCTG1014374417442000
217GSGAGVA1015GGGAGCGGCGCCGGTGTAGCC10163744366381
218ESGLINV1017GAGTCGGGTCTGATTAATGTG1018373253727
219PNAGFDR1019CCGAACGCGGGGTTCGACCGT1020372017202000
220LNQGLGD1021CTGAATCAGGGGTTGGGGGAT1022371620021714
221ALASVGV1023GCGTTGGCATCCGTGGGTGTC1024371514892226
222SEGPSRY1025TCGGAGGGTCCTTCGCGTTAT1026371022011509
223SYGDGGV1027TCGTATGGTGATGGTGGTGTT102837082913795
224LGHNSGV1029TTGGGGCATAATTCTGGTGTT1030369614062290
225SSPNVGP1031TCTTCTCCGAACGTCGGTCCT1032368826001088
226VSGTSTH1033GTTTCGGGTACTTCTACGCAT103436873471216
227SEGGNNR1035AGTGAGGGTGGGAATAATCGG1036368023791302
228SGASLSN1037TCCGGAGCATCCCTTTCCAAC1038367203672
229SNGVPSS1039AGCAACGGAGTACCGTCATCG1040365916182041
230ESGTHLS1041GAGTCTGGGACTCATTTGTCG104236552745909
231VTVQVQR1043GTTACGGTGCAGGTGCAGAGG1044365236520
232ASESTPR1045GCTAGTGAGTCTACGCCGCGT1046365116651986
233LVTFRAD1047CTTGTTACTTTTCGTGCGGAT1048364626331013
234LTQMSNK1049CTGACTCAGATGAGTAATAAG105036402963677
235AVEGSRL1051GCGGTGGAGGGTTCGAGGCTG1052363520081627
236PNERINV1053CCGAATGAGAGGATTAATGTG1054362620031623
237GDHDRGS1055GGAGACCACGACAGGGGCTCG1056361736170
238RHQVSES1057CGTCATCAGGTTAGTGAGAGT105836102863747
239LDGLNLH1059CTAGACGGCTTGAACCTCCAC1060360413302274
240PGNGTLV1061CCGGGGAATGGGACGTTGGTT1062359822381360
241DSYGGNA1063GACTCGTACGGGGGGAACGCC10643596357026
242LHKGSES1065CTTCATAAGGGTAGTGAGAGT106635932662931
243SVDIVKL1067TCGGTTGATATTGTGAAGCTT106835906262964
244HTLSTGV1069CATACGCTGAGTACTGGGGTG1070358820001588
245SQINSGS1071TCCCAAATAAACTCTGGCAGC107235833046537
246ALSGLDK1073GCCCTGAGTGGGCTAGACAAA1074358235820
247RNSESEA1075CGAAACAGCGAATCGGAAGCG107635767392837
248PNERHTL1077CCTAACGAACGCCACACCTTG1078357520001575
249VNAGLGI1079GTAAACGCCGGCTTGGGCATC108035712987584
250PASGALT1081CCGGCTTCGGGTGCTCTTACT1082356712592308
251SAMVTSP1083TCAGCCATGGTTACCTCGCCA1084356410572507
252DSHVSGK1085GACTCACACGTCAGTGGAAAA1086355620821474
253SPQGALA1087TCGCCGCAGGGGGCTCTTGCT1088355303553
254GDNPAVA1089GGTGATAATCCTGCGGTGGCT109035473363184
255KEIHVSV1091AAAGAAATCCACGTTTCTGTG109235433345197
256VTTVSTV1093GTGACTACGGTTTCTACTGTG1094354214192124
257WTDGVSR1095TGGACTGATGGGGTGTCGCGG109635388672671
258AADSSAR1097GCGGCGGATAGTTCTGCGCGG109835353428107
259LDNRTMK1099CTTGATAATCGTACTATGAAG1100353143527
260DVADSKR1101GATGTTGCTGATTCTAAGCGT1102353020011529
261SVGGTIH1103TCGGTTGGTGGTACGATTCAT1104352220001522
262PLTAGVS1105CCTCTGACGGCGGGGGTGTCG1106352011812339
263AADISVR1107GCAGCAGACATATCAGTCCGC1108351835180
264YADSHTD1109TACGCCGACAGCCACACAGAC1110351635160
265VDVNLTR1111GTCGACGTAAACTTAACAAGA1112351216641848
266AIAEYQV1113GCGATTGCGGAGTATCAGGTG1114349722311266
267SRVDGSG1115TCGCGTGTTGATGGTTCGGGT1116349215731919
268HGDGVRV1117CACGGGGACGGAGTACGCGTC111834823345137
269HESRDHS1119CACGAATCGAGAGACCACAGT1120347720001477
270RYEQNTP1121AGGTACGAACAAAACACTCCC112234753022453
271ALASTQT1123GCGTTGGCGAGTACTCAGACG1124347522211254
272FSSERLP1125TTTTCGTCTGAGCGGCTTCCG1126346734670
273TTTHEGV1127ACGACGACTCATGAGGGGGTG1128346323591104
274RMDSAQL1129AGGATGGATTCGGCGCAGCTT1130345003450
275SHGPDSK1131TCTCATGGTCCTGATTCGAAG113234482653795
276SVATGVL1133TCGGTTGCGACGGGGGTTCTG113434473059388
277LLASGAK1135CTGCTTGCGAGTGGGGCTAAG1136344713152132
278ASLGAYS1137GCGTCGCTTGGGGCGTATTCG1138344524421003
279KELLVSA1139AAAGAACTCTTAGTAAGTGCA114034404253015
280SLGVAVA1141AGTTTAGGTGTCGCCGTCGCC1142344020001440
281VSGSISK1143GTTTCGGGGAGTATTTCTAAG114434293108322
282TTSGQTM1145ACCACTTCCGGTCAAACAATG114634273123304
283QTVGPLN1147CAAACAGTAGGACCGTTAAAC1148341013082102
284VNGNNTY1149GTAAACGGCAACAACACCTAC115034093122287
285VAEGGGV1151GTAGCCGAAGGCGGTGGCGTC115234083139269
286GSGENVR1153GGCTCTGGCGAAAACGTAAGG115434082977431
287AADSSMR1155GCGGCGGATAGTTCTATGCGG115634063189217
288LDGLNLH1157CTGGATGGGCTGAATCTTCAT115834025822820
289MAGALGP1159ATGGCAGGTGCACTGGGTCCC116034002501898
290GLNEHGA1161GGTCTGAATGAGCATGGGGCG116233902628762
291LLSSENR1163CTCTTGTCTTCTGAAAACCGG1164338923231066
292RDVSGHI1165AGGGATGTGTCGGGGCATATT1166338333830
293GDQAMVN1167GGGGATCAGGCTATGGTTAAT1168338003380
294AHVDVKV1169GCCCACGTAGACGTAAAAGTT117033795372842
295ALANSER1171GCTTTGGCGAATTCTGAGCGG1172337816611717
296KGSDTTI1173AAGGGTTCTGATACTACTATT117433782686692
297VAQGSVV1175GTGGCTCAGGGGTCGGTTGTT117633712834537
298GFEDGAR1177GGCTTCGAAGACGGTGCTCGA1178336017461614
299QADNHVR1179CAGGCGGATAATCATGTTAGG118033572683674
300RHADSTV1181CGTCATGCTGATTCTACGGTT118233553217138
301PMSQGEL1183CCGATGAGTCAGGGGGAGTTG1184335413542000
302GNSGGHV1185GGGAATAGTGGGGGTCATGTT118633502574776
303RNQAEEM1187AGAAACCAAGCTGAAGAAATG118833481233225
304LLSSENR1189CTTCTGTCGTCGGAGAATAGG119033465142831
305GFEGGTR1191GGGTTCGAAGGCGGCACTCGA1192334322971046
306GGGSESY1193GGAGGGGGGTCAGAATCATAC119433372984353
307EAASAIS1195GAAGCGGCATCAGCCATATCC1196332910542274
308LTTPIEL1197TTGACGACTCCGATTGAGTTG1198332810362292
309RTMSVML1199CGCACCATGTCTGTCATGCTG1200332614911835
310GIHETRA1201GGCATCCACGAAACACGGGCA1202332020001320
311SEGHSSY1203TCGGAGGGTCATTCGAGTTAT120433172459858
312SVSDVKH1205AGTGTCTCGGACGTCAAACAC120633083093215
313LSVSQSA1207CTCTCCGTCAGTCAATCTGCT1208329515111784
314VVKEYES1209GTAGTCAAAGAATACGAAAGC121032918592432
315RVGAEGT1211CGGGTTGGGGCGGAGGGGACG1212328132810
316QAGLGVI1213CAGGCTGGTCTTGGTGTTATT1214327803278
317RVHSTDT1215CGAGTCCACTCGACCGACACG1216327410792195
318VATESAF1217GTGGCAACCGAATCAGCATTC1218327032700
319SNGAGYL1219TCGAATGGGGGGGGTTATCTT122032693062963
320VGEGNKF1221GTGGGTGAGGGGAATAAGTTT122232688162451
321DVRGSVI1223GATGTGAGGGGTTCGGTTATT1224326311372126
322PLDGQGK1225CCGCTAGACGGCCAAGGCAAA1226325532550
323ASVSSQL1227GCCAGCGTCAGCTCACAACTC1228325411402114
324LAKEESH1229CTTGCTAAGGAGGAGTCGCAT123032527042548
325FTHGTGT1231TTCACGCACGGCACTGGGACG1232324120001241
326ASVSSQS1233GCCTCGGTCTCGAGTCAATCA1234323232320
327GVADNVK1235GGTGTCGCAGACAACGTCAAA123632262673553
328SAGVPGV1237TCGGCTGGAGTACCTGGAGTC1238322310052218
329SDSTVVG1239TCTGATAGTACTGTTGTGGGG1240321411192095
330FAGIAQA1241TTTGCGGGGATTGCGCAGGCG1242320903209
331QSDLGRV1243CAGTCGGATCTTGGGAGGGTG12443209312585
332PLQNNPH1245CCGCTTCAGAATAATCCGCAT1246320419461258
333PGTNSFS1247CCCGGAACCAACAGTTTCTCT124832016472554
334QEQGTST1249CAAGAACAAGGCACTTCGACG125031972891306
335RSEVNGV1251CGTTCAGAAGTAAACGGTGTC125231963006190
336LTDKMTS1253TTGACTGATAAGATGACGTCG1254319410642130
337GGTISGP1255GGGGGTACGATTAGTGGTCCT1256319031900
338ADGKGAI1257GCGGATGGGAAGGGTGCGATT125831883080108
339RTGDTIS1259CGGACGGGCGACACAATCAGT12603185309590
340PMSPGVA1261CCGATGTCTCCGGGGGTGGCT1262317831780
341SVMTDRP1263TCGGTGATGACCGACAGACCT126431728692303
342PTEGTLR1265CCTACTGAGGGGACGCTTCGG126631718942278
343ASGTGMT1267GCCTCAGGGACCGGCATGACG1268317111712000
344SANPVAR1269AGCGCTAACCCCGTAGCTCGG1270317019881182
345AAGVNLN1271GCGGCGGGTGTTAATCTGAAT1272317019151255
346TNQVITH1273ACAAACCAAGTAATAACTCAC127431692316853
347PLINGLV1275CCTTTGATTAATGGGCTGGTG127631682731437
348KSHSENN1277AAGTCGCATTCGGAGAATAAT127831683055114
349MNGGHVK1279ATGAATGGGGGTCATGTTAAG1280316517651400
350LAGTLVQ1281TTGGCAGGAACCCTAGTACAA128231632352811
351PLKGGGE1283CCTCTGAAGGGTGGTGGGGAG1284316131592
352VVNSSSS1285GTAGTGAACTCCTCTTCTTCC1286316131610
353SKADAYS1287TCGAAGGCTGATGCTTATAGT1288316031600
354MIGDVSP1289ATGATCGGAGACGTAAGTCCT1290315931590
355LDSSRFH1291CTGGATAGTTCGCGTTTTCAT129231562645512
356GLMSNAK1293GGACTCATGAGTAACGCAAAA1294315611761981
357PSVGMAT1295CCTTCAGTTGGCATGGCGACT129631542400755
358SSEGRNV1297AGTAGTGAGGGTCGTAATGTG129831522545608
359LQEQLAG1299CTTCAGGAGCAGCTTGCTGGG130031529302222
360VGVPLGR1301GTGGGTGTGCCGCTTGGTCGG1302314903149
361AGASAEA1303GCTGGGGCTAGTGCTGAGGCG1304314503145
362VNSSENK1305GTGAACAGCTCCGAAAACAAA1306314515401605
363INGRNDI1307ATAAACGGCCGGAACGACATC130831432564579
364VMASTGP1309GTAATGGCGTCAACAGGACCG1310314011402000
365AASEVYV1311GCTGCTTCTGAGGTTTATGTT131231372581556
366RNNVDST1313CGCAACAACGTAGACAGTACT131431362822314
367LDASKLV1315CTCGACGCATCCAAATTGGTT131631309962134
368AGDSSVR1317GCTGGTGACTCAAGTGTACGT131831252525600
369SGSNTGP1319TCGGGGTCTAATACGGGTCCT132031242969155
370GTLERTA1321GGTACTCTTGAGAGGACTGCT132231222898224
371KLTSEMT1323AAATTAACATCCGAAATGACC132431212467654
372DATTKSM1325GACGCCACAACTAAATCCATG1326312031200
373GLAGRVV1327GGGTTGGCGGGGCGTGTTGTT1328311703117
374AAGGIMN1329GCTGCCGGGGGGATCATGAAC1330311603116
375ISDYTTL1331ATATCAGACTACACAACACTT1332311603116
376VQHDLTL1333GTCCAACACGACCTTACCCTT133431162577540
377MAGSVSK1335ATGGCTGGTTCGGTATCAAAA1336311319281184
378SYGSDSK1337TCTTATGGTTCTGATTCGAAG133831098302279
379VLSSPGP1339GTTCTGTCTTCGCCTGGTCCT134031089812127
380VNSGQQN1341GTAAACAGCGGCCAACAAAAC134231062366739
381TSQGAIT1343ACGTCGCAAGGCGCAATAACC1344310615671539
382AIGHSQV1345GCGATTGGTCATAGTCAGGTT1346310631060
383QADIVGL1347CAGGCTGATATTGTTGGGCTG1348310511052000
384SKSNDSS1349AGTAAGTCTAATGATAGTTCT135031045732532
385LLAGADR1351TTGCTTGCTGGTGCTGATCGT135231002577522
386VYSDRTM1353GTCTACAGTGACCGCACCATG135430992930169
387SEGGVKY1355AGTGAGGGGGGTGTGAAGTAT135630992806293
388SSDGGKG1357TCAAGCGACGGCGGCAAAGGA1358309620001096
389LTHSTAD1359TTGACCCACTCCACAGCCGAC136030952372724
390PGGESRG1361CCTGGTGGGGAGTCTCGGGGG136230942629465
391PMNGSTR1363CCGATGAACGGGTCCACTAGG1364308920001089
392STDGGST1365TCGACTGATGGTGGTAGTACT1366308820001088
393EASGMNH1367GAAGCAAGCGGTATGAACCAC1368308620001086
394SGDKAAL1369AGTGGTGATAAGGCTGCGTTG1370308630860
395MLRGYSQ1371ATGTTACGCGGGTACTCGCAA137230842332852
396GGGVEVH1373GGGGGTGGTGTGGAGGTTCAT137430829142168
397NVIVNGV1375AATGTGATTGTGAATGGGGTG1376307910342044
398WNLDKTH1377TGGAATCTTGATAAGACTCAT137830758212254
399AQGGATV1379GCCCAAGGTGGCGCGACGGTA1380307303073
400PLINGLV1381CCACTTATCAACGGCTTAGTT138230736682405
401RVELTGT1383CGCGTAGAATTGACCGGCACG138430722886186
402GEGGTVR1385GGTGAGGGTGGTACTGTGAGG1386306816141454
403GAGELSS1387GGTGCGGGGGAGCTTAGTAGT1388306420001064
404LGKAVPD1389CTTGGTAAGGCGGTTCCGGAT1390306320001063
405WADTKDR1391TGGGCCGACACGAAAGACCGA139230632961102
406LAGLGGM1393CTAGCTGGCCTCGGTGGAATG139430612769292
407PVLAAGH1395CCAGTACTAGCGGCTGGGCAC139630569802076
408SDTIGLR1397AGTGACACCATAGGCCTCCGC139830562758298
409GVKETRA1399GGCGTCAAAGAAACCCGGGCC140030559532102
410AETMGVA1401GCAGAAACCATGGGTGTCGCC140230528952157
411SLTDRAS1403TCGTTGACGGATAGGGCGTCT1404305110512000
412RSNGSEN1405AGGTCGAATGGTTCGGAGAAT140630512853198
413PGSDGRT1407CCGGGTTCTGATGGTCGTACT140830512852199
414TRTEDYT1409ACTCGCACTGAAGACTACACC1410305130510
415PSVSVTL1411CCTTCTGTTTCTGTGACTTTG1412304910472002
416LSTGAEK1413CTCTCTACCGGCGCAGAAAAA1414304620001046
417SETNGVR1415AGCGAAACGAACGGCGTCCGG141630444082635
418SPQVYDD1417AGTCCGCAGGTTTATGATGAT1418304210422000
419EGGTHVR1419GAAGGTGGCACCCACGTACGG142030362865171
420STLGSTR1421TCGACGTTGGGTAGTACGAGG142230362829206
421SEGKVGP1423TCTGAGGGTAAGGTTGGGCCT1424303513181717
422AVKDELV1425GCTGTTAAGGATGAGCTGGTG1426303203032
423TPSGNLM1427ACGCCGTCGGGGAATCTTATG1428302920001029
424VMTGTLT1429GTGATGACGGGGACTTTGACT1430302803028
425QIASADI1431CAGATTGCGAGTGCGGATATT1432302703027
426LYGDGSV1433CTTTATGGTGATGGTAGTGTT1434302420001024
427PTQLQRV1435CCGACTCAGCTGCAGCGTGTG1436302303023
428SNVQDGL1437AGCAACGTCCAAGACGGACTA1438301403014
429YDAKVGH1439TACGACGCAAAAGTGGGGCAC14403014294271
430AVGLDNR1441GCTGTGGGTCTGGATAATCGT144230129082104
431NSNDVHM1443AACTCTAACGACGTCCACATG1444301103011
432PSGALMT1445CCGAGTGGTGCGCTGATGACT1446301010102000
433SQEQVSA1447AGTCAGGAGCAGGTGAGTGCT1448300919671042
434GSGSDGV1449GGGAGTGGTAGTGATGGGGTT1450300810082000
435VDTSDRV1451GTTGATACTAGTGATCGTGTT1452300730061
436ATMSPTT1453GCCACAATGTCCCCTACAACG1454300630060
437TPTLPFI1455ACACCCACCCTGCCATTCATA1456299902999
438AVKEYDS1457GCGGTGAAGGAGTATGATTCG145829992779220
439LSLPDGD1459CTCTCCTTACCAGACGGAGAC146029989182080
440AEKSGMV1461GCCGAAAAATCTGGTATGGTC146229972748249
441LNVDTGS1463CTCAACGTGGACACTGGTTCA146429892634355
442PTQGTPR1465CCAACGCAAGGAACACCGCGA1466298829880
443PVGTNAR1467CCTGTCGGCACAAACGCAAGA146829878992088
444NGGIITR1469AACGGAGGTATCATCACCCGC1470298312081775
445VVGTQDR1471GTTGTGGGGACTCAGGATAGG1472298002980
446NTLHTST1473AATACTCTTCATACTTCGACT147429802637342
447VGSLGTD1475GTAGGCTCGCTGGGTACTGAC1476297313881585
448LSTVTGQ1477CTTTCGACGGTGACTGGGCAG147829709801989
449VLTNTTT1479GTGCTTACGAATACTACTACT148029642680283
450SGDKAAL1481TCCGGCGACAAAGCCGCACTT148229592518441
451YQTETNN1483TATCAGACTGAGACGAATAAT148429572263694
452SVGLVAG1485AGTGTGGGTTTGGTGGCTGGT148629517842167
453RMTGDLT1487CGTATGACTGGAGACCTAACC148829512000951
454TSGNLTW1489ACTTCTGGTAATTTGACGTGG1490295002950
455ANVNVKV1491GCGAATGTTAATGTGAAGGTG1492294814941454
456VNVTMTT1493GTTAATGTGACGATGACTACG1494294717641182
457PGVSVTS1495CCTGGTGTGAGTGTGACTTCT1496294602946
458TGDRDQF1497ACCGGCGACAGAGACCAATTC149829422383560
459SKAEGPV1499AGTAAAGCCGAAGGACCTGTC150029422787154
460DSAPAAR1501GATTCGGCTCCGGCGGCTCGG150229412498443
461SRDDGRM1503TCGCGTGATGATGGGAGGATG150429392687252
462IPEGSVR1505ATCCCTGAAGGATCAGTACGA150629389382000
463VTTVSLV1507GTAACGACTGTGTCCCTAGTT150829332817116
464PIHGASS1509CCGATTCATGGTGCTAGTTCG151029322343589
465VSASISK1511GTTTCGGCGAGTATTTCTAAG151229319192012
466TKDNGVM1513ACTAAGGATAATGGTGTGATG1514293002930
467LGASVPK1515CTAGGCGCCTCCGTCCCCAAA151629297752154
468IEGLGGL1517ATAGAAGGCCTCGGAGGTTTG151829292705224
469VSGGDYS1519GTGAGTGGGGGTGATTATTCG1520292712926
470SEKQKVR1521AGTGAGAAGCAGAAGGTTAGG1522292702927
471PGRVSSE1523CCTGGTCGGGTCAGCTCAGAA152429279272000
472DRITLGT1525GATCGGATTACGTTGGGGACG1526292302923
473ANNGTTW1527GCTAATAATGGGACGACTTGG152829232636288
474TSVISQV1529ACGAGTGTTATTTCGCAGGTT1530292229220
475LANMMSV1531CTGGCTAATATGATGAGTGTT1532291929190
476EIVLTVP1533GAAATAGTGCTGACCGTCCCC1534291702917
477VHKDQEI1535GTGCATAAGGATCAGGAGATT153629172385532
478QVGDSTL1537CAAGTAGGAGACTCGACGTTA1538291729170
479SEKSVPL1539AGTGAGAAGAGTGTGCCGCTT154029032000903
480SNHDLTH1541TCCAACCACGACCTTACCCAC1542290229020
481NQLAEMV1543AACCAACTGGCTGAAATGGTG154428996612238
482QMTHGLI1545CAGATGACTCATGGGCTTATT1546289702897
483QVSDNKT1547CAAGTATCCGACAACAAAACC154828972622635
484GGTNSAH1549GGGGGTACGAATAGTGCTCAT155028973112587
485GAADRMQ1551GGGGCGGCGGATCGGATGCAG155228977112187
486ESGVHQK1553GAGTCTGGTGTTCATCAGAAG155428965112385
487GGTGALR1555GGCGGCACAGGGGCTCTCAGA1556289617761119
488NTLGVAY1557AACACATTAGGCGTGGCATAC155828952000895
489VQHSQDN1559GTGCAGCATTCGCAGGATAAT156028942775119
490EYNTRDK1561GAGTATAATACTCGGGATAAG156228948942000
491RSEVNGV1563CGGAGTGAGGTGAATGGGGTT1564288917681122
492GLAETRA1565GGGCTTGCTGAGACTAGGGCT156628852135750
493MNGGYVL1567ATGAACGGCGGATACGTACTT156828828112071
494TSGNAGL1569ACTTCTGGTAATGCTGGGCTT157028812315566
495NTTQTSW1571AACACTACACAAACGTCCTGG157228807252156
496SLAGGTP1573TCGTTGGCTGGTGGTACTCCT157428795952284
497MTGHDAV1575ATGACTGGGCATGATGCTGTG157628792746133
498DETRTHI1577GACGAAACCCGGACACACATA1578287828780
499DMLNNTA1579GACATGTTAAACAACACTGCT1580287202872
500THNENMF1581ACTCACAACGAAAACATGTTC1582287202872
501RNLDLTH1583CGTAACTTGGACTTAACGCAC1584287102871
502RDNVEST1585AGGGACAACGTGGAATCAACA1586286813571511
503NVVSLAT1587AATGTTGTGAGTCTGGCTACT1588286417071157
504REMGQNA1589CGTGAGATGGGGCAGAATGCT159028582349509
505AVVSAGP1591GCGGTGGTGTCTGCTGGGCCG1592285502855
506LTGISLV1593CTTACAGGAATATCTCTCGTT1594285314701383
507VLTVGSV1595GTACTCACGGTCGGGAGCGTG159628532682171
508HRDSAEP1597CACAGGGACTCGGCGGAACCC159828512000851
509VAALGMS1599GTTGCTGCTTTGGGTATGTCT1600285002850
510TTSENLM1601ACGACGTCGGAGAATCTTATG1602284828480
511YTAGSQA1603TACACTGCTGGCAGTCAAGCC1604284514051440
512LSTLLGA1605TTGAGTACGCTACTCGGAGCC1606284302843
513MSISEPR1607ATGAGTATCTCTGAACCCCGT1608284011101730
514VMDHKST1609GTCATGGACCACAAATCTACA1610284028400
515IKSDERL1611ATAAAAAGCGACGAACGTTTA1612284028400
516TTEKHTG1613ACCACGGAAAAACACACGGGC1614283902839
517ASPQGVR1615GCTAGTCCGCAGGGGGTTCGT161628382000839
518PGQHNQA1617CCGGGTCAGCATAATCAGGCT161828378811956
519PNLGNPS1619CCGAATCTGGGTAATCCTAGT162028362000836
520RLTEADR1621CGGTTGACTGAGGCGGATCGT162228352699136
521WNHSSTV1623TGGAACCACTCATCAACCGTG162428326012230
522IGNALLK1625ATTGGTAATGCGTTGCTGAAG162628312410420
523TAADHLR1627ACTGCTGCTGATCATTTGAGG162828307182112
524MVSNDTS1629ATGGTTAGCAACGACACTAGC1630282802828
525IQDSVQF1631ATTCAGGATTCGGTTCAGTTT163228273702456
526MGENLPS1633ATGGGGGAGAATCTTCCGAGT163428262932533
527ADSTQGK1635GCTGATTCGACTCAGGGTAAG1636282617181108
528IPVDMNR1637ATCCCCGTCGACATGAACAGG163828254672358
529SVDSGRL1639AGTGTTGATAGTGGGCGGCTT164028256652160
530SYGDGGV1641TCTTACGGGGACGGAGGGGTC164228252521304
531SHSLIEV1643TCTCATAGTCTGATTGAGGTG164428237442079
532LAGLGVI1645CTGGCGGGGCTGGGCGTCATA164628236232200
533MSVGQSW1647ATGTCAGTGGGTCAATCTTGG164828226762146
534TRDGQLA1649ACCAGGGACGGACAACTCGCA1650282002820
535AQEVARA1651GCGCAGGAGGTGGCGCGGGCT165228204472374
536GSVGVVV1653GGTTCGGTGGGTGTGGTTGTG1654281802818
537GGTLLTV1655GGTGGGACGTTGTTGACGGTG1656281812841534
538ITENVSR1657ATCACGGAAAACGTAAGCCGT165828172270546
539SAGVIMN1659TCGGCGGGTGTTATTATGAAT166028172287529
540AEGLRGQ1661GCTGAGGGGCTTCGGGGGCAG1662281428140
541GTGEIGM1663GGGACTGGTGAGATTGGTATG1664281328130
542VLGAHET1665GTACTAGGAGCGCACGAAACA166628122000812
543MALGYSS1667ATGGCGTTGGGGTATAGTAGT1668281102811
544STVTGGP1669AGCACCGTCACGGGCGGACCC1670281128110
545MQGENNK1671ATGCAGGGTGAGAATAATAAG167228109361874
546AAAQTAT1673GCAGCAGCCCAAACCGCAACG1674281010001810
547VNSGSVL1675GTGAATTCGGGTTCTGTGCTG167628082536272
548HAAGDRS1677CATGCGGCGGGTGATCGTAGT1678280612991508
549PVESQTA1679CCCGTTGAAAGCCAAACTGCC168028054802325
550STIPSLM1681AGCACAATACCCTCATTAATG1682280102801
551KNAGAES1683AAGAATGCGGGTGCTGAGAGT168428006502150
552SPGSDSK1685TCTCCTGGTTCTGATTCGAAG168627956792116
553KVTLEGD1687AAAGTCACACTAGAAGGTGAC168827932375418
554PQGNSSV1689CCCCAAGGCAACAGCTCAGTC169027912565226
555VLGTTTP1691GTGCTTGGGACGACTACGCCT169227912464327
556LSNHGPI1693TTGTCTAACCACGGCCCCATA1694279127910
557KENRVSD1695AAAGAAAACAGGGTAAGTGAC1696279127910
558IGGNSGD1697ATTGGGGGGAATAGTGGGGAT1698278811181670
559GAIGPAT1699GGGGCTATTGGTCCTGCTACT1700278602786
560DTHAKSM1701GATACGCATGCTAAGAGTATG1702278427840
561MNGGHHL1703ATGAATGGGGGTCATCATCTG170427837182065
562AAGLIQN1705GCTGCCGGGCTGATACAAAAC170627802393387
563QARDTKT1707CAAGCTCGAGACACCAAAACA1708278027800
564SSYANEH1709AGTTCGTATGCTAATGAGCAT171027792659120
565VLVSDRA1711GTGTTGGTTTCGGATCGTGCT171227787782000
566WSTDGGS1713TGGTCAACGGACGGCGGGAGT1714277302773
567CRESTCV1715TGTCGTGAGTCGACGTGTGTT171627722581191
568SQAEGPV1717AGTCAAGCGGAAGGGCCCGTG1718276402764
569ALSNDKH1719GCGCTTAGTAACGACAAACAC172027642208556
570LGASVPM1721CTTGGTGCTTCGGTTCCGATG172227632242521
571YSVGDSI1723TATTCTGTTGGGGATAGTATT172427622000762
572QNSTGLW1725CAAAACTCCACAGGCCTCTGG172627572432325
573LVAGQDL1727CTGGTGGCGGGGCAGGATTTG1728275727570
574GLSEGSV1729GGCTTAAGCGAAGGCTCAGTA1730275614011355
575TLAISEM1731ACTTTGGCGATTTCTGAGATG173227554412314
576LVHTTNN1733CTAGTACACACAACCAACAAC1734275327530
577TVVSSTR1735ACTGTTGTGTCTTCTACGAGG173627504732277
578GRGPDLT1737GGCAGGGGACCAGACCTCACT173827486312117
579IQSDHGR1739ATCCAAAGTGACCACGGACGC1740274827480
580QSSEMRD1741CAATCGTCCGAAATGCGTGAC174227435962147
581GSSENVS1743GGAAGCTCCGAAAACGTATCT174427402000740
582KTPGVDP1745AAGACTCCGGGGGTGGATCCT174627402368372
583QADDHGR1747CAGGCGGATGATCATGGTAGG174827397092030
584AYSDGSS1749GCTTATAGTGATGGGTCGTCT175027381944794
585MAASMTN1751ATGGCTGCCTCGATGACAAAC1752273702737
586STIPTLT1753AGTACTATTCCTACTCTGACG1754273302733
587AEVLNAL1755GCGGAGGTGCTGAATGCTTTG175627332444289
588AESLSGL1757GCTGAGAGTTTGAGTGGGTTG175827301791939
589NQGGGLT1759AACCAAGGTGGCGGCTTAACA1760273027300
590AHNGGVQ1761GCTCATAATGGTGGTGTTCAG176227272587140
591MGGSVTI1763ATGGGGGGTAGTGTTACGATT1764272627260
592VSVSMGI1765GTTAGTGTGAGTATGGGCATC176627245782146
593SGVAYER1767AGTGGGGTGGCTTATGAGAGG176827222274448
594PSATQSL1769CCGTCTGCGACTCAGTCGTTG177027225372185
595STGENKD1771TCCACGGGCGAAAACAAAGAC17722721265764
596PTQGTLG1773CCTACTCAGGGGACGCTTGGG177427207202000
597VLSADSV1775GTGCTTTCGGCTGATTCTGTG1776272027200
598LGETLIR1777TTGGGGGAGACTCTGATTCGG1778271827180
599EHLAGVV1779GAGCATTTGGCTGGTGTTGTT178027172600116
600QSDNHGR1781CAAAGCGACAACCACGGGCGG1782271627160
601LSVSQSA1783CTGAGTGTTTCTCAGTCGGCG178427142384330
602SELSLGY1785AGCGAATTGAGTCTCGGCTAC1786271327130
603AEDKANS1787GCTGAGGATAAGGCGAATAGT178827123602352
604STINTLM1789TCGACAATAAACACCCTAATG1790271115561155
605TGMTLGT1791ACGGGGATGACGCTGGGTACG1792270812461462
606LNGGHVL1793TTGAATGGGGGTCATGTTCTG179427022460242
607VVSDAGK1795GTGGTGAGTGATGCTGGGAAG179627003172383
608MNPSNSM1797ATGAATCCTAGTAATTCGATG179826972572125
609TAASIQS1799ACGGCCGCAAGCATACAATCC1800269602696
610RGTEHLM1801CGTGGTACTGAGCATTTGATG180226952192476
611LLADKSV1803CTTCTTGCTGATAAGAGTGTG180426954732222
612PGEHNHA1805CCGGGTGAGCATAATCATGCT180626932436256
613GTTSDTY1807GGTACTACGTCGGATACTTAT180826912400291
614GDISARY1809GGTGATATTTCTGCGAGGTAT181026882000688
615FSVSSLS1811TTCTCCGTCTCAAGTTTATCC181226886882000
616TSDRDQY1813ACTAGTGATCGGGATCAGTAT181426882339349
617AHVHVKE1815GCGCATGTTCATGTGAAGGAG1816268826880
618GRDLSTA1817GGTCGGGATCTTTCGACTGCT181826876802007
619GGGTEFY1819GGAGGAGGCACTGAATTCTAC1820268326830
620PYPSNSH1821CCGTATCCGAGTAATTCGCAT1822268226820
621NLGVGQM1823AATTTGGGTGTGGGTCAGATG182426802529151
622LSPGTDK1825CTGTCGCCGGGGACGGATAAG182626802362318
623GTDRVSR1827GGCACAGACAGAGTATCCCGT1828267926790
624MADGASM1829ATGGCGGATGGTGCGTCTATG183026785872091
625AGISNQT1831GCCGGAATCTCTAACCAAACT183226762520156
626ETQGRQF1833GAGACTCAGGGTCGTCAGTTT183426752842391
627YGSNDLS1835TATGGGAGTAATGATCTGAGT183626742262412
628AADNNRW1837GCTGCTGATAATAATAGGTGG183826743262348
629MPSNGQV1839ATGCCGTCTAATGGGCAGGTT184026746742000
630GFGDGTR1841GGCTTCGGAGACGGTACACGC1842267426740
631RLNEHEA1843AGGTTAAACGAACACGAAGCC18442671262051
632SVKSVTL1845AGTGTGAAGAGTGTGACGCTT184626672000667
633LTDGYTP1847CTGACCGACGGTTACACACCG184826672357310
634SNIGNDR1849TCGAATATTGGGAATGATAGG185026663622304
635KESTLST1851AAAGAAAGTACCCTCTCAACA185226668961770
636RVDPAQL1853AGGGTGGATCCGGCGCAGCTT185426662000666
637MYGESAK1855ATGTACGGGGAAAGCGCTAAA1856266526650
638VAEGGQI1857GTGGCGGAGGGTGGGCAGATT185826644892175
639LTDRVSR1859CTAACCGACAGAGTCTCTCGA1860266402664
640RLDELMI1861CGATTGGACGAACTAATGATC1862266302663
641PVKEYES1863CCGGTGAAGGAGTATGAGTCG1864266110401621
642QGGDSGG1865CAAGGGGGAGACTCAGGTGGC186626572543114
643VHTEAPY1867GTTCATACGGAGGCTCCGTAT1868265102651
644SQELRDR1869AGTCAGGAGCTGAGGGATCGT187026502516134
645VSRENVS1871GTCTCGCGTGAAAACGTTTCC1872264826480
646STDLSEL1873TCTACGGATTTGTCGGAGTTG187426452501144
647GTGIQTR1875GGCACAGGAATCCAAACACGT187626432000643
648ADSDYTE1877GCAGACTCCGACTACACAGAA187826422000642
649VDTSARD1879GTCGACACGTCTGCAAGAGAC1880264102641
650LGNKDGV1881TTGGGGAATAAGGATGGTGTT188226416412000
651REAGTNS1883CGGGAGGCTGGGACGAATTCT1884264026400
652PGATNNP1885CCGGGTGCGACGAATAATCCG188626361686950
653NSISLIN1887AACTCTATCAGCCTCATAAAC188826362000636
654ASSEFKI1889GCCTCATCCGAATTCAAAATA189026352328307
655VTTGSPV1891GTTACGACTGGGTCGCCGGTA189226332078555
656GSTNVNV1893GGTAGTACGAATGTTAATGTG1894263102631
657GDMSGSL1895GGGGATATGAGTGGGAGTTTG1896263126310
658SRTDSGP1897AGTCGTACGGATTCGGGGCCG189826302000630
659EKGSTLV1899GAGAAGGGGTCGACGTTGGTG190026296292000
660GHATDSV1901GGTCATGCTACTGATAGTGTG190226286282000
661VSNGTFV1903GTGTCGAACGGAACGTTCGTA190426262000626
662SAGGSLQ1905TCGGCAGGAGGTAGCCTACAA1906262602626
663HDTSDSV1907CATGATACTAGTGATAGTGTT190826264672160
664AAGVSLN1909GCGGCGGGTGTTAGTCTGAAT1910262626260
665NTVTNIL1911AACACCGTCACGAACATCCTC19122624C2624
666KSHSENN1913AAATCACACTCAGAAAACAAC1914262414581166
667RNHDLTH1915AGGAATCATGATCTGACTCAT1916262426240
668VKDGPGT1917GTGAAAGACGGACCCGGTACG191826232340283
669VVVGNVK1919GTTGTTGTTGGTAATGTGAAG1920262326230
670AGGGDTR1921GCTGGTGGGGGCGACACACGT1922262326230
671KSISGEW1923AAGAGTATTTCGGGTGAGTGG192426226222000
672ASADSRS1925GCTTCGGCGGATTCTCGTAGT192626204362184
673SVAQNQT1927TCCGTAGCTCAAAACCAAACT192826206202000
674DRASSDA1929GACAGGGCTTCATCAGACGCC193026202481139
675PVRDTKT1931CCGGTGCGTGATACTAAGACT193226201696924
676GVGNTNI1933GGTGTGGGGAATACTAATATT193426192466154
677GARLTYT1935GGAGCCCGCCTCACTTACACA193626197501869
678TSLGLMV1937ACGAGTCTGGGTCTTATGGTG1938261802618
679KAVDNGL1939AAGGCTGTTGATAATGGGCTG194026166162000
680AHEAGSR1941GCTCATGAGGCGGGTAGTCGT19422615252590
681ASQDRGL1943GCTTCGCAGGATAGGGGGTTG194426111282483
682SVTDIKH1945TCTGTTACTGATATTAAGCAT194626092000609
683ISNGTER1947ATATCAAACGGAACAGAACGC1948260802608
684GHQNGGI1949GGGCACCAAAACGGCGGAATC195026087791828
685GNGTGVI1951GGTAATGGGACTGGTGTGATT19522607252483
686AKTNDSN1953GCGAAGACGAATGATAGTAAT195426052028577
687GMATQTT1955GGGATGGCTACTCAGACGACT1956260202602
688SSDTTLR1957AGTTCGGATACTACTTTGCGT195826022392209
689LVDDKAH1959TTGGTTGATGATAAGGCGCAT196026022003599
690TLAISQP1961ACCTTAGCCATATCGCAACCT196226002430170
691AGFSSQS1963GCGGGGTTTAGTTCTCAGTCG196425972000597
692ILIGTSP1965ATTCTTATTGGTACTTCGCCG196625973562242
693SMESSSR1967TCTATGGAAAGCAGTTCGCGT196825962278318
694ARSEFKT1969GCGAGGTCTGAGTTTAAGACT1970259525950
695NKSDHEF1971AACAAATCAGACCACGAATTC197225942466128
696VEVPTAN1973GTTGAAGTGCCAACAGCGAAC197425933692224
697LLTSAVA1975CTGCTTACATCGGCTGTTGCC197625922355237
698MGGVSNP1977ATGGGGGGGGTTAGTAATCCG197825925922000
699LGDSASP1979CTTGGGGATTCTGCTTCGCCG198025872000587
700MVGGGVS1981ATGGTGGGTGGTGGGGTGTCG198225875872000
701ASQLTQT1983GCGTCTCAGCTTACTCAGACT198425872476111
702LSNMMSV1985CTGTCTAATATGATGAGTGTT1986258625807
703MYVAHSS1987ATGTATGTTGCTCATAGTTCG198825853992186
704THDPIQR1989ACTCATGATCCGATTCAGCGT199025852322263
705HQDRTTL1991CATCAGGATAGGACGACGCTT199225842000584
706GTLERTA1993GGGACGTTGGAACGTACGGCC1994258113891192
707IVDVTAR1995ATAGTGGACGTTACTGCTCGG199625804822098
708IFNTTNT1997ATTTTTAATACTACGAATACT199825805802000
709AKLLDSL1999GCAAAACTCCTCGACAGCCTT2000258010081572
710WDDSKDR2001TGGGACGACTCAAAAGACAGA200225793562223
711MLRGYSQ2003ATGCTTAGGGGTTATAGTCAG2004257802578
712QDGMLTR2005CAAGACGGTATGTTGACAAGG200625788031775
713LANMLNV2007CTGGCTAATATGTTGAATGTT2008257602576
714PYEGAGT2009CCGTACGAAGGCGCAGGTACT201025764502126
715MDGKSPP2011ATGGATGGGAAGTCGCCGCCG2012257625760
716GQAGTYS2013GGCCAAGCGGGTACCTACTCG2014257525750
717MDGKSPT2015ATGGACGGAAAAAGTCCAACA2016257410161558
718VMTVETS2017GTGATGACCGTCGAAACCTCG201825742250324
719VQMTLHK2019GTTCAGATGACGCTTCATAAG2020257202572
720VEWKHPL2021GTGGAGTGGAAGCATCCTTTG202225711929642
721RGAESSE2023CGTGGTGCCGAAAGCAGTGAA202425715712000
722LSNMLSV2025CTGTCTAATATGTTGAGTGTT202625705702000
723ELVATTI2027GAGCTTGTGGCTACTACTATT202825702000570
724LAGLGGP2029CTCGCAGGCCTTGGTGGCCCC203025673732194
725RISHEGT2031CGCATATCCCACGAAGGAACT2032256702567
726TAAGIDR2033ACTGCTGCGGGGATTGATCGG2034256614871079
727FAEVAQA2035TTCGCCGAAGTAGCCCAAGCT2036256602566
728GPAEGQG2037GGGCCAGCCGAAGGACAAGGT2038256502565
729GAADRQI2039GGCGCAGCAGACCGACAAATA204025652298266
730AVSGYTV2041GCAGTGTCAGGGTACACGGTT204225641754811
731IANLADS2043ATTGCGAATCTTGCTGATTCG204425643042260
732TSYDKLV2045ACGTCGTATGATAAGTTGGTT204625624982065
733FQDTIGV2047TTTCAGGATACGATTGGGGTG2048256211611401
734TNGGEGA2049ACTAATGGGGGTGAGGGGGCG2050256002560
735IAQNTPY2051ATCGCACAAAACACACCCTAC2052255813341224
736QVHDTKT2053CAAGTCCACGACACAAAAACG205425584212136
737RLNEHEA2055CGTCTGAATGAGCATGAGGCG205625571782775
738AGSGTEV2057GCGGGTTCTGGGACTGAGGTT205825571949608
739IVIAEIH2059ATTGTGATTGCTGAGATTCAT206025546241930
740QVRETKT2061CAAGTTAGGGAAACCAAAACC206225532226327
741SVNSGLL2063AGTGTTAATAGTGGGCTGCTT206425512348203
742VGVNGSH2065GTGGGTGTGAATGGTTCTCAT206625502274276
743AAAQSAT2067GCAGCAGCACAATCGGCAACG206825492379170
744SKAEGPV2069TCGAAGGCTGAGGGTCCGGTT207025482000548
745AGLQVSI2071GCCGGTTTACAAGTCAGCATC2072254725470
746QNERITV2073CAGAATGAGAGGATTACTGTG207425462000546
747QEKGTST2075CAAGAAAAAGGAACCTCGACG207625447671777
748SHGSDSK2077TCGCACGGCTCCGACTCTAAA2078254325430
749RMENGNT2079AGAATGGAAAACGGTAACACC2080254202542
750LGVEVGA2081TTGGGTGTGGAGGTTGGGGCG208225424822060
751TTGPSNA2083ACTACTGGGCCGAGTAACGCC208425422000542
752RDLDGKY2085AGGGACCTTGACGGAAAATAC2086254225420
753MDTHTNT2087ATGGACACCCACACAAACACA208825415412000
754SLINTGS2089TCACTCATCAACACAGGTTCT209025405402000
755KSISGEW2091AAAAGCATCTCTGGCGAATGG209225382184354
756GVNHAVA2093GGTGTTAATCATGCGGTGGCT209425382237301
757AGEHYQA2095GCAGGCGAACACTACCAAGCG2096253825380
758TTGLTGS2097ACGACGGGGCTGACTGGTAGT209825352000535
759PTQGTLQ2099CCGACCCAAGGTACCTTGCAA210025352261274
760GGTQAVL2101GGTGGGACTCAGGCTGTGCTG210225342392142
761AADVILN2103GCTGCCGACGTCATCCTTAAC2104253425340
762ITTGPGG2105ATTACGACGGGTCCTGGGGGT2106253312421290
763TTLAGPA2107ACTACTCTGGCTGGTCCTGCG2108253325330
764NNGTLPI2109AATAATGGTACTTTGCCGATT211025327541779
765IDSLNSV2111ATAGACAGTCTGAACTCCGTC211225292387142
766GAASSTK2113GGCGCAGCATCGTCCACAAAA2114252802528
767ELRVKDT2115GAGCTTAGGGTTAAGGATACT211625282962232
768MGASATL2117ATGGGTGCATCCGCAACCTTG211825272000527
769YGTVVET2119TACGGAACAGTGGTGGAAACG212025272000527
770GTLVSEL2121GGTACGTTGGTGTCGGAGCTG212225222000522
771ATGTESR2123GCAACAGGGACCGAATCAAGG212425203052215
772NGGIGGF2125AATGGTGGGATTGGTGGTTTT212625204732047
773LADNHGR2127CTGGCGGATAATCATGGTAGG212825205202000
774TSASVSQ2129ACTTCTGCTTCTGTTTCTCAG213025185182000
775GNSGGDF2131GGAAACAGCGGTGGGGACTTC2132251825180
776TYEDLRV2133ACTTATGAGGATCTTAGGGTG2134251825180
777RDASITI2135CGAGACGCCTCGATAACAATA213625163612155
778GAQVNGT2137GGTGCACAAGTAAACGGTACA213825161595921
779RMTGDLT2139AGGATGACGGGTGATTTGACT214025142190324
780ITSEPLP2141ATCACATCCGAACCCCTACCT2142251302513
781TLAISEL2143ACGCTTGCTATCAGCGAATTG214425092140369
782ASLLNKT2145GCCAGCTTACTCAACAAAACG214625092000509
783RDYAEQP2147CGCGACTACGCTGAACAACCT214825092382126
784ARVDTGI2149GCGCGTGTAGACACGGGGATA215025082000508
785EGKTQLQ2151GAGGGTAAGACTCAGCTGCAG2152250825080
786VGNDERP2153GTTGGGAATGATGAGCGTCCG215425073332175
787LSLSKDK2155CTGAGTCTTAGTAAGGATAAG2156250725070
788ISLDATS2157ATAAGCCTCGACGCTACATCT215825062000506
789GTMSPGA2159GGTACTATGTCTCCTGGGGCT2160250625060
790GSGERPV2161GGAAGTGGTGAAAGGCCGGTA216225042000504
791AGEHYQA2163GCGGGTGAGCATTATCAGGCT2164250425040
792RNEGINQ2165CGTAATGAGGGTATTAATCAG216625022188314
793GMGASSK2167GGTATGGGGGCGTCTTCTAAG216825022389113
794QLVAQDR2169CAGTTGGTGGCGCAGGATCGG217025008511649
795TTADIVR2171ACGACGGCGGATATTGTTAGG217224983322167
796YTVTGTI2173TACACCGTAACTGGCACAATC217424984982000
797GNGTGVL2175GGTAATGGGACTGGTGTGCTT217624972363134
798PIHGASS2177CCAATACACGGGGCGTCATCT217824962001495
799DSHASGD2179GATAGTCATGCGTCGGGGGAT2180249624960
800PNERHTV2181CCGAATGAGAGGCATACTGTG218224941581913
801NRLGDRI2183AACAGGCTGGGCGACCGACTA218424942322172
802LLQSLND2185CTCCTACAATCGCTGAACGAC218624942000494
803DRAELRL2187GACCGGGCAGAACTAAGGCTT2188249302493
804RNFSVVL2189CGGAACTTCAGTGTAGTACTG219024921945547
805IAGVPQA2191ATTGCGGGGGTTCCGCAGGCG2192249224911
806EPSLSSP2193GAGCCGTCTCTGAGTTCTCCG219424924922000
807LNGGHVM2195TTGAATGGGGGTCATGTTATG219624922171321
808RDLNSDV2197AGGGACCTTAACTCGGACGTC219824912274217
809PGQHNQA2199CCAGGACAACACAACCAAGCC220024903072183
810GVEGSGM2201GGGGTGGAAGGCTCCGGAATG2202249012631227
811QADNNGR2203CAAGCTGACAACAACGGCCGC2204249024900
812ADAGIMM2205GCGGACGCCGGCATCATGATG220624865531933
813DRADDSR2207GATAGGGCTGATGATTCTCGT220824862486C
814DQTYTSA2209GACCAAACATACACAAGCGCG2210248524850
815GVRDTNI2211GGAGTTCGAGACACAAACATA221224833832100
816MSVATQR2213ATGTCAGTCGCGACTCAACGA221424834832000
817PALEANI2215CCGGCTCTTGAGGCTAATATT221624821623858
818VLNEHVA2217GTCCTTAACGAACACGTAGCT2218248202482
819GLNEHQA2219GGTCTGAATGAGCATCAGGCG2220248202482
820SLDSLTS2221AGTTTAGACAGCTTAACCAGT222224822345137
821WTDRESL2223TGGACTGATCGGGAGTCGCTG222424812481C
822DVGTGAL2225GATGTTGGGACTGGGGCGTTG2226248124810
823VGHVESP2227GTAGGCCACGTCGAATCTCCA2228248002480
824KGSDTAM2229AAAGGGTCAGACACAGCCATG2230248002480
825KLSSEKT2231AAGCTTTCGAGTGAGAAGACT223224782366112
826VGLSRDL2233GTTGGGCTGAGTCGGGATCTG2234247702477
827VSNAVGQ2235GTTTCGAATGCTGTGGGTCAG2236247724770
828LSNHGSV2237TTGAGCAACCACGGATCGGTA223824763942083
829GAPSLGD2239GGGGCGCCGTCGTTGGGTGAT224024762000476
830MNGAHVL2241ATGAACGGCGCGCACGTATTG2242247524750
831NGNMASY2243AACGGAAACATGGCAAGTTAC224424721620852
832IAQMHSS2245ATCGCACAAATGCACAGTTCC224624706061864
833PTTLGHD2247CCGACAACTCTCGGACACGAC2248247024700
834QANMLSD2249CAAGCCAACATGCTCTCAGAC225024691614856
835WANGNTV2251TGGGCGAATGGGAATACGGTG225224682000468
836EEKSASY2253GAGGAGAAGTCGGCTTCTTAT225424681731736
837VSPAASV2255GTTAGTCCTGCTGCGAGTGTT2256246824680
838ARSLGEV2257GCGAGGTCGCTTGGGGAGGTT225824674672000
839MDVSSGP2259ATGGATGTGAGTAGTGGTCCG226024652000465
840ISNYTRL2261ATATCTAACTACACGCGGCTT226224642000464
841PDERLTV2263CCTGACGAACGGCTAACGGTT2264246324630
842TDALKSK2265ACCGACGCCCTAAAAAGCAAA2266246224620
843ATDSTQS2267GCCACCGACAGCACTCAAAGC2268246224620
844SSLLTTA2269TCGTCGTTGCTGACTACTGCT227024616031858
845VLTSPGP2271GTTCTGACTTCGCCTGGTCCT2272246002460
846DSHVSGM2273GATAGTCATGTGTCGGGGATG227424592372222
847NDSAANS2275AACGACTCTGCTGCGAACTCC2276245924590
848ALGVAVA2277GCTCTTGGGGTTGCTGTTGCT2278245802458
849GTAGHMS2279GGGACTGCTGGGCATATGTCG228024574572000
850NNLGDRL2281AACAACCTGGGCGACAGGCTC228224561937519
851LGAGSPN2283CTAGGCGCCGGAAGCCCGAAC2284245624560
852SGSNTGT2285TCGGGGTCTAATACGGGTACT2286245552450
853GVGASEK2287GGTGTGGGGGCTAGTGAGAAG228824554721984
854MSNVGTW2289ATGAGTAACGTAGGCACATGG229024552000455
855IVMAENN2291ATCGTAATGGCGGAAAACAAC2292245402454
856HVDLGTK2293CACGTTGACTTAGGCACAAAA229424522242228
857PNERVTV2295CCGAATGAGAGGGTTACTGTG22962452237379
858DRDTNPY2297GACCGAGACACCAACCCATAC229824524522000
859DGGLPKS2299GACGGAGGCTTACCCAAAAGC2300245214191033
860RISQDGD2301AGAATATCCCAAGACGGAGAC2302245102451
861AVLAGTR2303GCGGTTCTGGCGGGGACTAGG230424502000450
862KASDTPM2305AAAGCAAGTGACACGCCCATG2306245024500
863GNDVGRS2307GGGAACGACGTAGGCCGCTCG2308244602446
864STLSGTD2309TCGACGCTGTCTGGTACTGAT231024461766680
865FSSEQLT2311TTTTCGTCTGAGCAGCTTACG2312244502445
866SHLGDRL2313AGTCATCTTGGTGATCGTTTG231424432000444
867AVKEYQS2315GCTGTTAAAGAATACCAATCT2316244302443
868MGTPTNT2317ATGGGTACTCCTACGAATACG2318244302443
869QSLATGI2319CAGTCGCTTGCTACTGGGATT232024429211522
870PGVAMVS2321CCTGGGGTAGCAATGGTATCT232224422000442
871SAETRNG2323TCTGCGGAGACTAGGAATGGG232424413352106
872EGGYSGR2325GAGGGTGGTTATAGTGGGCGT232624402000440
873ETEASSR2327GAGACGGAGGCGAGTTCGCGT2328244024400
874STHHTST2329TCGACGCACCACACCAGTACG233024392000439
875REMPLSH2331AGGGAGATGCCTTTGAGTCAT23322438234593
876RELQSAA2333CGCGAATTACAAAGCGCAGCT233424372000437
877GSGSGVL2335GGTTCTGGGTCGGGGGTGCTG233624371925512
878GVLTTVT2337GGAGTCTTGACCACTGTTACG2338243302433
879NLQGNAH2339AATCTGCAGGGTAATGCTCAT2340243211601271
880AAISSQT2341GCGGCGATTAGTTCTCAGACG2342243002430
881ETTVSHV2343GAGACTACGGTTTCTCATGTG2344242813761052
882ALTNGQR2345GCACTAACCAACGGTCAACGT234624272000427
883NNNGATS2347AATAATAATGGTGCGACTTCT234824265261900
884IDGKSPP2349ATTGATGGGAAGTCGCCGCCG2350242502425
885PTGTVVT2351CCGACTGGGACTGTTGTTACT235224254252000
886SGEQLRI2353TCTGGGGAGCAGCTTAGGATT2354242413431081
887AVNNVTL2355GCTGTGAATAATGTTACTCTT235624242000424
888LSDLMRS2357CTTAGCGACCTCATGAGGTCT235824246271797
889QYVVTGG2359CAGTATGTTGTTACTGGTGGG236024232000423
890MDGKTPP2361ATGGACGGTAAAACTCCCCCT2362242102421
891IGMDPKA2363ATTGGTATGGATCCGAAGGCG236424201509910
892GVDAVAY2365GGAGTGGACGCTGTGGCATAC2366241402414
893GNQGGTR2367GGTAATCAGGGGGGGACGCGT236824145871827
894ASASSPR2369GCCTCAGCATCATCACCTAGG2370241302413
895GLSPEAR2371GGTTTGTCTCCTGAGGCGCGT2372241302413
896LVTTLHM2373TTGGTCACCACACTACACATG237424132922121
897HAGLGII2375CACGCGGGGCTGGGCATAATC237624122000412
898AILGASS2377GCCATACTAGGCGCATCTTCC237824122000412
899STHDIRV2379TCTACTCACGACATACGAGTC238024114112000
900AEQLSHS2381GCAGAACAACTATCCCACAGC2382241124110
901LHDTLTR2383CTGCACGACACATTAACCCGC238424102000410
902PGEHYPA2385CCCGGCGAACACTACCCAGCG238624101755655
903SSGSGVA2387AGTTCTGGGTCGGGGGTGGCT2388241024100
904MVDSAQL2389ATGGTGGATTCGGCGCAGCTT2390240702407
905SGLVTEL2391AGCGGATTGGTAACTGAACTG2392240602406
906HGHIAQS2393CATGGGCATATTGCGCAGTCG239424061692237
907HTDGSYV2395CATACGGATGGGAGTTATGTT239624052542151
908MAGQPSQ2397ATGGCGGGTCAGCCTAGTCAG239824044042000
909PTQGTPR2399CCTACTCAGGGGACGCCTCGG240024027031699
910AGAAIVA2401GCTGGTGCGGCGATTGTTGCG2402240223939
911VSASTMA2403GTTAGTGCTTCGACTATGGCT2404240102401
912SSDSRTL2405TCGTCGGATTCGCGGACGTTG2406240002400
913LTDAHGI2407TTAACGGACGCTCACGGGATC2408239802398
914TLQGGLS2409ACTCTGCAGGGTGGTCTGTCT2410239802398
915SMSSEIW2411TCGATGTCATCCGAAATATGG241223982262172
916QYVDTGG2413CAGTATGTTGATACTGGTGGG24142398231385
917VSNTSES2415GTGTCAAACACCTCCGAATCG2416239613701026
918IGSAGDR2417ATTGGGAGTGCGGGTGATCGT241823963962000
919ANEDRMS2419GCTAATGAGGATCGGATGAGT242023952000395
920AAESSVR2421GCGGCGGAGAGTTCTGTGCGG242223953952000
921SQSDFPN2423TCACAAAGTGACTTCCCCAAC2424239402394
922VRGEETV2425GTTCGTGGGGAAGAAACCGTC242623949851410
923NNLGDRM2427AATAATCTTGGTGATCGTATG242823942000394
924GNLDLKA2429GGAAACCTTGACCTCAAAGCC243023947521642
925GTKDVSP2431GGCACAAAAGACGTGTCACCC243223932000393
926AIPIAST2433GCTATTCCGATTGCGTCTACG2434239202392
927VSASSVD2435GTATCAGCCTCGTCGGTGGAC243623922259133
928NMDRDHV2437AATATGGATCGTGATCATGTG2438239002390
929SHASDSK2439TCTCATGCTTCTGATTCGAAG244023902612130
930PGSDGGH2441CCGGGGTCGGATGGGGGGCAT2442239013571033
931NLGEVQM2443AATTTGGGTGAGGTTCAGATG2444239023900
932RYFGDAS2445CGTTACTTCGGAGACGCGAGT244623892000389
933AADTTVR2447GCCGCTGACACGACAGTACGC244823882111276
934VTGVESR2449GTCACAGGAGTCGAATCTCGA245023872000387
935ATVSSPR2451GCGACCGTATCAAGCCCTAGG245223872000387
936GGNAGLN2453GGTGGTAATGCGGGGCTTAAT245423873872000
937RNQSDQM2455CGGAACCAATCGGACCAAATG245623872025362
938RHQGTES2457AGGCACCAAGGAACAGAATCG245823875431844
939TTSIPTP2459ACGACGTCTATTCCGACGCCT2460238602386
940TSHLGQS2461ACGTCACACCTTGGGCAAAGT246223862000386
941GVNPAVS2463GGTGTTAATCCTGCGGTGTCT246423862000386
942GSDTTVG2465GGTTCGGATACGACGGTTGGT246623851022282
943DLAQSGR2467GATTTGGCTCAGAGTGGGCGT246823851796589
944GMNEHVA2469GGGATGAACGAACACGTCGCC247023842832100
945VMDHKST2471GTTATGGATCATAAGAGTACG247223842000384
946MSTDTPA2473ATGTCGACAGACACGCCCGCG2474238123810
947AMTVEMP2475GCTATGACGGTTGAGATGCCT2476238002380
948GSRENER2477GGCAGCCGCGAAAACGAACGA2478237913011078
949ATDSQGL2479GCTACGGATTCGCAGGGGCTG2480237823780
950PNERITV2481CCGAATGAGAGGATTACTGTG248223772000377
951TTLSDTA2483ACTACTCTGTCTGATACTGCG2484237602376
952HSTGAEM2485CACTCAACAGGCGCCGAAATG2486237602376
953AVLAAAH2487GCGGTGTTGGCTGCGGCTCAT2488237623760
954STLHSST2489TCTACATTGCACTCGAGCACC2490237523750
955VLSVGVV2491GTGCTGAGTGTGGGGGTTGTT2492237523750
956KEGIATA2493AAAGAAGGTATCGCTACCGCA249423742000374
957MVNHTNT2495ATGGTGAACCACACTAACACT249623743742000
958ESTATLR2497GAGTCTACTGCTACTCTTCGG249823722003369
959AVKEHES2499GCCGTCAAAGAACACGAATCC250023722000372
960RYFGDAS2501CGGTATTTTGGGGATGCTTCG25022371412330
961LDARGDR2503CTGGACGCCAGGGGAGACCGT250423711868503
962PLNKDTR2505CCATTGAACAAAGACACTCGG250623706481722
963FQVDKVM2507TTTCAGGTTGATAAGGTTATG250823693692000
964LTDKITS2509CTGACAGACAAAATCACTAGT251023681629739
965LKPAIEL2511CTTAAGCCTGCGATTGAGCTG251223685191849
966ADSVYAK2513GCTGATAGTGTTTATGCTAAG25142368227494
967VMSSPGP2515GTTATGTCTTCGCCTGGTCCT251623673672000
968ALAISER2517GCCCTAGCTATCAGTGAACGT2518236723670
969VTKEHLA2519GTTACTAAAGAACACCTCGCC252023661915451
970GLTNTNI2521GGACTTACAAACACGAACATA252223662252114
971KQVSMES2523AAGCAGGTGTCGATGGAGTCG252423652246119
972SPSASPN2525AGTCCGTCGGCTTCTCCTAAT252623642000364
973VVVGNVK2527GTCGTAGTGGGCAACGTTAAA252823642000364
974SHGSDTK2529TCCCACGGAAGTGACACCAAA253023631823540
975QSREIKI2531CAGAGTAGGGAGATTAAGATT2532236223620
976PALQGNI2533CCGGCTCTTCAGGGTAATATT2534236123610
977TLVGVER2535ACACTCGTGGGCGTCGAAAGG2536236002360
978SGGTMTL2537AGTGGGGGTACGATGACGCTG2538236002360
979DRRTDDS2539GATCGTCGTACTGATGATTCT2540235902359
980NSDNTRL2541AATTCGGATAATACTAGGCTG254223574621896
981ALQSAQV2543GCACTACAATCTGCACAAGTT254423562000356
982NVNPDSL2545AACGTAAACCCCGACAGCTTA254623562211145
983SNGLPAK2547AGTAATGGGCTTCCTGCGAAG254823562205151
984GLNEHVV2549GGGCTAAACGAACACGTCGTA255023552142141
985LVTVHTA2551CTAGTCACCGTACACACGGCA255223547141640
986GVAATNS2553GGCGTAGCGGCAACCAACAGC2554235402354
987QVTDNKT2555CAAGTAACAGACAACAAAACG255623541992155
988VNGVSTI2557GTTAACGGAGTATCCACAATC255823542169186
989PIASSYE2559CCTATTGCGTCTAGTTATGAG2560235423540
990LGESLSR2561TTGGGTGAGTCTTTGAGTCGG2562235423540
991PALAGNF2563CCGGCTCTTGCGGGTAATTTT2564235302353
992RRDASDP2565CGGCGAGACGCCTCCGACCCC2566235323530
993WDNFRFA2567TGGGACAACTTCAGATTCGCG2568235202352
994VIGGVLS2569GTGATTGGTGGTGTGTTGAGT2570235210191333
995ADDNNRW2571GCTGATGATAATAATAGGTGG257223522000352
996LTHSTAV2573CTTACCCACAGTACAGCGGTG257423523522000
997VLSGEVL2575GTCTTGTCTGGAGAAGTCCTT257623513512000
998SVVADKH2577AGTGTTGTGGCGGACAAACAC2578235002350
999PGSTDPK2579CCTGGGAGTACTGATCCGAAG258023502000350
1000ASAELGR2581GCGAGTGCGGAGCTGGGTCGT258223492218131
Mouse_DG
SEQSEQ
IDIDCombinedBALB/cJC57BL/6J
RankPeptideNO:SequenceNO:scorescorescore
1REQQKLW2583CGTGAGCAGCAGAAGCTTTGG2584391512115118000
2REQQKLW2585CGGGAACAACAAAAATTATGG2586380002200016000
3ASNPGRW2587GCTTCGAATCCGGGTCGGTGG258824583458320000
4SLDKPFK2589AGTTTGGATAAGCCTTTTAAG259024000024000
5TLAVPFK2591ACTTTGGCGGTGCCTTTTAAG259222000022000
6TLAVPFK2593ACCTTAGCTGTCCCGTTCAAA259422000022000
7SLDKPFK2595TCACTGGACAAACCATTCAAA259621590021590
8WTLESGH2597TGGACTCTGGAGTCTGGTCAT259820534422816306
9ASNPGRW2599GCAAGCAACCCTGGAAGATGG260018000018000
10REQKKLW2601CGTGAGCAGAAGAAGCTTTGG260217695101757520
11WTLESGH2603TGGACGCTCGAATCGGGCCAC260417407400013407
12REQKKLW2605CGGGAACAAAAAAAACTGTGG26061645773199138
13ERLLVQL2607GAGCGGCTGCTTGTTCAGCTG260815513200013513
14RMQRTLY2609CGTATGCAGCGGACTCTGTAT261012453200010453
15ERLLVQL2611GAACGACTTCTAGTCCAACTA261211949011949
16LGFSPPR2613CTTGGATTCTCACCTCCCCGT261411710011710
17WAISDGY2615TGGGCGATTAGTGATGGGTAT26161082865714258
18TEKLPFR2617ACTGAGAAGCTGCCTTTTCGG261810129010129
19VRGSSIL2619GTGCGTGGGTCGTCGATTCTT2620971476172097
20KLADSVP2621AAACTTGCAGACTCAGTGCCC2622935671442212
21VRGSSIL2623GTCCGGGGATCCAGTATCCTG2624844950593390
22RQQQKLW2625AGGCAACAACAAAAATTATGG2626839320006393
23TVNNDRF2627ACCGTTAACAACGACCGATTC2628802856112417
24TVSENAV2629ACGGTTAGTGAGAATGCGGTT2630800080000
25MSANERT2631ATGTCGGCGAATGAGCGTACG2632800060002000
26RDQQKLW2633CGTGATCAGCAGAAGCTTTGG2634776148132948
27GASNGGT2635GGGGCGAGTAATGGTGGGACT2636767452522422
28TEKLPFR2637ACAGAAAAACTCCCCTTCAGA2638754807548
29ASTATLW2639GCCTCAACGGCCACCTTATGG2640680706807
30VTELTKF2641GTGACTGAGCTTACGAAGTTT2642665540002655
31KEISVSV2643AAGGAGATTAGTGTGTCGGTT264464206007413
32QVAQQGA2645CAGGTTGCGCAGCAGGGGGCG2646639320004393
33GVAGTNT2647GGGGTGGCTGGGACGAATACT264863866011375
34ASAQGAL2649GCGTCTGCTCAGGGTGCGCTT2650638214444938
35QSVDRSK2651CAGTCGGTGGATCGTAGTAAG2652629351971096
36RYVGESS2653CGGTACGTCGGAGAAAGCAGT2654625214224830
37LGHNAGV2655TTGGGGCATAATGCTGGTGTT2656623920034237
38RAAGTSA2657CGCGCCGCTGGCACCTCAGCA2658600060000
39VGISSGV2659GTTGGTATTTCTTCGGGTGTG2660600040002000
40VLVSPGP2661GTTCTGGTTTCGCCTGGTCCT2662600040002000
41SGETLRI2663TCTGGGGAGACGCTTAGGATT2664597740001977
42STEGAAL2665AGTACGGAGGGGGCGGCTCTG266659545715239
43SSSGAAR2667TCTTCTTCAGGTGCCGCCCGC2668592659260
44VLASNGP2669GTGTTAGCGTCCAACGGGCCG2670589832692628
45VVQVTGR2671GTTGTTCAGGTTACTGGGCGT267258755410465
46FAVRLSS2673TTTGCTGTGCGGTTGTCGTCG2674585528093046
47LVRDTKT2675CTCGTAAGAGACACAAAAACG267658115452359
48SGESLSR2677TCTGGCGAAAGCTTATCTAGG2678580438042000
49TLANSQR2679ACTTTGGCGAATTCTCAGCGG2680574640001746
50SQEQRAR2681AGTCAGGAGCAGAGGGCTCGT2682567940001679
51SREGGNV2683AGCCGAGAAGGAGGGAACGTA2684558035802000
52LGGSSMG2685CTTGGTGGGTCTTCAATGGGG2686556526122953
53SGSTDKL2687TCTGGTTCGACTGATAAGTTG2688555632642292
54RDQQKLW2689CGGGACCAACAAAAACTGTGG2690540440001404
55RVSEVGS2691CGCGTCTCAGAAGTTGGCAGC2692537953790
56LGFSPPR2693CTGGGTTTTAGTCCGCCGAGG2694536905369
57SPADTRR2695TCTCCTGCGGATACTAGGAGG2696533853380
58LSRGDEM2697TTGAGTCGCGGAGACGAAATG2698532605326
59RMQRTLY2699CGAATGCAACGAACATTGTAC2700531520003315
60IANLAAS2701ATTGCGAATCTTGCTGCTTCG270253115107204
61AGGVRDR2703GCGGGTGGTGTTCGTGATCGT270452994717582
62GSGSGGL2705GGGAGTGGATCCGGAGGCTTA270652525073179
63TLANSER2707ACACTCGCCAACTCAGAAAGG2708523720003237
64VNYSVAL2709GTTAATTATTCGGTGGCGCTT2710520632871920
65KNPGVYT2711AAGAATCCGGGGGTGTATACT2712517317503423
66QREAARI2713CAGCGTGAGGCGGCGCGGATT2714517315343639
67LSQGSQQ2715CTATCCCAAGGTAGTCAACAA2716516751670
68NGSEGDR2717AACGGTTCGGAAGGGGACAGG2718515731572000
69NVGVVQL2719AACGTAGGCGTAGTACAACTA272051444780364
70TLAVPF*2721ACTTTGGCGGTGCCTTTTTAA2722513505135
71HSLQTSA2723CACTCATTACAAACCTCTGCG2724509626822414
72SATDVKH2725TCGGCAACAGACGTAAAACAC2726508350830
73SGANLSY2727TCTGGTGCGAATTTGTCTTAT272849854271714
74KAHDGEV2729AAAGCGCACGACGGCGAAGTT2730497820002978
75KASDTPM2731AAGGCTTCTGATACTCCTATG2732484020002840
76PGEHNKA2733CCGGGTGAGCATAATAAGGCT27344838474395
77NSAADRQ2735AATAGTGCGGCTGATCGTCAG2736480820002808
78FTHGTGT2737TTCACGCACGGCACAGGCACA273847894000789
79LSNHGPI2739CTCTCCAACCACGGCCCGATC274047593814945
80GGASPVR2741GGGGGTGCTTCTCCTGTGCGG2742475647560
81KLTNIGT2743AAACTTACCAACATAGGCACG2744472320002723
82PGSDGRT2745CCAGGAAGTGACGGAAGGACG274647194000719
83AGGGATR2747GCCGGTGGAGGCGCCACTCGC2748471804718
84GTGSTIV2749GGTACAGGCTCCACAATCGTA2750471629421774
85SNGAGYL2751TCCAACGGAGCTGGGTACTTA2752471420002714
86GGTSSGH2753GGAGGAACTTCCAGCGGGCAC275447034000703
87AVLSQNI2755GCTGTGTTGTCTCAGAATATT275646963737959
88REEQKVW2757CGTGAGGAGCAGAAGGTTTGG275846944000694
89PTQGTNR2759CCCACACAAGGAACCAACCGC276046904000690
90VTTVSNV2761GTTACGACCGTGAGCAACGTT2762468520002685
91GNVNGGA2763GGTAACGTGAACGGAGGAGCG2764468504685
92SVDSGRL2765TCGGTCGACTCTGGACGTTTG276646794000679
93LSNHVPV2767TTATCCAACCACGTTCCGGTG2768464820002648
94ASTATLW2769GCGTCTACTGCTACTCTTTGG2770464715743073
95SDKPVNT2771TCCGACAAACCAGTCAACACA2772464331031540
96NVQTVST2773AACGTTCAAACCGTCTCAACT277446424000642
97RTSTDVV2775AGAACTTCCACAGACGTGGTA277646114000611
98KASDTPK2777AAAGCGTCCGACACACCGAAA277846024474127
99VQGPQNG2779GTGCAGGGTCCGCAGAATGGT2780460020002600
100GGTNSGH2781GGCGGAACCAACAGCGGCCAC278245994000599
101GNSGGRF2783GGGAATAGTGGGGGTCGTTTT2784459820002598
102LAGLGGG2785CTAGCTGGGCTGGGTGGCGGG2786457820002578
103QLRDTKT2787CAGCTGCGTGATACTAAGACT2788457733321245
104MGVAGVH2789ATGGGTGTTGCTGGAGTTCAC2790457220002572
105AVPGTYS2791GCGGTGCCTGGGACGTATTCT279245674000567
106MAAKSTP2793ATGGCTGCGAAGTCGACGCCG279445583666892
107QVRDTNT2795CAGGTGCGTGATACTAATACT279645553848707
108LHNLTQD2797CTTCATAATCTTACGCAGGAT2798453726111926
109PGNSASI2799CCAGGAAACTCCGCATCGATA280045154133382
110MGGGTNH2801ATGGGAGGTGGGACCAACCAC2802450845080
111TLANSQR2803ACACTAGCCAACAGTCAACGT2804449720002497
112SAETLRL2805TCTGCGGAGACGCTTAGGCTT280644914000491
113MGGGANP2807ATGGGGGGGGGTGCTAATCCG280844884000488
114GLAETRA2809GGGCTTGCTGAGACTAGGGCT2810446820002468
115AMSSTVG2811GCAATGAGTTCCACCGTTGGC281244614000461
116TGPQVSI2813ACTGGGCCGCAGGTTAGTATT2814445720002457
117SVDNGKR2815TCGGTGGATAATGGGAAGCGG2816445328751578
118NVSRDHS2817AACGTCTCCCGTGACCACAGT281844464000446
119KSNSENS2819AAGTCGAATTCGGAGAATAGT2820443120002431
120VSLNEGH2821GTGTCGCTTAATGAGGGGCAT2822442220002422
121VTADRTT2823GTGACTGCTGATCGGACTACG282443973818579
122VLGGTAG2825GTGCTGGGTGGTACTGCGGGG2826438520002385
123TAVNSTS2827ACGGCTGTGAATTCGACTTCG2828438519472438
124LSLTDVV2829CTGAGTTTGACTGATGTGGTT283043843943441
125PIHGASS2831CCGATTCATGGTGCTAGTTCG283243784000378
126GINAVGP2833GGGATTAATGCTGTTGGTCCG2834437020002370
127GSGTGVA2835GGTTCTGGGACGGGGGTGGCT2836436704367
128RQQQKLW2837CGTCAGCAGCAGAAGCTTTGG2838435819592399
129TGQTEMT2839ACGGGGCAGACTGAGATGACT2840434120002341
130VGNSSGV2841GTGGGCAACTCCTCTGGGGTT2842433420002334
131FNSGTGT2843TTCAACAGCGGGACTGGCACA284443334000333
132KLAEGIR2845AAGCTTGCTGAGGGGATTAGG2846432804328
133GNQGGTR2847GGTAATCAGGGGGGGACGCGT284843204000320
134GSESGVA2849GGTTCTGAGTCGGGGGTGGCT285043164000316
135TSSRPEE2851ACTTCTTCTCGGCCGGAGGAG2852431243120
136RSVGTSA2853CGTTCGGTGGGTACTTCGGCG285443124000312
137AVVLNAL2855GCGGTGGTGCTGAATGCTTTG285643024000302
138RTSTDVV2857AGGACGAGTACGGATGTTGTG2858430043000
139LRNTQLD2859TTGCGGAATACTCAGTTGGAT2860428420002284
140IPPGVPR2861ATTCCGCCTGGGGTTCCGCGT286242803607673
141MNGGHIL2863ATGAATGGGGGTCATATTCTG2864426813392930
142RYVGESS2865AGGTATGTGGGGGAGTCTTCG2866425126421610
143PNGVSVV2867CCTAATGGTGTTTCTGTGGTG286842473597650
144ASLGAYS2869GCGTCGCTTGGGGCGTATTCG2870424432121032
145GSGSVVA2871GGTTCTGGGTCGGTGGTGGCT287242389593279
146MGSNGQV2873ATGGGGTCTAATGGGCAGGTT2874423620002236
147TGVLINS2875ACGGGTGTTTTGATTAATTCG2876422814722756
148LSLTGGV2877TTGTCGCTCACCGGGGGAGTC2878420812102998
149SPEGRNV2879TCGCCCGAAGGCCGCAACGTA288041954000195
150TIPNLSR2881ACAATACCGAACTTATCGCGC2882418429111273
151KNGGHVQ2883AAAAACGGTGGGCACGTACAA288441804000180
152ADLAGSR2885GCGGATCTGGCGGGGTCTAGG288641734000173
153SYGSDSK2887TCTTATGGTTCTGATTCGAAG2888416428301334
154SLGADVG2889AGTTTGGGGGCGGATGTTGGG2890416420002164
155TVLTGSF2891ACCGTCCTCACCGGAAGCTTC289241364000136
156GNSGGRF2893GGCAACTCAGGCGGCAGGTTC2894411620002116
157DRGGSTV2895GACCGTGGGGGGTCCACCGTA2896410418822222
158AGYSGTT2897GCAGGTTACTCCGGTACGACG2898410418032301
159KGSDSPM2899AAGGGTTCTGATTCTCCTATG2900408504085
160DSHVSGY2901GACTCGCACGTATCAGGCTAC2902407904079
161KSHSEYS2903AAGTCGCATTCGGAGTATAGT2904407120002071
162PTQGTHP2905CCTACTCAGGGGACGCATCCG2906407028741196
163LEQSVAR2907CTCGAACAATCTGTCGCACGC2908405840580
164KEIRASV2909AAGGAGATTCGTGCGTCGGTT291040563266790
165NAINRMV2911AATGCGATTAATCGTATGGTG2912403010952935
166KIGENAS2913AAAATCGGCGAAAACGCGAGT2914401120002011
167KMGGGVS2915AAGATGGGTGGTGGGGTGTCG2916400740070
168EVGNVSR2917GAGGTGGGGAATGTGTCTCGT291840023303699
169RVTTHTQ2919CGTGTTACTACTCATACGCAG2920400040000
170AGFSNQT2921GCCGGATTCTCGAACCAAACT2922400040000
171QREAARI2923CAAAGGGAAGCTGCACGCATC2924400040000
172LSLNTKT2925CTATCCCTAAACACCAAAACC2926400040000
173RGGVSSV2927CGTGGTGGAGTTTCTAGTGTT2928400040000
174ENRVNNA2929GAAAACAGAGTGAACAACGCA2930400040000
175SNGGRVE2931AGTAACGGCGGACGCGTTGAA2932400040000
176GVKNTNI2933GGCGTCAAAAACACGAACATC2934400040000
177SDRTHAS2935TCGGATCGGACTCATGCGTCG2936400040000
178GSGSGVL2937GGTTCTGGGTCGGGGGTGCTG2938400040000
179AEGTDGV2939GCCGAAGGGACGGACGGCGTC2940400040000
180GGKGEGP2941GGTGGGAAGGGTGAGGGTCCG2942400040000
181TAVKVSG2943ACGGCTGTGAAGGTTAGTGGT2944400040000
182TTGEGIR2945ACTACGGGTGAGGGTATTCGT2946400040000
183LLASGAK2947CTGCTTGCGAGTGGGGCTAAG2948400040000
184SLGTMTL2949TCTCTTGGAACCATGACCCTC2950400040000
185RSEGTSA2951CGTTCGGAGGGTACTTCGGCG2952400040000
186TNGQASR2953ACTAATGGTCAGGCGTCTAGG2954400040000
187LGHKAGG2955TTGGGGCATAAGGCTGGTGGT2956400040000
188VVAGTYS2957GTGGTGGCTGGGACGTATTCT2958400040000
189SGEQIRL2959TCTGGGGAGCAGATTAGGCTT2960400040000
190RGGVTGE2961CGGGGGGGTGTTACTGGTGAG2962400040000
191VSSEAPL2963GTGTCTTCGGAGGCGCCGCTG2964400040000
192RGPDHKT2965AGAGGCCCCGACCACAAAACT2966400040000
193QAHGGPR2967CAGGCGCATGGGGGGCCTCGT2968400040000
194LLTDKRV2969CTTCTTACTGATAAGCGTGTG2970400040000
195TSPGAGL2971ACTTCTCCGGGTGCGGGGCTG2972400020002000
196LTDSRPV2973TTGACGGATAGTAGGCCTGTT2974400020002000
197RSEVNGV2975CGTTCCGAAGTAAACGGTGTG2976400020002000
198SPGLSIS2977AGTCCTGGGCTGTCTATTAGT2978400020002000
199SGSLVGA2979TCAGGGAGCCTAGTTGGTGCC2980400020002000
200MVDKPSE2981ATGGTTGATAAGCCTTCTGAG2982400020002000
201TPSAFPN2983ACTCCGTCGGCTTTTCCTAAT2984400020002000
202MSLNDGV2985ATGAGTTTGAATGATGGGGTT2986400020002000
203TTEAIVR2987ACTACTGAAGCGATCGTCCGC2988400020002000
204SLVNASF2989AGTCTGGTTAATGCTTCGTTT2990400020002000
205LSNHRPV2991CTAAGCAACCACCGACCAGTG2992400020002000
206GSGSGVL2993GGCTCGGGATCAGGTGTACTC2994400020002000
207AMSSTVG2995GCGATGTCGAGTACTGTGGGT2996400020002000
208GMVTNHV2997GGCATGGTTACTAACCACGTT2998400020002000
209KMGGGVS2999AAAATGGGCGGGGGTGTTAGC3000400020002000
210KSDRGVV3001AAGTCGGATCGTGGGGTTGTT3002400020002000
211QLRDTKT3003CAACTTCGCGACACGAAAACG3004400020002000
212VANGFPR3005GTCGCTAACGGCTTCCCCAGA3006400020002000
213YVNGATE3007TATGTGAATGGGGCGACTGAG3008400004000
214QVTDNKT3009CAAGTCACCGACAACAAAACG3010400004000
215PGHGPER3011CCTGGTCATGGTCCGGAGAGG3012400004000
216MLGQAGG3013ATGCTGGGTCAGGCTGGGGGG3014400004000
217APDSTVR3015GCGCCGGATAGTACTGTGCGG3016400004000
218SHSSDSK3017TCTCATAGTTCTGATTCGAAG3018400004000
219GLHGQSA3019GGCCTCCACGGACAATCCGCC3020400004000
220NLGVGQM3021AACTTAGGAGTAGGCCAAATG3022400004000
221FISSTMR3023TTTATTTCTAGTACGATGCGT3024400004000
222DGTQSGR3025GACGGCACACAATCCGGCAGG3026400004000
223MAGKSSP3027ATGGCGGGAAAAAGTTCTCCA3028399020001990
224LSTGAQM3029CTTTCGACGGGTGCGCAGATG3030398919792010
225NGGSEKR3031AATGGTGGTTCTGAGAAGCGT303239813293688
226EDRSTTP3033GAGGATCGGTCGACTACTCCT3034397439740
227GSVSSTK3035GGCTCGGTATCCTCAACAAAA3036397439740
228KNPGVDP3037AAGAATCCGGGGGTGGATCCT303839613476485
229SPSALPN3039TCACCCTCAGCCCTCCCGAAC3040395920001959
230LPSLTGG3041CTTCCGAGTTTGACTGGGGGT304239589433015
231TDRSDKG3043ACTGATAGGTCTGATAAGGGG3044395618442112
232AQQGSTL3045GCTCAGCAGGGGTCTACGCTG3046395319532000
233VLESNPR3047GTGCTTGAGTCGAATCCGCGG304839523385567
234RIVGSDP3049AGGATTGTGGGTAGTGATCCG3050395203952
235LSMTHGV3051CTGAGTATGACTCATGGGGTT3052394863942
236KLAERIR3053AAACTCGCGGAACGTATCCGG305439433711232
237VSAGLGI3055GTCTCGGCAGGTTTAGGAATC3056393839380
238NSKDVLR3057AACTCGAAAGACGTCCTCCGA305839373729208
239SMQSPST3059TCTATGCAGTCGCCTAGTACG306039353478457
240TLNSATT3061ACGCTTAATTCGGCGACTACT3062392720001927
241SGESLRL3063TCTGGGGAGTCGCTTAGGCTT3064392439240
242QVRDIKT3065CAGGTGCGTGATATTAAGACT3066391920001919
243QGGNAMR3067CAGGGTGGGAATGCTATGCGT3068391419142000
244MLGGGES3069ATGTTGGGTGGTGGGGAGTCG3070391220001912
245PTQGTLR3071CCGACACAAGGTACACTACGC3072391220001912
246VIAGLAI3073GTAATCGCCGGACTCGCCATC307439113697214
247KEISVSV3075AAAGAAATCTCTGTATCTGTG307639053522383
248LSANVRT3077CTGTCGGCGAATGTTCGGACT3078390418612044
249KGSDNPM3079AAGGGTTCTGATAATCCTATG308039023427475
250GAPSGSL3081GGAGCTCCGTCAGGATCCCTT3082390015382362
251TSGNAGL3083ACAAGCGGGAACGCGGGCCTC3084389938990
252REQQKLW3085AGGGAACAACAAAAATTATGG308638983425473
253NVTGVVL3087AACGTAACTGGAGTTGTCCTT308838962913983
254MESVTQG3089ATGGAAAGCGTAACCCAAGGA3090388438840
255TTVKVSP3091ACGACTGTGAAGGTTAGTCCT3092387638760
256GLPDTMA3093GGTCTGCCAGACACGATGGCC3094386903869
257QSQTADA3095CAGTCTCAGACGGCTGATGCT3096386138610
258TLTLSMR3097ACACTTACGCTTTCAATGAGG3098385403854
259GNPGSHS3099GGGAATCCGGGTTCTCATAGT3100384920001849
260SGNQPRM3101TCTGGGAATCAGCCTAGGATG3102383024661364
261SHGSDLK3103TCTCATGGTTCTGATTTGAAG310438283516312
262SLGEGRH3105TCGTTGGGTGAGGGTCGGCAT3106381620001816
263AGISSQP3107GCTGGCATCAGCTCACAACCA310838123288524
264SLGEARP3109TCTTTAGGGGAAGCGCGTCCC311038113390421
265IYSDGSS3111ATTTATAGTGATGGGTCGTCT3112380714262381
266RVSLAVK3113AGGGTGTCGCTGGCTGTGAAG311438023456346
267HGTGNTY3115CACGGAACCGGTAACACATAC3116379920001799
268TSERGSL3117ACGAGTGAGAGGGGGTCGCTG311837903071718
269ASPQVSL3119GCGAGTCCGCAGGTGTCGCTT3120378917892000
270VVQDPGR3121GTTGTTCAGGATCCTGGGCGT3122378137810
271QGGDSGG3123CAGGGTGGTGATAGTGGGGGT3124378120001781
272TADARAL3125ACAGCGGACGCGCGCGCTTTG3126377837780
273TNGHSQV3127ACTAACGGACACAGCCAAGTC3128376314562307
274RNQAEEM3129CGTAACCAAGCCGAAGAAATG3130376237620
275VAVSSNK3131GTTGCTGTTTCTTCGAATAAG3132375915512208
276KTAQVQP3133AAAACCGCTCAAGTCCAACCT313437503483267
277AVGSDGV3135GCTGTGGGTTCTGATGGTGTT3136374817482000
278LNSSTQR3137TTGAATTCTAGTACGCAGCGT3138373817382000
279QMKSRSD3139CAGATGAAGTCTCGTTCTGAT314037273470257
280SVQITGL3141TCTGTGCAGATTACTGGTTTG3142372713382389
281STLQGVA3143AGTACGCTGCAGGGTGTGGCT3144372637260
282KLAEGVR3145AAGCTTGCTGAGGGGGTTAGG3146372320001723
283TDKQNAF3147ACAGACAAACAAAACGCCTTC3148372016642056
284REIRVSV3149CGAGAAATCAGAGTATCCGTC3150371017102000
285QVRDAKT3151CAGGTGCGTGATGCTAAGACT3152369820001698
286VEGPTTN3153GTCGAAGGGCCTACAACGAAC3154369403694
287SDRTAVV3155AGTGATCGGACGGCGGTTGTT315636937342959
288QSDNHGR3157CAGTCGGATAATCATGGTAGG315836923163529
289ETQGRQF3159GAGACTCAGGGTCGTCAGTTT3160367736770
290PSVSTLS3161CCGTCGGTCAGCACACTGTCG316236763367309
291RGQSDPA3163CGGGGGCAGTCTGATCCGGCG3164365236520
292AKESGLM3165GCAAAAGAATCCGGCCTAATG3166365203652
293AEGRSAM3167GCGGAGGGGAGGAGTGCTATG3168364820001648
294RLNEHVA3169CGGTTGAACGAACACGTTGCC3170364020001640
295TGASTFV3171ACCGGAGCCAGTACATTCGTA3172364016901950
296ALIRDNV3173GCTCTGATTCGTGATAATGTT3174362820001628
297PHAATPG3175CCTCATGCTGCGACTCCTGGG3176362516252000
298RSAGTSS3177CGTTCGGCGGGTACTTCGTCG3178362436240
299SKAEGPV3179TCGAAGGCTGAGGGTCCGGTT3180362036200
300LSLRDGV3181CTCTCGCTCCGGGACGGAGTC3182361920001619
301TVSEQPR3183ACGGTTTCGGAGCAGCCGCGT3184361636160
302LSLGSQL3185CTGTCTCTGGGGTCGCAGCTG3186360613802226
303NGTAGDR3187AACGGAACCGCAGGGGACCGC318836043347257
304VQVSSVA3189GTTCAGGTTTCTTCTGTGGCT3190360116012000
305SDGKTHP3191TCGGATGGTAAGACTCATCCG3192360111902411
306NEPSVNT3193AACGAACCCTCGGTAAACACG3194359935990
307DVADSKR3195GATGTTGCTGATTCTAAGCGT319635903097493
308GVAGTYS3197GGCGTCGCAGGTACCTACAGT3198358335830
309NIKDVNR3199AATATTAAGGATGTTAATAGG3200357720001577
310DLVLLHR3201GATCTTGTTTTGTTGCATAGG3202356803568
311SNGTVTI3203TCAAACGGGACTGTAACAATA320435593230329
312LANTLSV3205CTGGCTAATACGTTGAGTGTT3206355635560
313PTQVTLR3207CCTACTCAGGTGACGCTTCGG3208354835480
314ILGGLAV3209ATATTGGGCGGCCTAGCCGTG3210352915292000
315TAVHSAS3211ACGGCTGTGCATTCGGCTTCG321235263141385
316TSLSQDR3213ACGAGTTTGTCGCAGGATAGG3214352403524
317LDTTARL3215CTTGATACTACTGCTCGTCTT3216351715172000
318GASDQLS3217GGTGCTTCGGATCAGCTTTCT3218351435140
319EKVAPTP3219GAGAAGGTTGCTCCGACGCCT3220351420001514
320AAGGQVL3221GCGGCGGGTGGGCAGGTTCTG3222351235120
321PGDRSPS3223CCGGGTGATCGTTCGCCTTCT3224350835080
322DALSRMA3225GACGCATTGTCACGTATGGCT3226349803498
323TAGQVSK3227ACTGCGGGTCAGGTGTCTAAG3228349503495
324LAKDSGG3229TTGGCTAAGGATTCTGGGGGG3230348811632325
325SAGRADL3231AGTGCAGGTAGAGCCGACCTC3232348234820
326TTAAIVS3233ACAACAGCTGCGATAGTATCC323434743117357
327SMGQKEL3235AGCATGGGCCAAAAAGAACTA3236347420001474
328PQKGGGV3237CCTCAGAAGGGTGGTGGGGTG3238347220001472
329GAVSSIK3239GGGGCCGTGAGTAGTATCAAA3240347211502322
330LSSGVSK3241CTCTCATCGGGTGTATCCAAA3242346615241942
331SVGLTNG3243AGTGTGGGTCTGACGAATGGT324434657802685
332VKQTDVA3245GTTAAGCAGACGGATGTTGCT3246346134610
333EAVRLSA3247GAAGCCGTACGGCTGTCCGCA3248346113532108
334GTLTSGY3249GGCACACTCACGAGCGGATAC3250346020001460
335KASDTPK3251AAGGCTTCTGATACTCCTAAG3252345734516
336RGGVTGE3253CGTGGTGGCGTGACAGGAGAA3254345034500
337AVLAGYR3255GCAGTCCTTGCAGGCTACCGT3256344834480
338GTYSTSL3257GGTACGTACAGCACCAGCCTC3258344820001448
339TSLGLVV3259ACATCTCTTGGGTTGGTAGTT3260344534450
340QVRDTKI3261CAGGTGCGTGATACTAAGATT3262344534450
341SMGQKEL3263TCTATGGGGCAGAAGGAGCTT326434373247190
342KEAPHGV3265AAAGAAGCCCCCCACGGTGTA3266343110772354
343KSLSENS3267AAATCATTGTCCGAAAACAGC326834223152270
344SEVSKGK3269AGTGAGGTTAGTAAGGGTAAG327034203087333
345RTQTDLG3271CGGACGCAGACGGATCTTGGT3272341434140
346TTLGATA3273ACGACGTTGGGTGCTACGGCT3274341020001410
347ITSGTGT3275ATTACTAGTGGTACGGGTACT327634083290118
348QTQTAVR3277CAGACTCAGACGGCTGTTCGT3278340820001408
349LADNHGR3279TTGGCTGACAACCACGGCCGT3280340714072000
350NQINAGV3281AACCAAATCAACGCCGGCGTG3282339913992000
351SHGSDSR3283TCTCATGGTTCTGATTCGAGG328433852963421
352SQKEVAT3285AGCCAAAAAGAAGTTGCCACC3286338333830
353KEGAVYV3287AAGGAGGGGGCGGTGTATGTG3288338016151765
354VSGSISK3289GTTTCGGGGAGTATTTCTAAG3290337012302140
355VSPGKLH3291GTCAGTCCTGGCAAACTCCAC3292336720001367
356GETNTNI3293GGGGAAACAAACACCAACATC3294336533650
357TYGSGPT3295ACTTATGGTTCTGGGCCTACT3296336213622000
358SSGQLGV3297TCCTCCGGCCAATTAGGCGTT329833602800560
359WAISDGY3299TGGGCTATATCAGACGGATAC3300336019311429
360HAEGARS3301CATGCGGAGGGTGCTCGTAGT3302335633560
361SGEALRL3303TCTGGGGAGGCGCTTAGGCTT3304335503355
362SMGIGAV3305TCTATGGGGATTGGTGCGGTT3306334733470
363MVKQQLT3307ATGGTCAAACAACAACTCACC330833472974373
364LTSGQAV3309TTGACATCTGGCCAAGCAGTC3310334710672280
365LSLTNGV3311TTGAGCCTTACAAACGGTGTG3312334633460
366SGANLSI3313TCAGGAGCAAACCTCAGCATC3314334520001345
367SHLNTTP3315TCGCATTTGAATACTACTCCT3316334219991343
368VMSGTSH3317GTGATGTCAGGCACGTCTCAC3318333913392000
369REQQKIW3319CGTGAGCAGCAGAAGATTTGG3320333222961037
370DGQRLGA3321GACGGCCAAAGATTAGGTGCG3322333118481483
371VLASPGH3323GTACTAGCGAGTCCGGGACAC3324332803328
372NSKDGHR3325AACTCAAAAGACGGACACCGT3326332633260
373KNGGHVL3327AAAAACGGCGGCCACGTGTTG332833263162164
374ALSGLAR3329GCCCTCTCAGGCTTAGCACGT3330332320001323
375TSGNAGL3331ACTTCTGGTAATGCTGGGCTT333233222722600
376AGSDYTV3333GCTGGTAGTGATTATACTGTG3334332113251996
377TQIETRR3335ACGCAGATTGAGACGAGGCGG3336331733170
378LISSTQR3337TTGATTTCTAGTACGCAGCGT3338331320001313
379KIAEGIR3339AAGATTGCTGAGGGGATTAGG3340331003310
380PSPGSQL3341CCGTCTCCGGGGTCGCAGCTG3342330703307
381RVESADL3343CGCGTAGAATCCGCAGACCTC3344330120001301
382KGSDSPK3345AAGGGTTCTGATTCTCCTAAG334632932985308
383TVGHADK3347ACGGTTGGGCATGCTGATAAG334832909592332
384VDRSGIP3349GTAGACCGTTCTGGAATACCA3350328903289
385QVRDTKT3351CAGGTGCGTGATACTAAGACT3352328812882000
386RNNVETT3353CGTAACAACGTGGAAACAACA3354328132810
387AVISMTP3355GCCGTAATATCTATGACCCCG3356327312732000
388ATEKAVR3357GCGACGGAGAAGGCTGTGAGG335832692811458
389SADVTAR3359AGTGCTGATGTGACGGCGAGG3360326712672000
390GTGGAVR3361GGAACCGGCGGGGCAGTGCGC3362326620001266
391RTMGDST3363CGTACAATGGGCGACTCAACG3364325732570
392LGDSAKP3365CTAGGCGACTCAGCCAAACCT3366325720001257
393VTQGTSL3367GTTACGCAGGGGACTTCTCTG3368325214351817
394IAVGLTV3369ATTGCGGTGGGGCTGACTGTT3370325103251
395FTPGTAT3371TTTACTCCTGGTACGGCTACT3372323520001235
396LGDSASP3373CTTGGGGATTCTGCTTCGCCG3374322412242000
397VGVPTTN3375GTAGGAGTACCCACCACGAAC3376321712172000
398LLADKRA3377CTTCTTGCTGATAAGCGTGCG3378321510042211
399TDGRGDR3379ACGGATGGTCGGGGTGATAGG338032127792433
400NLIKPFL3381AATTTGATTAAGCCTTTTCTT3382320920001209
401EIALTVH3383GAAATAGCACTGACAGTACAC3384320703207
402NSKDVQR3385AATAGTAAGGATGTTCAGAGG338632062470736
403DLVLLHR3387GACTTAGTCCTCCTTCACCGA3388319903199
404MSVQDRG3389ATGTCCGTTCAAGACCGAGGC3390319903199
405DRSTTVP3391GATCGTAGTACGACGGTTCCT3392319731970
406LSRGSQL3393TTAAGTCGCGGTTCACAACTT339431972950247
407THLSSTR3395ACGCATCTGAGTAGTACTCGT3396319631960
408MNGGHAL3397ATGAACGGAGGGCACGCATTG3398319331930
409SPSAFPI3399TCGCCTAGTGCATTCCCAATC3400319203192
410DYGSTGR3401GATTATGGTTCTACGGGGCGG3402318531850
411QVAQQGA3403CAAGTCGCACAACAAGGCGCT340431842792392
412IGSGVLA3405ATTGGTAGTGGTGTTCTTGCT3406318403184
413VGHGGVD3407GTGGGTCATGGTGGTGTGGAT3408317611762000
414AVGSTVK3409GCGGTTGGGTCGACGGTGAAG3410317031700
415SGEQLRI3411TCTGGGGAGCAGCTTAGGATT3412316820001168
416RNNVDST3413CGGAATAATGTTGATTCTACG3414316431640
417AVGTAIG3415GCGGTAGGCACGGCAATCGGC3416316020001160
418VAGLGGL3417GTTGCGGGTTTGGGGGGGCTT3418315403154
419RTSAGVV3419AGGACGAGTGCGGGTGTTGTG3420315131510
420FTSGTGN3421TTCACGAGCGGAACAGGCAAC3422314931490
421QREATRI3423CAGCGTGAGGCGACGCGGATT3424314020001140
422LNNPVQV3425CTGAATAATCCTGTGCAGGTT3426313703137
423LISTTLR3427TTGATTTCTACTACGCTGCGT3428313603136
424GGTVSGH3429GGAGGGACTGTATCTGGACAC3430313420001134
425AGTLYAR3431GCGGGGACTCTGTATGCTCGT3432313420001134
426GQSSNLH3433GGACAAAGTAGCAACTTGCAC3434313203132
427DVSGSVI3435GACGTAAGCGGCTCCGTAATC3436313020001130
428KLAEGVR3437AAATTGGCAGAAGGAGTCAGA3438312320001123
429SGLQVSI3439TCAGGATTGCAAGTGTCGATA3440312031200
430QTGAIVV3441CAAACAGGAGCAATCGTTGTC3442311316711442
431LEANVSH3443CTGGAAGCTAACGTGAGTCAC3444311220001112
432ALSGLSK3445GCTCTTTCTGGTCTTTCTAAG3446310803108
433MGKQTTL3447ATGGGGAAGCAGACTACGCTG344831072664443
434HHSQYGA3449CACCACTCGCAATACGGCGCT3450310603106
435TGLQGSI3451ACCGGCCTTCAAGGATCTATA3452310131010
436LISGEKT3453TTGATTTCTGGTGAGAAGACG3454310120001101
437RSASGNE3455CGGAGTGCTAGTGGTAATGAG3456309930990
438SEKSVPL3457TCCGAAAAAAGCGTACCACTG3458309720001097
439VLDSRSP3459GTTCTGGATAGTAGGAGTCCG346030962423673
440QGGNSGR3461CAAGGAGGCAACTCAGGTAGG3462309510952000
441SAVASGK3463TCGGCGGTGGCGTCGGGTAAG3464309420001094
442AQGPQTG3465GCGCAGGGTCCGCAGACTGGT3466309210922000
443MNVGNVL3467ATGAACGTTGGGAACGTGCTC3468309203092
444VLGGTGK3469GTCCTCGGAGGTACCGGTAAA347030892941148
445KSHSEIS3471AAATCCCACAGTGAAATCAGC3472308803088
446SDSRVSY3473AGTGACTCCCGAGTATCGTAC3474308703087
447YTAGSMA3475TATACGGCGGGGTCTATGGCG3476308610862000
448TRFDGSG3477ACGCGTTTTGATGGTTCGGGT3478308410842000
449QREAERI3479CAGCGTGAGGCGGAGCGGATT3480307720001077
450KNPAVDP3481AAAAACCCCGCAGTCGACCCG3482307630760
451FSSETLT3483TTCAGCTCCGAAACCTTGACC3484307215941478
452YGNSGVI3485TACGGCAACTCGGGGGTCATA348630626942369
453KNPGADP3487AAAAACCCTGGTGCCGACCCC3488305830580
454LAIAGTM3489CTTGCTATTGCGGGGACTATG3490305703057
455SNLGNTS3491TCGAATCTGGGTAATACTAGT3492305210522000
456PIQLGQA3493CCGATTCAGTTGGGTCAGGCT3494305120001051
457KTETGYE3495AAGACTGAGACTGGTTATGAG3496305120001051
458VTKVSHV3497GTCACAAAAGTAAGTCACGTC3498304620001046
459AGGGVPR3499GCAGGAGGCGGCGTCCCACGT3500304318751168
460ADKGGVA3501GCTGATAAGGGGGGTGTGGCT350230422903139
461SEGISRY3503TCAGAAGGCATATCTCGGTAC3504304003040
462LDHGGVD3505CTCGACCACGGAGGAGTAGAC3506303820001038
463NEQSVKT3507AACGAACAAAGCGTTAAAACC3508303720001037
464SARDMTR3509AGCGCCCGCGACATGACTCGT3510303420001034
465TSVGMQV3511ACGTCGGTTGGGATGCAGGTT351230337382295
466VQAGKEL3513GTGCAGGCTGGTAAGGAGTTG3514303303033
467NASAGDR3515AATGCGAGTGCGGGTGATCGT3516302830280
468AAGVILK3517GCGGCGGGTGTTATTCTGAAG3518302830280
469SGAEGGR3519TCTGGTGCTGAGGGTGGTCGG3520302830280
470NRQEHSN3521AACCGCCAAGAACACAGCAAC3522302820001028
471AADGSVR3523GCCGCAGACGGAAGTGTTAGG3524302820001028
472SGANLSM3525TCTGGTGCGAATTTGTCTATG3526302710272000
473HSSGWTS3527CACTCGAGTGGATGGACCAGC3528302203022
474NLGVVQP3529AATTTGGGTGTGGTTCAGCCG3530302203022
475HSTGAEK3531CATTCGACGGGTGCGGAGAAG3532302030200
476NLSISER3533AATTTGTCGATTTCTGAGCGG3534301803018
477TNLADTA3535ACTAATCTGGCTGATACTGCG3536301320001013
478RSSGTSA3537AGATCATCCGGGACCTCAGCA3538301130110
479KNGGHVL3539AAGAATGGGGGTCATGTTCTG354030107882222
480GSSGGHF3541GGAAGCTCGGGTGGACACTTC3542300520001005
481ISHSESV3543ATTAGTCATTCGGAGAGTGTG3544300503005
482VTGVSRV3545GTTACGGGTGTGAGTCGTGTG3546300320001003
483QGGNSGA3547CAGGGTGGTAATAGTGGGGCT3548300303003
484AMPTSGH3549GCGATGCCGACGAGTGGGCAT3550300003000
485TLTNGMP3551ACCTTGACCAACGGTATGCCA3552299902999
486SHGTDSK3553TCCCACGGAACGGACAGTAAA3554299829980
487WSDRESR3555TGGTCTGACCGCGAATCTAGG3556299717591238
488KEIRVSV3557AAAGAAATAAGGGTCTCCGTG3558299229920
489FAGVTQA3559TTTGCGGGGGTTACGCAGGCG3560298902989
490DSHVSGV3561GACTCTCACGTATCCGGAGTG3562298816541334
491LLKESTP3563CTCCTTAAAGAAAGTACACCT3564298402984
492ADREVRY3565GCGGATCGGGAGGTGCGTTAT3566298202982
493MNGGHGL3567ATGAATGGGGGTCATGGTCTG356829802000980
494SLRDVEG3569TCGCTGCGTGATGTGGAGGGT357029792000979
495SKSGVVA3571AGTAAGTCTGGTGTGGTGGCG3572297915751404
496RNEGSVP3573CGAAACGAAGGCTCGGTCCCT357429782000978
497NLQGNAL3575AACTTACAAGGCAACGCGCTA357629772000977
498YSTTAGM3577TATTCGACTACGGCTGGTATG3578297302973
499AADGSVR3579GCGGCGGATGGTTCTGTGCGG358029722000972
500VGNMLSV3581GTCGGGAACATGCTATCTGTG358229702000970
501KEYITAV3583AAGGAGTATATTACGGCTGTG3584296929690
502ANAGMSR3585GCGAATGCTGGGATGTCTAGG358629692000969
503HTVEGAL3587CATACGGTTGAGGGGGCGCTG358829672000967
504NHQSLVN3589AACCACCAATCGCTCGTTAAC359029632000963
505VSGTLLA3591GTATCCGGCACGTTACTGGCA3592296302963
506QSRPDAL3593CAGAGTCGTCCGGATGCTCTT359429582000958
507AGVVNGL3595GCGGGGGTGGTGAATGGTTTG359629562000956
508RGGETSE3597CGGGGGGGTGAGACGTCTGAG3598295329530
509LSLTVGV3599CTGAGTTTGACTGTTGGGGTT360029522000952
510HISSLAM3601CACATATCCTCCCTTGCCATG3602295102951
511AFSGGET3603GCCTTCAGCGGTGGTGAAACG3604294817661182
512LRGTENQ3605TTGCGTGGGACGGAGAATCAG3606294842944
513QSQTAVD3607CAATCACAAACAGCAGTCGAC3608294602946
514ASSATLL3609GCGTCTAGTGCTACTTTGTTG361029452000945
515GQALVSS3611GGTCAAGCTTTAGTGTCGAGT3612294502945
516TAVHSTS3613ACGGCTGTGCATTCGACTTCG361429432000943
517KNPGLDH3615AAAAACCCAGGACTAGACCAC361629402729211
518KSGLLID3617AAAAGCGGCCTTCTTATAGAC3618293729370
519SGVTPLR3619TCGGGAGTAACTCCACTCCGT3620293102931
520REEQKVW3621AGAGAAGAACAAAAAGTCTGG362229262000926
521LSQGSQM3623CTTAGTCAAGGATCCCAAATG362429262000926
522LSLTATS3625CTGTCTCTGACGGCTACGTCT3626292618671059
523KGSDTPK3627AAGGGTTCTGATACTCCTAAG3628292602926
524SKPENAL3629TCGAAACCCGAAAACGCACTA363029252000925
525GGTNSAH3631GGGGGTACGAATAGTGCTCAT363229242000924
526FSTDTLS3633TTCAGCACCGACACCTTATCG3634292402924
527SVDVTAK3635AGTGTTGATGTGACGGCGAAG363629179172000
528VAQGSVV3637GTGGCTCAGGGGTCGGTTGTT363829162000916
529TSGSGTS3639ACTTCTGGTTCTGGTACGTCG3640291602916
530KEVRVSV3641AAGGAGGTTCGTGTGTCGGTT3642290929090
531RVDSVQL3643AGAGTTGACTCAGTTCAACTG364429092000909
532TGVQTAV3645ACTGGAGTCCAAACCGCCGTC3646290829080
533AADSTER3647GCGGCGGATAGTACTGAGCGG364829082000908
534GEAGKYS3649GGGGAGGCTGGGAAGTATTCT3650290629060
535AGGGSPR3651GCCGGAGGCGGATCGCCTCGT365229032000903
536GEAGTNS3653GGTGAAGCCGGCACAAACTCG365429022000902
537RVDSSQI3655AGGGTGGATTCGTCGCAGATT3656290202902
538YTAGSMA3657TACACTGCTGGCAGCATGGCC365829012000901
539MLGAGVS3659ATGTTGGGTGCTGGGGTGTCG3660290118791022
540QADNNGR3661CAGGCGGATAATAATGGTAGG366229002000900
541TLHDKVL3663ACCTTGCACGACAAAGTCTTA366428992000899
542VTKTLPQ3665GTGACCAAAACTTTGCCGCAA3666289902899
543QSLTDRV3667CAGAGTTTGACTGATCGGGTT3668289712901607
544ANRNESD3669GCTAATCGTAATGAGAGTGAT3670289202892
545TSHDTLV3671ACGTCGCATGATACGTTGGTT3672288628860
546SEGLTRY3673TCTGAAGGCCTCACCAGGTAC3674288428840
547SHGADSK3675TCACACGGGGCCGACAGCAAA367628792615263
548VLASTGH3677GTCTTGGCGAGCACCGGGCAC367828782000878
549NHLSDRL3679AATCATCTTAGTGATCGTTTG3680287428740
550MGRTDGL3681ATGGGAAGGACGGACGGATTA368228729271945
551VSTERGT3683GTGAGTACTGAGCGGGGGACT368428712432439
552SGHKAGV3685AGTGGTCACAAAGCAGGGGTG368628672000867
553TSAEYNL3687ACGAGTGCGGAGTATAATTTG368828642000864
554RSSETVA3689CGTTCATCTGAAACCGTGGCA369028642000864
555NALSVKT3691AATGCGCTGTCTGTGAAGACT3692286028600
556KTEQVQP3693AAGACGGAGCAGGTGCAGCCG369428602000860
557RTLHDDT3695AGAACACTACACGACGACACG369628592000859
558QSVSYLK3697CAGAGTGTGTCGTATCTGAAG369828592000859
559ASGSAVA3699GCTTCTGGGTCGGCGGTGGCT370028572000857
560MVTQQLK3701ATGGTGACGCAGCAGTTGAAG3702285602856
561KNSGVDP3703AAAAACTCTGGCGTCGACCCA3704285428540
562GGPAEGR3705GGAGGGCCAGCCGAAGGAAGG370628542000854
563VKTSDRT3707GTTAAGACGTCGGATAGGACG370828532000853
564NGVTLQV3709AACGGGGTAACCCTACAAGTA371028538532000
565LSVSQSA3711CTGAGTGTTTCTCAGTCGGCG3712285302853
566TRLQEGT3713ACACGTCTCCAAGAAGGCACC371428482000848
567LSRGEEI3715CTTTCGAGGGGTGAGGAGATT3716284702847
568SLGNSDH3717TCGTTGGGGAATTCGGATCAT371828432000843
569LAGVAQA3719CTAGCTGGCGTGGCTCAAGCT3720283928390
570NGQTGKH3721AATGGGCAGACGGGGAAGCAT3722283842834
571VVTLGRQ3723GTGGTTACTCTGGGTCGTCAG372428362000836
572LNADTDR3725CTAAACGCAGACACTGACCGG372628332000833
573ASRLPQT3727GCGTCTCGGCTTCCTCAGACT3728283102831
574LTPGSQL3729CTGACTCCGGGGTCGCAGCTG3730283018281002
575KSSDTPM3731AAGAGTTCTGATACTCCTATG373228292000829
576QIQSRSD3733CAGATTCAGTCTCGTTCTGAT373428282000828
577NEIRVSV3735AATGAGATTCGTGTGTCGGTT3736282802828
578LQSGVLT3737CTTCAGTCGGGTGTTCTGACT3738282802828
579LSANVRN3739CTATCTGCCAACGTACGTAAC3740282628260
580ASVSSPH3741GCATCGGTCAGCTCCCCACAC374228262000826
581GGTINGH3743GGGGGTACGATTAATGGTCAT3744282617891037
582SLAGGTP3745AGCTTAGCAGGCGGCACGCCG374628222000822
583IGASVTL3747ATTGGGGCTAGTGTTACGCTT374828212000821
584GYGSGEA3749GGTTATGGGTCGGGGGAGGCT375028202000820
585MQKEGSP3751ATGCAGAAGGAGGGGTCGCCG375228172000817
586LSANLRT3753TTATCTGCAAACCTCAGAACG375428142000814
587LSANVRT3755CTCTCTGCAAACGTACGTACA375628138531960
588ASLLPQP3757GCGTCTCTGCTTCCTCAGCCT3758281302813
589GTGEIGM3759GGGACTGGTGAGATTGGTATG376028122000812
590LGHKPGV3761CTAGGTCACAAACCAGGGGTG376228112000811
591LNLTDGV3763CTGAATTTGACTGATGGGGTT3764281028100
592SEVSKGM3765AGTGAGGTTAGTAAGGGTATG376628102000810
593QSLTHGV3767CAGAGTTTGACTCATGGGGTT376828052003802
594VVQVPAR3769GTTGTTCAGGTTCCTGCGCGT377028058052000
595MNGGHAL3771ATGAATGGGGGTCATGCTCTG377228042000804
596GTLTLAY3773GGAACTCTCACGCTGGCCTAC3774280402804
597GAATSQI3775GGGGCAGCAACAAGCCAAATC377628032000803
598PTQGSLR3777CCGACACAAGGATCTCTACGT377828032000803
599VAGSSIL3779GTCGCCGGGAGTAGCATATTG378028032000803
600MLGGGMS3781ATGTTGGGTGGTGGGATGTCG378228022000802
601GVAGTFS3783GGGGTGGCTGGGACGTTTTCT378428012000801
602TIGHSQV3785ACCATCGGACACTCACAAGTC378627992000799
603QSQKDVG3787CAATCGCAAAAAGACGTAGGA378827981847951
604GGTNSGH3789GGGGGTACGAATAGTGGTCAT379027972000797
605GGVSSTK3791GGTGGTGTTTCTTCGACTAAG379227971922605
606QVRDNNT3793CAGGTGCGTGATAATAATACT379427952000795
607AGGGVPR3795GCGGGGGGTGGGGTTCCGAGG379627932500293
608ASVSSPP3797GCGTCGGTAAGTAGTCCCCCG379827932000793
609MNGSHVL3799ATGAATGGGAGTCATGTTCTG380027922000792
610VQHSQDN3801GTGCAGCATTCGCAGGATAAT380227922000792
611KGASDTL3803AAAGGCGCGTCTGACACCCTC380427887882000
612SMATGVK3805TCGATGGCGACGGGTGTTAAG3806278802788
613SVGSGLL3807TCGGTGGGGAGCGGTTTGCTC380827872000787
614GAGSGVA3809GGTGCTGGGTCGGGGGTGGCT381027867862000
615MGSGERL3811ATGGGGTCTGGGGAGCGGTTG381227852000785
616TGNDVRR3813ACCGGCAACGACGTAAGACGC381427851797988
617PNSGKDY3815CCCAACTCAGGCAAAGACTAC3816278317591024
618LNSSTLR3817CTCAACAGTTCTACACTCAGG3818277902779
619LGHKAGH3819TTGGGGCATAAGGCTGGTCAT3820277627760
620LADSKDR3821TTGGCTGATAGTAAGGATCGG3822277402774
621LENQSLG3823CTGGAGAATCAGAGTCTTGGT3824277102771
622QYVVSGV3825CAATACGTCGTCTCTGGCGTT382627692000769
623MNGGRVL3827ATGAATGGGGGTCGTGTTCTG3828276702767
624LSLTAGV3829CTGAGTTTGACTGCTGGGGTT3830276402764
625FGIASGA3831TTTGGTATTGCTAGTGGGGCG383227637632000
626EGGYSGA3833GAAGGGGGGTACTCAGGCGCG3834276302763
627VTLGATS3835GTAACACTCGGAGCGACCAGC3836275702757
628TGLQVGI3837ACTGGGCTGCAGGTTGGTATT3838275627560
629IYPQSST3839ATATACCCGCAAAGCTCGACA3840275602756
630NANSLME3841AATGCGAATTCTTTGATGGAG3842275502755
631LHDGNTR3843TTGCATGATGGGAATACGCGG3844275502755
632GYGSGLA3845GGTTATGGGTCGGGGCTGGCT3846275502755
633TIGHSQV3847ACGATTGGTCATAGTCAGGTT384827542630124
634VANSGLA3849GTGGCGAATTCTGGGCTGGCT385027542000754
635AGLLNAL3851GCGGGGTTGCTGAATGCTTTG385227542000754
636KNAGHVL3853AAGAATGCGGGTCATGTTCTG385427522000752
637PMSNTHP3855CCGATGTCGAATACTCATCCG3856274927490
638LAGSLPL3857CTTGCTGGTTCGCTTCCGTTG385827492000749
639SRLENIS3859AGTAGGTTGGAGAATATTAGT3860274802748
640HSEGVGR3861CATAGTGAGGGTGTTGGGCGG3862274402744
641GAPINSF3863GGAGCACCAATAAACTCTTTC386427422000742
642GLEPRVP3865GGGCTGGAGCCTCGTGTTCCT3866273802738
643PAREGNF3867CCTGCCAGGGAAGGCAACTTC386827362000736
644AAGGQVL3869GCTGCTGGGGGACAAGTCCTC3870273527350
645TSYDKLV3871ACGTCGTATGATAAGTTGGTT387227341995739
646ELNAVAR3873GAACTTAACGCAGTTGCTCGG387427332000733
647SLRHVEV3875AGCCTACGCCACGTTGAAGTC387627252000725
648AADSSGR3877GCGGCGGATAGTTCTGGGCGG387827242000724
649ANEVKHV3879GCGAATGAGGTTAAGCATGTG388027242000724
650DLAQSGR3881GATTTGGCTCAGAGTGGGCGT388227232000723
651GLAGTNT3883GGGCTGGCTGGGACGAATACT388427222002720
652SPSIGPV3885TCGCCTTCTATTGGTCCTGTG388627217212000
653SVAGLSR3887TCTGTTGCTGGTCTTTCTAGG3888272102721
654ADVHVKV3889GCGGATGTTCATGTGAAGGTG3890272027200
655IVKQGDI3891ATTGTTAAGCAGGGTGATATT3892272027200
656ASASGVA3893GCTTCTGCGTCGGGGGTGGCT3894271902719
657MGGGNIP3895ATGGGAGGTGGCAACATACCC3896271702717
658TPTSSTR3897ACACCAACATCCAGCACACGA389827162000716
659VAEKAMA3899GTAGCTGAAAAAGCTATGGCA3900271602716
660GSGSGAA3901GGTTCTGGGTCGGGGGCGGCT390227152000715
661NMQDGGM3903AACATGCAAGACGGCGGCATG3904271413561358
662QLRDNKT3905CAATTACGCGACAACAAAACG390627132000713
663HAGLGVI3907CATGCTGGTCTTGGTGTTATT3908271302713
664VNVSYRA3909GTGAATGTTTCGTATCGTGCT3910271211011611
665IRNDKGP3911ATCCGAAACGACAAAGGGCCT3912271127110
666ASLAQAV3913GCTAGTCTTGCACAAGCAGTT391427102000710
667TLSHAEL3915ACGTTGTCTCATGCTGAGCTG391627102000710
668TYSDGST3917ACCTACTCTGACGGTTCTACC3918271002710
669REKGVTV3919AGAGAAAAAGGGGTCACGGTC392027092000709
670ENRVYSP3921GAGAATCGTGTTTATTCTCCG3922270702707
671ATQGTLR3923GCTACTCAGGGGACGCTTCGG392427062000706
672KHVDTGA3925AAGCATGTGGATACGGGGGCG3926270402704
673LANKMSD3927CTGGCTAATAAGATGAGTGAT392827017012000
674TLVGVVS3929ACCCTCGTGGGGGTAGTCTCT3930269902699
675GVPGTNS3931GGTGTTCCCGGTACTAACTCC393226962000696
676RTDGADL3933CGTACGGATGGTGCGGATCTT3934269602696
677PTQGTLL3935CCGACACAAGGAACTTTGTTG393626912000691
678SHASDTK3937TCTCATGCTTCTGATACGAAG3938269026900
679AVVSAGP3939GCGGTGGTGTCTGCTGGGCCG394026872000687
680HNPQSLG3941CACAACCCTCAATCTCTCGGT3942268602686
681TSLGIML3943ACCAGCCTAGGAATAATGCTT3944268302683
682RPQGSES3945CGTCCTCAGGGTAGTGAGAGT394626821880802
683AVNNVTL3947GCTGTGAATAATGTTACTCTT3948268026800
684NGSAGNR3949AACGGTAGCGCTGGGAACCGC3950267926790
685VFGETRA3951GTTTTTGGTGAGACGCGTGCG395226762000676
686ASESSPS3953GCATCGGAATCAAGCCCATCT3954267502675
687NSVGASI3955AACAGTGTAGGAGCGTCAATC3956267402674
688DAGNQMG3957GATGCGGGGAATCAGATGGGG3958267226720
689GSGRDGL3959GGTAGTGGAAGAGACGGACTG3960267102671
690STVTGGP3961AGTACTGTTACTGGGGGTCCG3962267102671
691TSYDKMV3963ACGTCGTATGATAAGATGGTT396426702345325
692SGANLSN3965TCTGGTGCGAATTTGTCTAAT396626682009659
693VKSTEGT3967GTTAAATCCACGGAAGGAACA3968266626660
694GKDGHQM3969GGGAAGGATGGTCATCAGATG3970266602666
695LGQKAGV3971CTTGGCCAAAAAGCAGGAGTC3972266326630
696REQQKIW3973AGAGAACAACAAAAAATATGG397426632000663
697AGNLSVK3975GCGGGGAATCTGAGTGTGAAG3976266126610
698SHSLIEV3977TCTCATAGTCTGATTGAGGTG3978266102661
699HVSGASL3979CATGTTTCGGGTGCTTCTCTT3980265902659
700LNLKGVV3981TTGAATCTGAAGGGTGTGGTT398226572005653
701AHEARGD3983GCGCACGAAGCACGAGGGGAC398426571883774
702HANTAGV3985CATGCGAATACTGCTGGGGTG3986265602656
703TGAGGHP3987ACGGGTGCTGGTGGGCATCCT3988265402654
704LSRGQEM3989CTGTCACGAGGGCAAGAAATG399026532000653
705LSNHGHV3991CTGAGTAATCATGGGCATGTT399226522000652
706STHHTST3993TCTACACACCACACCTCAACC399426512000651
707QKISTVQ3995CAGAAGATTTCGACTGTGCAG3996265115361115
708GVRNTNV3997GGAGTCCGGAACACAAACGTA3998264826480
709LTVSLNK3999CTAACTGTATCTCTTAACAAA400026472000647
710LSNNGPV4001CTCTCTAACAACGGCCCCGTG400226452000645
711LNLKGVV4003CTTAACCTCAAAGGGGTCGTC400426451896749
712PKPSHGE4005CCGAAGCCTAGTCATGGTGAG4006264502645
713ETNRGSV4007GAAACAAACCGGGGATCCGTA400826442000644
714NLTSDKV4009AATCTGACGTCTGATAAGGTT401026422000642
715MEDRART4011ATGGAGGATAGGGCTCGGACT4012264202642
716GVAGTNS4013GGGGTGGCTGGGACGAATTCT401426412010631
717SLGQDKL4015AGTCTAGGCCAAGACAAATTG4016264026400
718GVGEGRA4017GGCGTCGGGGAAGGACGAGCC401826402000640
719ASLLPPT4019GCATCGCTCTTGCCCCCCACG4020263902639
720TALVLHK4021ACGGCTCTTGTTCTTCATAAG4022263802638
721AVSDHTV4023GCTGTTAGTGATCATACTGTG4024263726370
722SQSAIPN4025AGTCAGTCGGCTATTCCTAAT4026263602636
723MNGGHLQ4027ATGAATGGGGGTCATCTTCAG4028263526350
724KNGGNVL4029AAAAACGGTGGAAACGTGTTG403026352000635
725GAVSSTT4031GGTGCTGTTTCTTCGACTACG403226332000633
726STLNTST4033AGTACTCTTAATACTTCGACT403426312000631
727RSPNVGQ4035AGGTCGCCTAATGTTGGGCAG403626302000630
728PTQGTFR4037CCTACTCAGGGGACGTTTCGG403826301831799
729NHGSDSK4039AACCACGGGTCAGACAGCAAA404026282000628
730LPSGHLH4041CTTCCGAGTGGTCATCTTCAT404226282000628
731SEKVVAT4043TCGGAGAAGGTGGTGGCGACG404426252000625
732TVPNTVL4045ACCGTTCCCAACACAGTCCTG4046262502625
733LSIGQGH4047CTATCCATAGGACAAGGACAC4048262502625
734TSVLSQV4049ACGAGTGTTCTTTCGCAGGTT405026242000624
735VLSSHGP4051GTCTTATCGTCACACGGCCCA4052262402624
736TSQASSV4053ACGTCGCAGGCTTCGTCTGTG405426232000623
737NRSAGDR4055AACCGTTCGGCTGGCGACCGA405626222000622
738RGGVTTQ4057CGAGGAGGAGTAACAACACAA405826222000622
739SGLKGVN4059TCCGGACTAAAAGGTGTTAAC4060262202622
740STITNLM4061AGTACTATTACTAATCTGATG406226191884735
741KGASVTL4063AAGGGGGCTAGTGTTACGCTT4064261826180
742TSTHEGV4065ACTTCAACGCACGAAGGAGTT406626182000618
743KLGGGVS4067AAGTTGGGTGGTGGGGTGTCG406826182000618
744GLESRVP4069GGTTTAGAATCAAGAGTGCCC4070261626160
745QSLSDGV4071CAATCATTAAGCGACGGCGTC4072261426140
746ADAAHAL4073GCGGACGCCGCCCACGCGCTT4074261302613
747CAGGCEL4075TGTGCGGGTGGTTGTGAGCTT407626122000612
748PGERNNP4077CCAGGAGAACGTAACAACCCC407826121929683
749GVRNTDI4079GGAGTTCGGAACACAGACATC4080261202612
750STLHTSI4081AGTACTCTTCATACTTCGATT4082261102611
751AEVGSNR4083GCAGAAGTTGGCTCAAACAGG4084261002610
752NDRNTSS4085AACGACCGAAACACATCAAGT408626082000608
753GPVSSTK4087GGTCCTGTTTCTTCGACTAAG408826082000608
754VHGTGGA4089GTGCATGGTACTGGGGGTGCT4090260814881120
755VLASSGP4091GTATTGGCAAGTTCGGGCCCA4092260802608
756LAGSISL4093CTTGCTGGGTCGATTTCGTTG409426042000604
757SRTLEET4095TCTAGGACGCTTGAGGAGACT409626042000604
758KEIRVSV4097AAGGAGATTCGTGTGTCGGTT409826012000601
759HAVAGAT4099CATGCTGTGGCTGGGGCGACT410025972000597
760VDHGGVN4101GTAGACCACGGCGGGGTTAAC410225955952000
761TTATIVR4103ACCACTGCTACGATCGTACGC410425945942000
762SGEGLAS4105TCGGGTGAGGGGCTGGCTAGT410625932000593
763TVGLTIA4107ACCGTAGGCCTTACTATAGCA410825921729863
764DYDSGRR4109GACTACGACTCTGGACGTAGA4110258902589
765QGVTVGL4111CAGGGGGTTACGGTTGGGCTT4112258902589
766KSQSENS4113AAGTCGCAGTCGGAGAATAGT4114258725870
767SEGLSRD4115TCCGAAGGGCTGTCCAGAGAC411625872000587
768SGDGTSK4117TCGGGTGATGGGACTTCTAAG411825862000586
769DGSAGDR4119GATGGGAGTGCGGGTGATCGT412025862000586
770TDALTSK4121ACGGACGCACTCACAAGCAAA412225832000583
771QSQTTVG4123CAGTCTCAGACGACTGTTGGT412425831923660
772GSASAVA4125GGCAGCGCTTCAGCAGTAGCA4126258302583
773VTVTMSR4127GTCACGGTAACCATGTCACGG4128258102581
774GVGNTNI4129GGCGTAGGTAACACGAACATA413025792000579
775TIAHSQV4131ACGATTGCTCATAGTCAGGTT4132257902579
776VQGTQTG4133GTGCAGGGTACGCAGACTGGT413425782000578
777LRVTEIL4135CTTCGAGTCACTGAAATACTA4136257802578
778SSSGLVR4137AGTTCCTCTGGGCTAGTCCGA4138257702577
779LSLSKDK4139CTATCGTTATCTAAAGACAAA4140257402574
780TGISVNG4141ACCGGCATCTCTGTCAACGGT414225732000573
781SKSAFPN4143TCGAAATCCGCCTTCCCAAAC4144257125710
782VGNSSGV4145GTTGGTAATTCTTCGGGTGTG414625712000571
783DSHVSGD4147GACTCACACGTCTCCGGCGAC4148256914771092
784KVYDTPM4149AAGGTTTATGATACTCCTATG415025682000568
785GSMENVR4151GGGAGTATGGAGAATGTGCGT415225672000567
786ETTQGSP4153GAAACAACGCAAGGCAGTCCC415425652000565
787KGSGLEI4155AAGGGGTCTGGGCTTGAGATT4156256102561
788TALQVSI4157ACTGCGCTGCAGGTTAGTATT415825601976584
789PTQSDLA4159CCTACGCAGTCGGATCTTGCT416025582000558
790STQTLGE4161TCGACTCAGACTTTGGGGGAG416225581869690
791LSNRGPV4163CTTTCTAACAGAGGCCCGGTG4164255602556
792EHVNVKV4165GAGCATGTTAATGTGAAGGTG416625552000555
793PLKGGGE4167CCTCTGAAGGGTGGTGGGGAG416825541776778
794SHGSVSK4169TCTCATGGTTCTGTTTCGAAG4170255202552
795MLGGGVS4171ATGTTGGGTGGTGGGGTGTCG4172254913651184
796TAVHSTS4173ACTGCCGTACACAGCACGTCA417425482000548
797QADSHGR4175CAAGCAGACAGCCACGGCCGT417625481815733
798ATLKPDY4177GCAACGTTGAAACCCGACTAC4178254802548
799LDTSARV4179CTTGATACTAGTGCTCGTGTT4180254702547
800HTSGTSS4181CATACGAGTGGGACGTCGTCG418225442000544
801TGARDQY4183ACAGGGGCGCGTGACCAATAC418425441585959
802SGETLRL4185TCTGGGGAGACGCTTAGGCTT418625441566978
803VSLSDGV4187GTAAGCCTTTCGGACGGTGTG418825422000542
804AGVVNAL4189GCGGGGGTGGTGAATGCTTTG419025412231310
805DASKLVN4191GATGCGAGTAAGCTTGTGAAT419225392000539
806EIRLSTH4193GAAATAAGACTGTCCACCCAC419425392000539
807MGRTDGL4195ATGGGGCGTACTGATGGGTTG4196253615071029
808TGLLVSI4197ACTGGGCTGCTGGTTAGTATT4198253602536
809KAGLLFD4199AAAGCAGGCCTTCTATTCGAC420025322000532
810SRAEGIK4201AGTCGTGCGGAGGGGATTAAG420225322000532
811NGKVDRD4203AATGGGAAGGTTGATCGGGAT420425302000530
812SPPSSPR4205TCTCCGCCGAGTTCGCCGCGT420625301806724
813NGIAGDR4207AATGGGATTGCGGGTGATCGT420825272000527
814NRASDGI4209AACCGAGCTTCTGACGGGATA421025242000524
815GVELISR4211GGTGTTGAGCTGATTTCGCGG421225212000521
816RVQLSET4213CGGGTGCAGCTGTCTGAGACT421425212000521
817LNYSVSL4215TTAAACTACAGTGTAAGCCTC4216252010671454
818ASVSSKS4217GCTAGTGTGTCTTCGAAGAGT421825192000519
819TPSTGVL4219ACTCCTAGTACTGGGGTGCTG422025191899620
820HANTAGV4221CACGCAAACACAGCAGGTGTA422225182000518
821VVSVLNV4223GTGGTGAGTGTGCTGAATGTT422425182000518
822AHDHVKV4225GCGCATGATCATGTGAAGGTG422625181558960
823SSEGRNV4227TCGTCAGAAGGAAGAAACGTT422825172000517
824NNNGATS4229AATAATAATGGTGCGACTTCT423025161571945
825LSQGSQQ4231CTGTCTCAGGGGTCGCAGCAG4232251652511
826GGTTSGH4233GGCGGGACAACGAGCGGGCAC4234251502515
827TLASQEL4235ACTCTGGCGTCGCAGGAGCTG423625142000514
828ATGTESR4237GCGACTGGTACTGAGTCGCGG423825102000510
829NNANLVI4239AATAATGCTAATCTGGTTATT424025102000510
830STITTLK4241TCCACAATAACGACACTTAAA4242250902509
831QLNSADS4243CAGCTGAATTCGGCTGATAGT4244250713741133
832ETNFGVS4245GAGACTAATTTTGGGGTGTCT4246250502505
833ENVHVKV4247GAAAACGTGCACGTTAAAGTC424825002000500
834AGGGAPM4249GCCGGCGGAGGAGCACCCATG425025002000500
835PNESVRA4251CCTAACGAAAGTGTACGGGCA425224981811687
836LKPGLAD4253CTGAAGCCGGGGTTGGCGGAT4254249802498
837LANRLSV4255CTGGCTAATAGGTTGAGTGTT425624971615882
838FAGIAQA4257TTTGCGGGGATTGCGCAGGCG425824932000493
839RVHSAQH4259CGTGTCCACAGTGCTCAACAC426024911933558
840LSGLRSG4261TTGTCTGGTCTTCGTAGTGGT426224902000490
841SEGLSRL4263TCCGAAGGATTATCCCGACTT426424902000490
842RLLRDES4265AGGTTGCTACGAGACGAATCT4266249002490
843SPSAIPN4267TCTCCCAGTGCGATACCGAAC4268248802488
844VSRGAEL4269GTTTCGAGGGGTGCGGAGCTG4270248802488
845KGSDTPM4271AAAGGATCGGACACACCGATG427224872000487
846VDHGGVM4273GTGGATCATGGTGGTGTGATG427424872000487
847VNAALGI4275GTTAATGCTGCGCTTGGGATT427624872000487
848TMSMGKL4277ACGATGTCGATGGGGAAGCTG427824842000484
849INGGHDL4279ATCAACGGAGGCCACGACCTC428024842000484
850GTGSTIV4281GGGACGGGTTCTACGATTGTG428224842000484
851TNGGHVL4283ACTAACGGCGGACACGTGCTC4284248414441040
852LISSTMR4285TTGATTTCTAGTACGATGCGT4286248412751209
853AGGNGSY4287GCAGGAGGAAACGGCTCCTAC428824831618865
854ITQAAYV4289ATTACGCAGGCTGCTTATGTT429024822000482
855AVLAGSM4291GCGGTTCTGGCGGGGTCTATG429224802000480
856GASGAVL4293GGAGCTTCAGGGGCGGTCCTC4294248002480
857PTATESL4295CCTACCGCTACAGAAAGTCTC4296248002480
858SPQGGLP4297TCGCCGCAGGGGGGTCTTCCT429824772000477
859VRASIVD4299GTGAGGGCTAGTATTGTTGAT430024772000477
860AVKENET4301GCGGTGAAGGAGAATGAGACG430224772000477
861HVSQDHS4303CATGTGTCTCAGGATCATTCG4304247762471
862SKSNDSS4305AGTAAGTCTAATGATAGTTCT4306247702477
863KTEQVQP4307AAAACAGAACAAGTCCAACCT4308247602476
864MTGTAHQ4309ATGACGGGTACGGCTCATCAG431024751832643
865IKQAVYV4311ATAAAACAAGCAGTCTACGTA431224742000474
866IPSGGPR4313ATTCCGTCTGGGGGTCCGCGT431424742000474
867TKPNMVS4315ACTAAGCCGAATATGGTGAGT431624722000472
868NDRNTSS4317AATGATAGGAATACGTCTTCG431824712000471
869TGISVGK4319ACCGGGATATCAGTAGGCAAA4320247114291042
870SEQQKDW4321TCTGAACAACAAAAAGACTGG432224702000470
871LLTSVKV4323CTACTAACGTCTGTTAAAGTA432424671938529
872TTEKVTG4325ACGACTGAGAAGGTTACTGGT4326246402464
873GALSTTK4327GGAGCGTTAAGTACAACCAAA432824632000463
874RTLHDNT4329AGGACCCTCCACGACAACACA433024632000463
875TREHNSI4331ACTAGGGAGCATAATTCGATT433224632000463
876NHITGGV4333AACCACATAACAGGCGGGGTC433424631819644
877SQAQAGY4335TCTCAGGCGCAGGCGGGTTAT433624622000462
878NTSRIGV4337AATACGAGTAGGATTGGTGTG433824621546916
879HVASTAA4339CACGTAGCGTCGACCGCGGCT434024612000461
880IARINSH4341ATAGCCAGAATCAACTCCCAC4342245902459
881LISSTLR4343TTGATTTCTAGTACGCTGCGT4344245902459
882SRLENIV4345TCGCGGCTCGAAAACATAGTA434624584582000
883SPTSSPT4347TCTCCGACGAGTTCGCCGACT4348245602456
884HDGLGVI4349CATGATGGTCTTGGTGTTATT435024552000455
885SDQNGPR4351TCTGATCAGAATGGGCCTCGG435224542000454
886SEQKNVW4353AGTGAGCAGAAGAATGTTTGG4354245402454
887RIVVSVP4355CGGATAGTTGTGAGCGTACCC4356245202452
888QDGPAVK4357CAGGATGGGCCTGCGGTGAAG4358245002450
889KKVITDD4359AAGAAGGTGATTACTGATGAT436024492000449
890MGGVNNT4361ATGGGGGGGGTTAATAATACG4362244802448
891VSSKGEW4363GTCAGCAGCAAAGGTGAATGG436424472000447
892GSGQMDA4365GGGAGTGGGCAGATGGATGCT4366244402444
893MAGKAPP4367ATGGCTGGGAAGGCGCCGCCG4368244302443
894QVRDTMT4369CAAGTTCGAGACACAATGACC437024422000442
895GHVTSGD4371GGCCACGTCACATCTGGGGAC437224422000442
896SEVSKGI4373AGTGAGGTTAGTAAGGGTATT437424422000442
897NRQEHTY4375AATCGGCAGGAGCATACGTAT4376244202442
898YVSTVVG4377TATGTTAGTACTGTTGTGGGG437824412000441
899SNLSVVI4379TCTAACCTCTCAGTCGTAATC438024402000440
900GHPQTTA4381GGGCATCCTCAGACTACGGCT438224402000440
901ESSGNKL4383GAGTCTAGTGGTAATAAGCTG438424392000439
902NFQADGL4385AACTTCCAAGCAGACGGACTG438624392000439
903GTSTLGY4387GGCACTTCAACCCTCGGCTAC438824382000438
904MNVDGRD4389ATGAATGTTGATGGGCGTGAT439024382000438
905QAIEGNF4391CAAGCCATCGAAGGGAACTTC439224381895543
906KSHSENN4393AAGTCGCATTCGGAGAATAAT4394243802438
907VNYSVAL4395GTAAACTACAGTGTTGCACTC4396243802438
908SGSRITV4397TCGGGTAGTCGGATTACTGTT4398243702437
909LSLNDGD4399CTATCACTTAACGACGGCGAC440024362000436
910QGGNSGA4401CAAGGGGGGAACTCGGGCGCA4402243524350
911LSNMLTV4403CTGTCTAATATGTTGACTGTT440424352000435
912AGGGGPR4405GCAGGCGGAGGCGGACCACGT440624342000434
913FDKTGVH4407TTTGATAAGACGGGGGTGCAT440824342000434
914GRSQLQM4409GGGCGGTCGCAGTTGCAGATG441024332000433
915VNAGHGI4411GTTAATGCTGGGCATGGGATT441224332000433
916RILQSGV4413CGGATACTCCAATCGGGTGTG441424322000432
917RTLGIPS4415CGGACGTTGGGGATTCCTTCT4416243202432
918LGGLGGL4417CTAGGAGGGCTGGGAGGCTTA441824312000431
919GIVGSVP4419GGGATTGTGGGTAGTGTTCCG4420242702427
920SSDRLLA4421TCTAGCGACAGACTCTTAGCG4422242702427
921KDVVRGS4423AAGGATGTGGTGCGGGGTAGT442424262000426
922LGHSAEP4425CTGGGACACTCAGCAGAACCC442624262000426
923STIPNLM4427AGTACTATTCCTAATCTGATG4428242602426
924GGTIGGH4429GGGGGTACGATTGGTGGTCAT443024252000425
925VPAGLGR4431GTTCCCGCAGGCTTAGGCCGT4432242502425
926LVHTTNN4433CTAGTCCACACGACCAACAAC443424242000424
927MTSGNLM4435ATGACTTCCGGCAACCTCATG4436242402424
928KESLSGS4437AAGGAGTCGCTTTCGGGTTCT4438242313771046
929LRVTEIL4439TTGCGTGTGACGGAGATTCTG444024212000421
930GHNVGVH4441GGTCATAATGTTGGTGTTCAT444224192000419
931LDKVRPA4443TTGGATAAGGTGCGTCCGGCG444424192000419
932LVANTPT4445CTCGTCGCAAACACACCAACC4446241802418
933SHGSDYK4447TCACACGGATCCGACTACAAA444824172274143
934TVHAPGT4449ACCGTTCACGCGCCAGGCACT445024172000417
935LNGGHVM4451CTCAACGGCGGACACGTGATG445224172000417
936LTLSTGV4453CTTACGCTGAGTACTGGGGTG445424172000417
937VVQVNGR4455GTTGTTCAGGTTAATGGGCGT445624162007408
938RNGVTSS4457AGAAACGGAGTAACGAGTTCG445824162000416
939GSASGEA4459GGCTCCGCTTCCGGAGAAGCC446024152000415
940STQAVYV4461AGTACGCAGGCTGTTTATGTT4462241502415
941SVDNGKR4463TCAGTCGACAACGGCAAACGA4464241502415
942TLHDKVL4465ACTCTGCATGATAAGGTGTTG446624142000414
943NDVRGSN4467AACGACGTCAGAGGGTCCAAC446824144142000
944HMNITVS4469CATATGAATATTACGGTTTCG447024131975438
945TTAAIIR4471ACGACGGCGGCTATTATTAGG4472241302413
946VDHGGVV4473GTGGATCATGGTGGTGTGGTT447424122000412
947QGGYSGV4475CAAGGGGGATACTCTGGTGTT447624122000412
948EPVASTI4477GAGCCTGTGGCTTCTACTATT447824112000411
949LGDTAYS4479TTAGGCGACACCGCTTACTCA4480241002410
950NGSAGDH4481AACGGCTCGGCTGGAGACCAC448224092000409
951TAVHTTS4483ACCGCAGTTCACACCACATCC448424091911498
952TGARDQY4485ACTGGTGCTCGGGATCAGTAT4486240911901219
953NGSAGDH4487AATGGGAGTGCGGGTGATCAT448824082006401
954LAGMGGI4489CTTGCGGGTATGGGGGGGATT449024079341474
955THRDAGV4491ACTCATAGGGATGCTGGTGTG449224052000405
956YREMGGS4493TACAGAGAAATGGGCGGCTCC449424052000405
957ETNLYHA4495GAGACGAATTTGTATCATGCT449624042000404
958SGANSSN4497TCTGGTGCGAATTCGTCTAAT449824022000402
959IVNSREF4499ATTGTGAATAGTCGTGAGTTT450024021418984
960ESLGGPR4501GAATCCTTGGGAGGCCCTCGA4502240202402
961VVDSYNK4503GTTGTTGATTCGTATAATAAG450424012000401
962GGVSSTN4505GGAGGAGTCTCGTCTACCAAC450624002000400
963DALTRLA4507GATGCTCTGACGCGGTTGGCG450824002000400
964SLRAGVP4509AGCCTAAGAGCAGGTGTACCG4510240002400
965MGWGTNP4511ATGGGGTGGGGTACTAATCCG4512239902399
966VESGSLG4513GTTGAGAGTGGTTCTCTTGGG451423982000398
967QGGYSLG4515CAAGGGGGATACAGCTTAGGA451623971800597
968DSKDVHR4517GATAGTAAGGATGTTCATAGG4518239702397
969SANPVAR4519TCTGCGAATCCGGTTGCGAGG452023962000396
970SILSGVS4521TCAATATTATCAGGGGTATCC452223962000396
971HAADVQR4523CATGCGGCGGATGTGCAGAGG452423962000396
972LSRGAEK4525CTTTCGAGGGGTGCGGAGAAG452623952000395
973AEGGAPR4527GCAGAAGGGGGCGCCCCTCGG452823951733662
974SRNGNVV4529TCTCGGAATGGGAATGTTGTT4530239502395
975LVATTLS4531CTGGTTGCAACTACCCTTTCT453223942000394
976AHGHVKV4533GCGCATGGTCATGTGAAGGTG453423942000394
977NAGVAQA4535AACGCCGGAGTAGCCCAAGCA453623942000394
978NRDNVAF4537AATCGGGATAATGTTGCTTTT4538239402394
979SVHSGLL4539AGTGTTCATAGTGGGCTGCTT454023932000393
980RQMGITV4541CGGCAGATGGGTATTACTGTT454223932000393
981LSHIGGL4543CTATCACACATAGGAGGCCTA454423922000392
982GTVTLGY4545GGGACGGTGACTTTGGGGTAT454623912000391
983SYGDGGV4547TCTTACGGAGACGGAGGAGTG454823912000391
984NHVSGSS4549AATCATGTTTCTGGTAGTTCT4550239112531138
985GSRALSS4551GGATCACGTGCCCTGTCAAGT455223902000390
986GLNHVGL4553GGCCTCAACCACGTGGGCCTT455423892000389
987TLTNGMP4555ACGCTTACTAATGGGATGCCT455623891699690
988MGASVTH4557ATGGGAGCGTCCGTGACTCAC455823861864522
989VQQLAIK4559GTGCAGCAGTTGGCGATTAAG4560238502385
990LSNLSNG4561CTTAGTAATCTGTCGAATGGT456223842000384
991IAGVAQS4563ATAGCTGGAGTGGCTCAATCA456423841960424
992EYALTEA4565GAGTATGCGTTGACGGAGGCT4566238302383
993AADSSGR4567GCCGCAGACAGCAGTGGCAGG4568238302383
994ILVDAHT4569ATCCTAGTGGACGCGCACACA457023822000382
995ADVHVRL4571GCTGATGTGCATGTGCGTTTG4572238202382
996YVQAVPS4573TATGTGCAGGCGGTTCCTTCT457423812000381
997ALAQNNM4575GCCTTAGCCCAAAACAACATG457623802000380
998QGGDSGG4577CAAGGCGGAGACTCGGGTGGG4578237923790
999RVEISAK4579AGGGTTGAGATTTCTGCGAAG458023791802577
1000LGRVEHT4581TTAGGCAGAGTGGAACACACT4582237902379
Macaque_allCNS_peptide rank
SEQSEQ
IDID
RankPeptideNO:SequenceNO:k-medoids cluster #
1PTQGTVR4583CCCACACAAGGCACAGTCCGT4584128
2PTQGTFR4585CCGACACAAGGAACATTCAGG4586128
3PSQGTLR4587CCTTCTCAGGGGACGCTTCGG4588128
4NLGAALS4589AACCTTGGGGCTGCCCTATCG4590203
5PKPSHGE4591CCTAAACCATCTCACGGAGAA4592 62
6PTPGTLR4593CCTACTCCGGGGACGCTTCGG4594128
7PTQGTLR4595CCTACTCAGGGGACGCTTCGG4596128
8QDGPAVK4597CAGGATGGGCCTGCGGTGAAG4598260
9PNQGTLR4599CCAAACCAAGGTACTCTACGA4600128
10ESLAGVR4601GAATCGTTGGCAGGGGTGCGT4602316
11AADSSAR4603GCCGCTGACTCATCGGCCCGT4604 10
12SHGSDPK4605TCTCATGGTTCTGATCCGAAG4606174
13AAGVIPN4607GCCGCCGGAGTGATACCTAAC4608179
14KNPGVDT4609AAAAACCCTGGAGTTGACACG4610337
15PAQGTLR4611CCGGCGCAAGGAACACTACGA4612128
16GRSQLPM4613GGCCGATCACAACTTCCAATG4614239
17VTTLSPV4615GTCACGACTTTGAGTCCAGTT4616272
18HSEGVGR4617CACTCGGAAGGAGTCGGACGC4618134
19SHGYDSK4619TCTCATGGTTATGATTCGAAG4620174
20NQLGELV4621AACCAACTCGGCGAACTAGTG4622209
21NGMGDVT4623AACGGCATGGGGGACGTTACT4624312
22VGGNVVH4625GTTGGTGGTAATGTTGTTCAT4626111
23LVTGMSS4627CTTGTTACTGGGATGAGTTCT4628 22
24MNVGHVL4629ATGAATGTGGGTCATGTTCTG4630268
25MSISEPR4631ATGTCTATTAGTGAGCCGCGG4632207
26ALGDALR4633GCACTAGGCGACGCATTACGC4634191
27QYAVSGG4635CAATACGCAGTGAGCGGCGGT4636345
28VLASLGP4637GTTCTGGCTTCGCTTGGTCCT4638176
29GRDLTPA4639GGTCGGGATCTTACGCCTGCT4640235
30RIVDSVP4641AGGATTGTGGATAGTGTTCCG4642120
31VDHGGVV4643GTGGATCATGGTGGTGTGGTT4644274
32ASDAVLR4645GCATCCGACGCCGTCCTAAGG4646 31
33ILVDAYA4647ATACTAGTAGACGCGTACGCT4648175
34PTEGTLR4649CCGACAGAAGGCACACTGCGA4650128
35TDALTTK4651ACTGATGCGCTTACGACTAAG4652145
36RVDSEKL4653AGGGTGGATTCGGAGAAGCTT4654146
37KNPGVDS4655AAGAATCCGGGGGTGGATTCT4656337
38RTDGADH4657CGCACAGACGGAGCAGACCAC4658252
39LSSTDGV4659CTGAGTTCGACTGATGGGGTT4660151
40AVFSSQK4661GCTGTATTCTCCAGTCAAAAA4662 39
41TVITGAP4663ACTGTGATCACTGGCGCCCCC4664 44
42LESAAMI4665CTGGAGTCGGCTGCTATGATT4666 41
43RVLTSDV4667CGTGTTCTGACGTCTGATGTG4668159
44PEPRSSY4669CCTGAGCCGCGTAGTAGTTAT4670204
45ESRNDVV4671GAGTCGAGGAATGATGTTGTT4672173
46RHIADAS4673AGACACATAGCGGACGCGTCG4674347
47HAAGASS4675CATGCGGCGGGTGCTAGTAGT4676 46
48HTLSTGV4677CACACCCTAAGCACGGGAGTA4678171
49TVADPRA4679ACTGTTGCGGATCCGCGGGCG4680296
50SAGGSLQ4681AGTGCTGGTGGGAGTCTTCAG4682 49
51MANMLSV4683ATGGCGAACATGTTATCGGTG4684167
52SLGEGRH4685AGTTTAGGCGAAGGGCGTCAC4686 90
53RESLEAL4687AGGGAGAGTCTTGAGGCGTTG4688270
54LAGLGGP4689CTTGCGGGTTTGGGGGGGCCT4690195
55LSLNDVV4691CTGAGTTTGAATGATGTGGTT4692173
56ATDSSVR4693GCCACCGACAGCAGTGTCCGT4694 55
57STINTLM4695AGTACTATTAATACTCTGATG4696 56
58LSRDVAV4697TTGTCGAGGGATGTGGCGGTT4698 97
59QYVVSGA4699CAGTATGTTGTTAGTGGTGCG4700345
60LIGAALD4701CTAATCGGCGCAGCACTCGAC4702203
61TMANSER4703ACGATGGCAAACTCGGAACGC4704 60
62GINEHVA4705GGGATCAACGAACACGTAGCC4706331
63SNLGETV4707TCGAATTTGGGGGAGACGGTT4708180
64GARMVMT4709GGTGCGCGGATGGTTATGACT4710269
65AMGGETA4711GCGATGGGTGGTGAGACTGCT4712185
66PTHGTLR4713CCGACCCACGGTACACTGCGA4714128
67LNGVTIT4715CTCAACGGCGTCACCATCACC4716305
68SVSHVVV4717TCGGTCTCTCACGTCGTCGTA4718202
69DVVLLTR4719GATGTTGTTTTGTTGACTAGG4720  6
70VGLLATV4721GTCGGTCTCCTTGCAACAGTG4722336
71ASESSTR4723GCATCTGAAAGCTCAACACGG4724 70
72RVGSSED4725CGGGTTGGGAGCTCCGAAGAC4726306
73LRVTENP4727CTTCGGGTCACCGAAAACCCC4728237
74VTEHTQF4729GTGACTGAGCATACGCAGTTT4730162
75SQAEGSV4731TCCCAAGCGGAAGGCAGCGTG4732 74
76VLLGINT4733GTCCTGCTCGGAATAAACACC4734283
77LDSGIPR4735CTCGACTCTGGTATCCCCAGA4736134
78GLGLAAN4737GGTTTGGGTTTGGCGGCGAAT4738225
79VMSGTSH4739GTTATGTCGGGTACTAGTCAT4740238
80DVAAGYR4741GACGTAGCGGCAGGATACCGA4742143
81SIGDLGK4743AGTATCGGTGACCTAGGTAAA4744137
82NGSSIGV4745AACGGCTCATCTATCGGCGTG4746299
83LERGHMY4747CTCGAAAGAGGCCACATGTAC4748 41
84ITENASR4749ATTACTGAGAATGCGTCGCGG4750 83
85VHDSTPL4751GTGCATGATTCGACTCCGTTG4752325
86TLALSER4753ACCTTAGCCTTATCAGAACGA4754 85
87TVDSPMR4755ACCGTCGACAGCCCTATGCGA4756 40
88STLHTSI4757AGTACTCTTCATACTTCGATT4758254
89VGSLTAS4759GTGGGGTCGCTTACGGCTAGT4760 88
90MAGGTNP4761ATGGCAGGTGGCACAAACCCT4762195
91SLSDGSL4763TCTCTGTCTGATGGTTCTCTT4764 90
92IHFSGDN4765ATCCACTTCAGCGGCGACAAC4766 45
93TGRVEAA4767ACGGGTAGGGTTGAGGCGGCG4768333
94TTAAIVT4769ACGACGGCGGCTATTGTTACG4770 93
95SIQSEVT4771AGCATCCAATCCGAAGTTACC4772  4
96DSSGGGT4773GACAGCTCAGGCGGGGGCACA4774 37
97TMAISDR4775ACTATGGCGATTTCTGATCGG4776262
98RVENGGT4777CGAGTGGAAAACGGCGGGACC4778295
99REALALT4779AGGGAGGCGCTGGCTCTGACG4780270
100IVTPTNT4781ATTGTTACTCCTACGAATACG4782 77
101LTSDNLA4783CTTACCTCAGACAACCTAGCC4784280
102DAPRDGA4785GACGCACCCCGCGACGGGGCT4786151
103AVLSQNI4787GCTGTGTTGTCTCAGAATATT4788102
104VLLGSNR4789GTGCTTTTGGGTAGTAATAGG4790283
105SMAVTAK4791AGTATGGCGGTGACGGCGAAG4792104
106WSSELHA4793TGGTCTAGTGAGTTGCATGCT4794 33
107ENTVSPV4795GAAAACACAGTGAGCCCCGTC4796272
108LNMGPLH4797CTGAATATGGGTCCTTTGCAT4798231
109GRGTNDH4799GGTCGGGGTACGAATGATCAT4800225
110STEYAML4801TCTACTGAGTATGCGATGTTG4802  8
111MGSNGQV4803ATGGGGTCTAATGGGCAGGTT4804286
112LAGSVVV4805CTGGCGGGTTCGGTTGTTGTG4806111
113YSMTVTT4807TATAGTATGACGGTTACGACT4808 80
114VLVGTSL4809GTTCTTGTTGGGACGAGTTTG4810113
115MVTPTNR4811ATGGTGACACCCACAAACCGC4812 77
116VTTLTPV4813GTGACTACGCTTACTCCTGTG4814272
117QTGEAAV4815CAGACGGGGGAGGCGGCGGTT4816116
118NDRITST4817AACGACCGAATAACCTCAACT4818157
119SDGKTHT4819TCAGACGGCAAAACCCACACC4820 33
120TSLLPQT4821ACGTCTCTGCTTCCTCAGACT4822292
121PSLEHLA4823CCTAGTCTTGAGCATTTGGCT4824349
122DHGSFAK4825GATCATGGTAGTTTTGCGAAG4826174
123PTNGYPL4827CCCACAAACGGGTACCCGCTC4828341
124TLTDVVH4829ACTCTTACTGATGTGGTGCAT4830261
125LADGSVR4831TTAGCAGACGGCTCCGTCCGC4832255
126GGVSSTN4833GGTGGTGTTTCTTCGACTAAT4834335
127SHGTDSK4835AGTCACGGCACGGACTCTAAA4836174
128ALATDMS4837GCGCTGGCTACTGATATGTCG4838127
129QVRDTMT4839CAAGTCAGAGACACGATGACC4840318
130NGYTEGR4841AATGGGTATACGGAGGGGCGT4842299
131DSRVSGD4843GATAGTCGTGTGTCGGGGGAT4844 37
132VLSGEEL4845GTTCTTAGTGGGGAGGAGTTG4846131
133HNGQVGV4847CATAATGGGCAGGTTGGTGTG4848299
134HNSHVLT4849CACAACTCCCACGTATTAACC4850  0
135RPEIEVR4851CGGCCGGAGATTGAGGTTAGG4852 38
136KGSDSPM4853AAAGGATCGGACTCACCGATG4854278
137DQLNDGR4855GATCAGCTGAATGATGGGCGG4856 54
138SLLHDGA4857AGTTTGTTGCATGATGGGGCG4858 34
139RDTQYDH4859CGTGATACGCAGTATGATCAT4860261
140PREHNQA4861CCGCGTGAGCATAATCAGGCT4862338
141SRLENIS4863TCGCGTCTTGAAAACATCTCC4864349
142FDQTHKT4865TTTGATCAGACGCATAAGACT4866245
143MTGISIV4867ATGACAGGCATCTCTATCGTA4868142
144ASSHVTV4869GCTTCGAGTCATGTTACTGTG4870  0
145SHGSDLK4871AGCCACGGGAGCGACCTAAAA4872174
146NIGADPK4873AACATCGGGGCCGACCCCAAA4874 54
147TLGSLSQ4875ACACTAGGGTCCCTGTCACAA4876137
148PTQGTIR4877CCTACTCAGGGGACGATTCGG4878128
149FTGGTGT4879TTTACTGGTGGTACGGGTACT4880164
150IPSTGAQ4881ATTCCGAGTACGGGGGCGCAG4882 30
151STLHTTT4883AGTACTCTTCATACTACGACT4884150
152GGTNSAH4885GGTGGAACAAACTCAGCGCAC4886335
153NVGLVSP4887AACGTAGGGCTCGTATCACCA4888197
154VYESTVR4889GTTTATGAGAGTACGGTGAGG4890153
155MGASDTH4891ATGGGGGCTAGTGATACGCAT4892 26
156VIATGNP4893GTTATTGCTACGGGGAATCCT4894176
157PEQQKVW4895CCTGAGCAGCAGAAGGTTTGG4896 94
158SDGQFGR4897TCTGACGGTCAATTCGGACGA4898 54
159IMTSVTM4899ATCATGACAAGTGTTACAATG4900136
160AQDHGTL4901GCGCAGGATCATGGGACGTTG4902286
161GGLVVVG4903GGCGGACTAGTAGTCGTGGGG4904269
162TSVESNL4905ACGTCGGTGGAGTCGAATCTT4906135
163SVTDIKH4907TCGGTGACGGACATAAAACAC4908261
164LSMTDGL4909CTGAGTATGACTGATGGGCTT4910151
165LNMKADG4911TTAAACATGAAAGCAGACGGA4912192
166LNSGVSR4913CTCAACAGTGGTGTCAGCCGC4914134
167STIPTLL4915AGTACTATTCCTACTCTGTTG4916166
168TFGIDAS4917ACATTCGGAATCGACGCGTCC4918  5
169AVGVILN4919GCGGTGGGTGTTATTCTGAAT4920179
170GSREDVR4921GGGAGTAGGGAGGATGTGCGT4922343
171HLHNTLN4923CATCTTCATAATACTCTTAAT4924 56
172VSVAVGL4925GTTTCGGTGGCTGTTGGGTTG4926133
173LGVSRDL4927CTGGGTGTGTCTCGGGATCTG4928 99
174KGSDNTM4929AAGGGTTCTGATAATACTATG4930278
175ASIPTLN4931GCATCCATACCAACGCTAAAC4932332
176YHASDSK4933TATCATGCTTCTGATTCGAAG4934174
177QYSELHH4935CAATACTCCGAATTGCACCAC4936261
178DLTTPVR4937GATTTGACTACTCCGGTGCGT4938342
179IGTEISS4939ATTGGTACGGAGATTTCGTCG4940184
180STDMRSP4941AGTACGGATATGAGGTCGCCG4942319
181TKITNED4943ACAAAAATCACTAACGAAGAC4944258
182STLQGEA4945AGCACCCTCCAAGGGGAAGCA4946181
183PLLGNTI4947CCGCTTTTGGGGAATACGATT4948328
184NGLQVSI4949AATGGGCTGCAGGTTAGTATT4950190
185AVTNPLM4951GCGGTTACTAATCCTTTGATG4952208
186DVTVSMR4953GATGTTACTGTTTCTATGCGT4954342
187NQLAEQV4955AATCAGTTGGCGGAGCAGGTT4956226
188RPDASST4957AGGCCTGATGCTTCTTCGACG4958311
189DTSLRLM4959GACACCTCTCTACGCCTTATG4960 35
190TLPELKL4961ACGTTGCCGGAGTTGAAGCTT4962301
191QNGLQLL4963CAGAATGGGTTGCAGCTTTTG4964142
192ASREVLY4965GCCAGTCGCGAAGTACTCTAC4966343
193IASDIGR4967ATTGCTTCGGATATTGGTCGG4968309
194VADSYNL4969GTCGCAGACAGTTACAACCTA4970141
195STVGINV4971AGTACGGTCGGGATCAACGTT4972194
196GVAGRIL4973GGGGTGGCTGGGCGTATTCTG4974149
197NEAVNVR4975AATGAGGCTGTTAATGTTCGG4976 91
198TVGHDNK4977ACCGTAGGACACGACAACAAA4978 54
199TLQQLQL4979ACTCTCCAACAACTGCAATTG4980301
200ALSGLAN4981GCATTGAGCGGCCTGGCGAAC4982199
201ALGTQGS4983GCTCTGGGTACGCAGGGTTCT4984291
202PNERLAV4985CCTAACGAACGATTGGCAGTC4986144
203GVAATNT4987GGAGTTGCAGCCACAAACACG4988132
204WDHNSLK4989TGGGACCACAACAGCTTGAAA4990 61
205LVGVVEP4991CTTGTTGGTGTGGTTGAGCCG4992287
206TLTDRAS4993ACGTTGACGGATAGGGCGTCT4994205
207SGSNTGH4995TCGGGGTCTAATACGGGTCAT4996299
208VLASHGT4997GTTCTGGCTTCGCATGGTACT4998176
209AVGNVLL4999GCTGTGGGGAATGTGCTTTTG5000208
210NTVVNDP5001AACACAGTCGTGAACGACCCT5002120
211KLMDSRD5003AAGCTGATGGATTCGCGGGAT5004249
212RNQPEAM5005AGGAACCAACCAGAAGCCATG5006252
213VIAGLGV5007GTGATCGCGGGACTCGGCGTC5008212
214RGQSDPL5009CGGGGGCAGTCTGATCCGTTG5010 26
215GLNEHES5011GGGCTTAACGAACACGAATCT5012331
216GVVNDER5013GGCGTTGTCAACGACGAACGG5014 60
217GTVGSMV5015GGTACGGTGGGTTCTATGGTT5016216
218LTGERIL5017CTAACCGGTGAACGCATACTT5018142
219PTQGVSM5019CCAACCCAAGGAGTTTCGATG5020128
220AAREELN5021GCGGCTCGGGAGGAGCTTAAT5022343
221FNGLPAQ5023TTCAACGGTCTCCCCGCACAA5024160
222HTIAASM5025CACACCATAGCCGCAAGTATG5026221
223TDAGDGK5027ACAGACGCGGGGGACGGCAAA5028157
224SDLRPPL5029TCGGATCTTCGGCCGCCGCTG5030244
225AGLSQNL5031GCTGGGTTGTCTCAGAATCTT5032224
226GMGASSK5033GGTATGGGGGCGTCTTCTAAG5034225
227SQLAELV5035AGTCAGTTGGCGGAGCTGGTT5036209
228LTRGEEK5037CTTACGAGGGGTGAGGAGAAG5038294
229AGGVILN5039GCGGGGGGTGTTATTCTGAAT5040179
230GNGTGVL5041GGAAACGGCACCGGGGTCCTA5042172
231VVSGIPN5043GTGGTGTCTGGTATTCCGAAT5044199
232GVMAAGI5045GGAGTCATGGCCGCGGGTATC5046 34
233GVANESP5047GGTGTGGCGAATGAGAGTCCG5048197
234ELMASTI5049GAGCTTATGGCTTCTACTATT5050130
235NLGVVQV5051AACCTAGGAGTCGTACAAGTC5052 28
236RTTPDVP5053CGTACGACTCCCGACGTACCT5054107
237LESLSHH5055CTGGAGTCGCTTTCTCATCAT5056 41
238LSLTHGD5057CTGAGTTTGACTCATGGGGAT5058259
239DGVNTAL5059GATGGGGTTAATACGGCGTTG5060154
240QDGPAEK5061CAGGATGGGCCTGCGGAGAAG5062260
241SPAGLGK5063AGCCCCGCGGGCCTAGGCAAA5064 12
242RYNDEST5065AGATACAACGACGAATCCACT5066295
243VVAGTNS5067GTCGTTGCAGGTACAAACTCG5068108
244WSGQIHV5069TGGAGTGGTCAGATTCATGTG5070264
245ANSHTNS5071GCAAACAGTCACACCAACTCT5072303
246VVQAPGR5073GTTGTTCAGGCTCCTGGGCGT5074176
247ATQGTLR5075GCTACTCAGGGGACGCTTCGG5076128
248RVDPSGL5077CGTGTGGATCCTTCTGGGCTG5078 86
249TKDIGVM5079ACGAAAGACATAGGCGTAATG5080200
250GGKGEGP5081GGTGGGAAGGGTGAGGGTCCG5082149
251RGAVSTE5083CGGGGGGCTGTGTCGACTGAG5084250
252STDRESR5085TCGACTGATCGGGAGTCGCGG5086 72
253NLHTAEA5087AACCTCCACACTGCTGAAGCG5088302
254MSTAMSL5089ATGTCGACGGCGATGAGTCTG5090126
255TGSSAML5091ACGGGGAGTTCGGCGATGCTT5092125
256LISGTLR5093TTGATTTCTGGTACGCTGCGT5094124
257GIGGVIS5095GGTATTGGTGGTGTGATTTCG5096234
258RLENRGV5097CGGTTGGAGAATAGGGGGGTT5098243
259LPNGGGF5099CTGCCGAATGGGGGGGGGTTT5100 30
260TGDRDQN5101ACTGGTGATCGGGATCAGAAT5102251
261SLAITER5103AGTTTGGCGATTACTGAGCGG5104123
262STLISET5105TCCACGTTGATATCAGAAACC5106122
263DVRGSDI5107GACGTACGGGGGTCTGACATC5108156
264NLSLSLR5109AATCTGTCTCTGTCGTTGCGT5110263
265GSGGVSV5111GGTTCGGGTGGTGTTAGTGTG5112264
266LVSGLGP5113CTTGTGAGTGGGCTGGGTCCG5114212
267LTKSTEW5115CTCACCAAATCCACAGAATGG5116281
268TTRADPA5117ACTACTCGGGCTGATCCTGCG5118273
269ASMSAVN5119GCATCAATGTCAGCTGTCAAC5120121
270RVDSAQP5121AGAGTAGACAGTGCCCAACCC5122186
271EPSLGSK5123GAACCAAGTCTCGGGTCGAAA5124157
272YLGADAA5125TATCTTGGTGCTGATGCTGCT5126 68
273RDEAYRA5127AGGGATGAGGCTTATCGTGCG5128233
274RVAMSVT5129AGGGTGGCGATGTCTGTGACG5130 32
275SHGSDSN5131TCGCACGGCTCCGACTCCAAC5132174
276ETRMISE5133GAGACGCGTATGATTTCGGAG5134167
277AHIGTLT5135GCACACATCGGAACTCTCACC5136 53
278MGGVTNP5137ATGGGAGGTGTCACCAACCCC5138195
279PSPSVTL5139CCGTCGCCTAGTGTTACTTTG5140 95
280SSSGAAW5141TCGAGTAGTGGGGCGGCGTGG5142  8
281HTQGTLR5143CATACTCAGGGGACGCTTCGG5144128
282LTDVTQM5145TTAACCGACGTCACACAAATG5146281
283ILSSATD5147ATTCTTAGTTCGGCGACTGAT5148119
284NGSNDLS5149AACGGTAGCAACGACCTTTCA5150 59
285KGSDNHM5151AAAGGCAGTGACAACCACATG5152278
286QEQGTTT5153CAGGAGCAGGGTACGACTACT5154285
287PGVAMVT5155CCCGGGGTCGCTATGGTAACT5156 17
288LVGVSSE5157CTGGTGGGTGTGTCGTCTGAG5158287
289SGGTRGP5159TCTGGTGGGACTCGTGGTCCT5160310
290AIQTNDA5161GCAATCCAAACCAACGACGCG5162289
291LISTTLR5163TTGATTTCTACTACGCTGCGT5164118
292ALGDQAR5165GCGTTAGGGGACCAAGCGCGT5166291
293GLNDHVA5167GGTCTGAATGATCATGTGGCG5168331
294NDVSLAT5169AATGATGTGAGTCTGGCTACT5170293
295KMAITDD5171AAAATGGCTATAACAGACGAC5172147
296LSNHGPI5173CTGAGTAATCATGGGCCTATT5174286
297VLNDNLA5175GTGTTAAACGACAACTTAGCT5176280
298RHVHVEG5177CGCCACGTACACGTCGAAGGC5178236
299DGRAELR5179GATGGGCGGGCGGAGTTGCGT5180333
300SGISFLA5181AGCGGAATCAGCTTCTTGGCT5182154
301RISPEGT5183CGTATATCACCGGAAGGCACT5184295
302RVTPTNT5185CGCGTGACGCCAACTAACACT5186 77
303STTSSPS5187TCGACCACCTCATCCCCTAGC5188117
304LLHGIIA5189CTTTTACACGGAATAATCGCC5190298
305ASESSPP5191GCATCAGAATCATCACCACCC5192139
306TSREEQW5193ACTTCTCGTGAGGAGCAGTGG5194343
307AGDRDQY5195GCTGGTGATCGGGATCAGTAT5196251
308GVAIALQ5197GGGGTTGCTATTGCTCTTCAG5198307
309LLGGTLA5199TTGTTGGGGGGTACTCTGGCT5200298
310ALKEYES5201GCGCTGAAGGAGTATGAGTCG5202277
311RGGKEEM5203CGAGGTGGCAAAGAAGAAATG5204310
312VDFGDHT5205GTAGACTTCGGCGACCACACC5206312
313GADVNNH5207GGTGCTGACGTCAACAACCAC5208 62
314MNGGNVL5209ATGAACGGCGGCAACGTGCTC5210268
315HQGDTIV5211CATCAGGGGGATACGATTGTG5212275
316SHGSDSR5213AGCCACGGGTCGGACTCCCGG5214174
317TGHEGGS5215ACAGGCCACGAAGGAGGTTCG5216327
318PNERHTL5217CCTAACGAACGCCACACCTTG5218144
319TDALLMH5219ACAGACGCACTCCTCATGCAC5220157
320SVTERSG5221TCTGTGACGGAGAGGAGTGGT5222319
321LGHANGL5223TTAGGGCACGCAAACGGACTT5224171
322GVSDFQS5225GGGGTATCGGACTTCCAATCA5226330
323GVANVSP5227GGAGTTGCTAACGTCAGCCCA5228197
324ATLLPQT5229GCCACACTTCTGCCACAAACG5230122
325SSLLTTA5231TCGTCGTTGCTGACTACTGCT5232115
326LNGAPLL5233CTGAATGGTGCGCCGTTGCTG5234268
327SGSIVVV5235AGTGGTTCGATTGTGGTGGTT5236114
328DVAISMR5237GACGTAGCGATATCCATGCGA5238297
329LLADERV5239TTACTCGCAGACGAAAGGGTC5240296
330LTSGLAA5241TTGACGTCTGGTTTGGCGGCG5242112
331GPLNQSL5243GGTCCGCTGAATCAGTCTTTG5244224
332EGSEHVK5245GAAGGGTCCGAACACGTGAAA5246 35
333RQDNSDV5247CGGCAGGATAATTCGGATGTG5248159
334LGAGSLS5249TTGGGGGCGGGGAGTCTGTCT5250110
335KLAEGVR5251AAACTAGCCGAAGGAGTGCGG5252 44
336HGTLESQ5253CACGGCACCCTCGAATCGCAA5254178
337LDTSDRL5255TTGGACACGTCTGACCGGCTC5256248
338VRGEETV5257GTTCGTGGGGAAGAAACCGTC5258185
339VVLSLAT5259GTTGTCTTAAGTCTAGCCACT5260109
340RDDQGIP5261CGGGATGATCAGGGGATTCCG5262163
341AVAGTNS5263GCAGTTGCGGGTACAAACTCG5264108
342RGGVTTE5265CGTGGAGGCGTAACCACCGAA5266250
343RMTLTGD5267CGTATGACTTTGACTGGTGAT5268247
344DDAVSKR5269GATGATGCTGTTTCTAAGCGT5270143
345VNHGGVD5271GTAAACCACGGAGGAGTTGAC5272274
346PGEPLRL5273CCGGGAGAACCCTTGCGACTC5274279
347PGEHYEA5275CCGGGTGAGCATTATGAGGCT5276196
348PSQGMTR5277CCTAGTCAGGGTATGACTCGT5278128
349RASADVV5279AGGGCGAGTGCGGATGTTGTG5280209
350DAQSRLA5281GATGCTCAGTCGCGGTTGGCG5282290
351DPSLGSP5283GATCCGTCTCTGGGTTCTCCG5284 37
352RNVSDMT5285CGAAACGTGTCGGACATGACC5286347
353VQSADPR5287GTCCAATCCGCGGACCCTCGC5288 27
354GVSTLSL5289GGGGTTTCTACTCTGAGTCTT5290106
355REQQKAW5291CGAGAACAACAAAAAGCCTGG5292169
356DTASTQS5293GACACAGCATCTACTCAATCC5294105
357MSDSGTV5295ATGAGCGACTCGGGCACGGTT5296286
358DSRTVDS5297GACTCTCGAACCGTCGACTCA5298105
359NSGPQLS5299AACTCGGGCCCACAACTTTCG5300203
360QKDSLVA5301CAGAAGGATTCGTTGGTTGCT5302284
361TLATQEL5303ACTCTGGCGACGCAGGAGCTG5304103
362QSMTDGV5305CAGAGTATGACTGATGGGGTT5306151
363VTVAGSV5307GTTACGGTGGCTGGTTCGGTG5308101
364SVVGLDS5309TCAGTCGTCGGATTAGACTCG5310282
365MNGGHLM5311ATGAATGGGGGTCATCTTATG5312268
366DKVVDEV5313GATAAGGTGGTTGATGAGGTG5314226
367VVGTQDR5315GTTGTGGGGACTCAGGATAGG5316211
368DVAVYIR5317GATGTTGCTGTTTATATTCGT5318143
369EPSLGSR5319GAGCCGTCTCTGGGTTCTCGG5320182
370ASVSALL5321GCGAGTGTTTCTGCGTTGTTG5322100
371LSLDRPS5323CTAAGTCTAGACCGACCCTCG5324327
372PAIQGNF5325CCAGCCATCCAAGGAAACTTC5326135
373SVEPLSL5327TCCGTAGAACCTCTATCCCTC5328279
374KSSDTPM5329AAGAGTTCTGATACTCCTATG5330278
375SFDTYGA5331TCCTTCGACACTTACGGGGCC5332 62
376AVSDYAV5333GCAGTATCTGACTACGCAGTC5334330
377AGVSASL5335GCGGGTGTTTCTGCGTCGTTG5336 99
378VVSQLPV5337GTCGTCTCTCAACTACCGGTA5338276
379EIVLTVP5339GAGATTGTTTTGACTGTGCCG5340120
380MIGGHVQ5341ATGATTGGGGGTCATGTTCAG5342268
381TGLGLMV5343ACCGGACTCGGACTAATGGTA5344323
382VTTHSPV5345GTTACCACCCACAGTCCAGTT5346 67
383VLPHANT5347GTCCTACCACACGCCAACACA5348 87
384STVLVPK5349TCTACCGTACTAGTCCCTAAA5350244
385GGDALNQ5351GGGGGGGACGCCCTTAACCAA5352 31
386QIHDTAL5353CAAATCCACGACACAGCGCTC5354318
387ALTNGQR5355GCACTAACCAACGGTCAACGT5356 60
388PARYRLW5357CCGGCGCGGTATCGGCTTTGG5358 35
389RNEGINQ5359CGTAATGAGGGTATTAATCAG5360252
390QRSDSVM5361CAGCGGTCGGATAGTGTGATG5362275
391NRQENSY5363AATCGGCAGGAGAATTCGTAT5364349
392NDRNTSS5365AATGATAGGAATACGTCTTCG5366242
393MNGGHVL5367ATGAATGGGGGTCATGTTCTG5368268
394IPATADK5369ATCCCAGCCACGGCGGACAAA5370140
395YAGIAQG5371TATGCGGGGATTGCTCAGGGT5372269
396STQGGLA5373AGTACCCAAGGCGGATTAGCG5374274
397GLLKNLD5375GGTTTGCTAAAAAACCTCGAC5376349
398GLVQMSS5377GGTCTGGTGCAGATGTCTTCT5378326
399GVSVPNV5379GGGGTGAGTGTGCCGAATGTT5380132
400TTSRPEE5381ACTACTTCTCGGCCGGAGGAG5382292
401RDMGALV5383CGGGATATGGGTGCTCTTGTG5384231
402VHASSPT5385GTGCATGCTTCTAGTCCGACT5386325
403REQQKYW5387CGGGAACAACAAAAATACTGG5388169
404CNAAGCP5389TGTAATGCTGCGGGGTGTCCG5390 17
405PMRPGVA5391CCGATGCGGCCGGGTGTGGCT5392200
406FGGVINA5393TTCGGGGGAGTAATAAACGCT5394195
407STFSTVM5395AGCACATTCTCCACTGTTATG5396 98
408GHQNGGI5397GGGCACCAAAACGGCGGAATC5398265
409MTSGNLM5399ATGACCTCTGGCAACCTCATG5400280
410RESANAD5401CGTGAGTCTGCGAATGCTGAT5402233
411SGDVARH5403TCAGGCGACGTTGCCCGACAC5404 17
412VSANVTI5405GTTTCTGCGAATGTTACGATT5406 97
413VPGSTTT5407GTTCCAGGCTCAACGACTACC5408267
414PLVPQGG5409CCCTTAGTACCTCAAGGCGGT5410 34
415PGDRDQY5411CCAGGCGACCGAGACCAATAC5412251
416HVSGASL5413CACGTGTCCGGCGCCAGCTTA5414266
417YTSGTGT5415TACACCTCGGGCACAGGGACA5416 29
418PNTRDPI5417CCTAATACGCGGGATCCGATT5418144
419SPVGIIA5419TCTCCTGTGGGTATTATTGCG5420  7
420LGDSDET5421TTAGGAGACTCGGACGAAACC5422219
421NRHETLS5423AACCGCCACGAAACACTATCA5424349
422GSVSSTK5425GGCTCCGTCAGTTCTACGAAA5426 96
423VFTGTDP5427GTGTTCACCGGCACAGACCCT5428129
424YGSNVLS5429TACGGTTCTAACGTCCTCTCA5430110
425TDNGALS5431ACTGATAATGGTGCGTTGTCG5432110
426TGLGDRA5433ACCGGCTTGGGAGACAGGGCT5434273
427DPSLGYP5435GACCCCAGTTTGGGCTACCCT5436 37
428LSLTEGV5437CTGAGTTTGACTGAGGGGGTT5438259
429ARVLEKT5439GCCCGAGTCCTTGAAAAAACC5440122
430VDTSARD5441GTTGATACTAGTGCTCGTGAT5442248
431PTQETLR5443CCTACTCAGGAGACGCTTCGG5444128
432AALTREI5445GCTGCTCTTACGCGGGAGATT5446258
433RDLTNDV5447CGCGACTTAACTAACGACGTT5448159
434GLSERAQ5449GGCCTGTCCGAACGAGCACAA5450205
435DSLLPQT5451GATTCTCTGCTTCCTCAGACT5452292
436LEANVSH5453CTTGAGGCGAATGTTTCGCAT5454 19
437AGSTVTW5455GCGGGGTCGACTGTTACTTGG5456114
438YGVTLST5457TACGGCGTAACCCTCTCTACC5458 13
439GPSGAGI5459GGGCCATCAGGGGCAGGCATC5460 30
440VSNGHFV5461GTTAGTAATGGGCATTTTGTT5462268
441GVSLPMS5463GGCGTATCACTACCCATGAGC5464307
442MAASVTL5465ATGGCGGCTAGTGTTACGCTT5466 95
443KIGENAS5467AAGATTGGTGAGAATGCTTCT5468321
444ISMTLLP5469ATTTCGATGACTCTGCTGCCG5470184
445GAVSSTK5471GGTGCTGTTTCTTCGACTAAG5472 92
446TTLAHPA5473ACTACTCTGGCTCATCCTGCG5474244
447LMNDLLS5475CTTATGAACGACTTACTCTCC5476175
448TTAANVR5477ACGACGGCGGCTAATGTTAGG5478 91
449PNDRLTV5479CCAAACGACCGGTTGACGGTT5480144
450LQVEQVM5481CTTCAGGTTGAGCAGGTTATG5482329
451MLMGAET5483ATGCTCATGGGGGCAGAAACT5484257
452LSLTMPA5485CTCTCGCTTACAATGCCTGCC5486207
453KEIHVSV5487AAGGAGATTCATGTGTCGGTT5488 69
454MAVDVTK5489ATGGCAGTCGACGTAACCAAA5490256
455NSLATMV5491AATAGTCTGGCGACGATGGTG5492 89
456RSISGDW5493CGTTCCATAAGTGGCGACTGG5494159
457SLQQANT5495TCGCTTCAGCAGGCTAATACG5496 87
458PTTNPLL5497CCGACTACTAATCCGCTTCTG5498 56
459ADVLIRG5499GCGGACGTGCTCATACGCGGT5500269
460HVASAGA5501CATGTTGCTTCGGCGGGGGCG5502253
461LQDRTTL5503CTCCAAGACCGCACTACTCTC5504204
462NAHDTET5505AATGCGCATGATACTGAGACT5506246
463RVDSALL5507AGAGTAGACAGCGCTCTTTTA5508 86
464RNQGSES5509CGTAATCAGGGTAGTGAGAGT5510252
465EIMSSNR5511GAAATCATGTCGTCCAACCGT5512249
466GSRENAR5513GGGAGTAGGGAGAATGCGCGT5514 70
467GGDTSRS5515GGGGGTGATACGAGTCGTAGT5516335
468YLALTGI5517TATCTTGCGCTTACGGGGATT5518 78
469LDTSARL5519CTTGATACTAGTGCTCGTCTT5520248
470LLTLTQA5521CTGCTCACCCTGACTCAAGCG5522247
471SEQNKVW5523TCCGAACAAAACAAAGTATGG5524 94
472ADAAHAL5525GCGGACGCAGCCCACGCGCTC5526245
473GVAATNS5527GGGGTGGCTGCGACGAATTCT5528 84
474NSGSMHT5529AACTCAGGAAGCATGCACACT5530293
475PDGAAPM5531CCTGATGGTGCGGCTCCTATG5532341
476STLASPR5533TCAACCCTAGCCTCGCCTCGA5534244
477GADDAAL5535GGAGCCGACGACGCAGCCCTC5536315
478AGASAEA5537GCTGGGGCTAGTGCTGAGGCG5538228
479SRLEYIG5539AGCCGCCTTGAATACATCGGG5540349
480YTVGSLA5541TACACCGTTGGCTCACTCGCC5542314
481LVHLGTS5543TTGGTTCATCTTGGGACTTCT5544198
482GLYDAAT5545GGGCTTTATGATGCGGCGACT5546315
483KNGGHDL5547AAAAACGGTGGGCACGACCTA5548268
484NTENASR5549AATACTGAGAATGCGTCGCGG5550242
485HGTLVSQ5551CATGGGACTTTGGTGTCTCAG5552178
486HAGLGVT5553CATGCTGGTCTTGGTGTTACT5554163
487PSYQGNG5555CCGAGTTATCAGGGGAATGGT5556181
488MGDNYAR5557ATGGGTGATAATTATGCTCGG5558 10
489LEKDPMT5559TTGGAAAAAGACCCTATGACT5560318
490SMNGTSL5561AGTATGAATGGGACTAGTCTT5562 81
491SNLGNTS5563AGTAACCTTGGAAACACCTCG5564241
492GSGAGLH5565GGAAGTGGAGCTGGCCTTCAC5566172
493LANTVVT5567CTTGCTAATACGGTTGTGACG5568 80
494AVRENGI5569GCCGTTCGGGAAAACGGCATA5570 34
495VTELTRF5571GTGACTGAGCTTACGCGGTTT5572162
496RNLDLNH5573AGGAATCTTGATCTGAATCAT5574261
497ALASTQT5575GCACTAGCATCGACCCAAACT5576 79
498FISGALT5577TTCATATCCGGCGCCTTAACT5578240
499QSQTAVA5579CAGTCTCAGACGGCTGTTGCT5580  1
500VSSQLPM5581GTGTCGTCTCAGTTGCCGATG5582239
501VIALTEA5583GTGATTGCGTTGACGGAGGCT5584 78
502TTVEVSG5585ACAACCGTAGAAGTAAGCGGC5586236
503GVAGTNS5587GGAGTTGCGGGAACAAACTCC5588 84
504RESGEQA5589AGGGAGAGTGGGGAGCAGGCT5590233
505HIVLSHA5591CATATTGTGCTGAGTCATGCT5592 78
506ILGVYSD5593ATACTGGGCGTTTACTCCGAC5594287
507QGGTTLR5595CAAGGGGGGACTACTCTACGC5596 49
508YHTEKMF5597TACCACACCGAAAAAATGTTC5598320
509LEVGALR5599CTGGAAGTAGGCGCACTTCGT5600231
510GFGLTED5601GGGTTTGGGTTGACGGAGGAT5602322
511PLKGGGE5603CCGTTGAAAGGCGGGGGTGAA5604328
512GLVHMPS5605GGCTTAGTTCACATGCCCTCA5606326
513VTGHPTL5607GTTACGGGTCATCCGACTCTT5608  0
514WNHSTTV5609TGGAACCACTCCACGACAGTC5610  2
515PGEHYRL5611CCTGGAGAACACTACAGATTG5612196
516LSLTDLV5613CTGAGTTTGACTGATTTGGTT5614230
517GTGSTNV5615GGAACTGGATCGACAAACGTT5616229
518LSKEHAP5617TTGAGTAAGGAGCATGCTCCT5618 57
519LGDSAEA5619CTTGGGGATTCTGCTGAGGCG5620228
520GPRNSID5621GGCCCACGTAACTCTATCGAC5622343
521LVRGLTT5623TTGGTTCGTGGTCTTACGACT5624222
522REVSPLM5625CGAGAAGTAAGCCCCCTGATG5626215
523IMPSVTK5627ATAATGCCCTCTGTTACAAAA5628136
524WNSEVSV5629TGGAACAGTGAAGTTTCGGTG5630320
525FHGSDSK5631TTTCATGGTTCTGATTCGAAG5632174
526PNERLTQ5633CCGAATGAGAGGCTTACTCAG5634144
527GQEETGW5635GGGCAGGAGGAGACCGGCTGG5636169
528MVTTTNT5637ATGGTGACGACCACAAACACC5638 77
529NALGDGY5639AACGCGCTGGGCGACGGCTAC5640312
530LGDSAET5641CTTGGGGATTCTGCTGAGACG5642219
531TGAHTEV5643ACCGGAGCACACACCGAAGTC5644 82
532LVGNPST5645CTCGTGGGCAACCCGAGTACG5646287
533FPSMSGK5647TTCCCAAGCATGTCGGGGAAA5648 62
534KGSDTPL5649AAGGGTTCTGATACTCCTTTG5650278
535ANLGESV5651GCCAACCTCGGTGAATCCGTG5652218
536SVDSGLR5653AGTGTTGATAGTGGGCTGCGT5654217
537RTMGDST5655CGGACAATGGGTGACAGTACG5656312
538SLAISQR5657AGCCTGGCTATAAGCCAACGT5658 76
539SEISLSR5659TCTGAGATTAGTCTGTCTCGG5660 75
540LRGTENQ5661TTGCGTGGGACGGAGAATCAG5662237
541SGHVTAL5663AGTGGACACGTCACAGCTTTA5664114
542REISILS5665CGCGAAATATCGATACTATCT5666215
543ASTDFKM5667GCTAGTACTGATTTTAAGATG5668330
544GNSGDHF5669GGTAACTCTGGTGACCACTTC5670149
545AADSSVR5671GCTGCTGACAGCAGCGTTAGA5672 73
546QADSHGR5673CAAGCCGACTCGCACGGCCGT5674 62
547ADYGTSS5675GCGGACTACGGTACCAGCTCT5676108
548SGGVESK5677TCTGGTGGTGTTGAGTCGAAG5678310
549GNLLLTA5679GGTAATTTGCTGCTTACTGCT5680115
550MTDRHRV5681ATGACCGACCGTCACAGGGTC5682187
551MENAPGR5683ATGGAGAATGCTCCTGGGAGG5684 62
552EANHTGY5685GAAGCCAACCACACCGGATAC5686254
553AADRSVR5687GCAGCAGACCGCTCCGTACGT5688 72
554PIIEHAV5689CCCATAATAGAACACGCAGTA5690330
555NVDTSVR5691AATGTTGATACGAGTGTGCGG5692213
556NVTATLG5693AACGTCACAGCAACGCTGGGT5694203
557MKTQIEL5695ATGAAAACGCAAATAGAACTC5696126
558IGPRREV5697ATAGGACCTCGCCGTGAAGTA5698 82
559VLAAVDR5699GTCCTTGCTGCCGTCGACCGA5700211
560SVDSGLL5701AGTGTTGATAGTGGGCTGCTT5702210
561FIVGNGS5703TTCATCGTAGGCAACGGAAGT5704 22
562RYNVETA5705CGGTATAATGTTGAGACTGCG5706333
563AIVSIAQ5707GCGATTGTGTCGATTGCTCAG5708 71
564PDNNPRN5709CCTGATAATAATCCGCGGAAT5710 19
565KTVNVSV5711AAGACTGTGAATGTTAGTGTT5712 69
566QFHENIR5713CAGTTTCATGAGAATATTCGT5714153
567KGYDTPM5715AAAGGCTACGACACACCCATG5716278
568AVITEPK5717GCGGTGATTACTGAGCCTAAG5718207
569VSSTGEW5719GTTAGTTCTACGGGGGAGTGG5720198
570TGSIPSP5721ACCGGTTCAATCCCTTCCCCC5722184
571NQSAELV5723AATCAGTCGGCGGAGCTGGTT5724209
572VIGGLGI5725GTTATTGGTGGGCTTGGGATT5726212
573HFSSETS5727CACTTCTCTTCCGAAACTTCT5728 65
574GYRGVVD5729GGTTATAGGGGGGTTGTGGAT5730149
575SHGTDTK5731AGCCACGGAACGGACACCAAA5732174
576VMASPGP5733GTTATGGCTTCGCCTGGTCCT5734176
577LPNGGGL5735CTGCCGAATGGGGGGGGTTTG5736 30
578VGSVTDS5737GTTGGTAGCGTAACCGACTCC5738206
579TVMTSEP5739ACAGTTATGACCAGCGAACCT5740159
580PGNGTMV5741CCTGGTAACGGCACTATGGTG5742128
581SLGALVA5743TCGCTGGGTGCTCTGGTTGCT5744 68
582YLVTADN5745TATTTGGTTACTGCTGATAAT5746 45
583LTHLRVS5747CTGACTCACCTTCGTGTCAGC5748305
584HTVGSYV5749CATACGGTTGGGAGTTATGTT5750216
585LEDRSAS5751TTGGAGGATCGGTCGGCTAGT5752204
586VTTASPV5753GTGACTACGGCTTCTCCTGTG5754 67
587GVLGQTD5755GGTGTGTTGGGGCAGACTGAT5756149
588DIDRLHK5757GATATTGATAGGCTGCATAAG5758 12
589HNPGMDK5759CATAATCCGGGGATGGATAAG5760337
590TVGLTIA5761ACTGTGGGTTTGACGATTGCG5762201
591SPPPNAR5763AGCCCGCCGCCGAACGCGCGT5764183
592FLLGHTD5765TTCCTTCTGGGGCACACGGAC5766119
593VLTSPGP5767GTGCTCACAAGCCCGGGACCG5768176
594TAYDTLV5769ACGGCGTATGATACGTTGGTT5770339
595SVETGVL5771TCTGTGGAAACTGGCGTCTTA5772200
596TVKEYEL5773ACCGTTAAAGAATACGAACTC5774277
597MTVPGSP5775ATGACGGTTCCGGGTAGTCCG5776101
598YYSITSS5777TACTACTCCATCACATCCAGT5778 88
599STIPTLK5779AGTACTATTCCTACTCTGAAG5780332
600MVQSGLT5781ATGGTTCAGTCGGGGTTGACG5782170
601MGVGGGS5783ATGGGGGTCGGTGGTGGATCC5784110
602RVDSGQL5785AGGGTGGATTCGGGGCAGCTT5786186
603REISNLR5787CGGGAAATAAGCAACCTACGT5788188
604DHVLLTR5789GACCACGTGTTACTTACCCGG5790 75
605TRIGLSD5791ACACGAATAGGACTCAGTGAC5792323
606RVHSAQL5793AGGGTGCATTCGGCGCAGCTT5794302
607YEHSGLL5795TATGAGCATTCTGGTCTTTTG5796170
608VFTGTDT5797GTGTTCACAGGAACCGACACA5798129
609TLAINER5799ACTTTGGCGATTAATGAGCGG5800 85
610LGVTNVA5801CTAGGAGTGACCAACGTGGCC5802340
611GVANVSQ5803GGTGTGGCGAATGTGAGTCAG5804197
612ATVKDSG5805GCAACCGTAAAAGACTCGGGG5806236
613GEIDIAF5807GGAGAAATCGACATAGCCTTC5808 75
614LSLTDGV5809TTGTCCTTAACCGACGGAGTG5810259
615VLLMDRV5811GTACTTCTTATGGACCGAGTT5812296
616SSADYQV5813AGTTCTGCGGATTATCAGGTT5814330
617APRDPGV5815GCGCCGCGTGATCCTGGTGTT5816327
618AQAQTGW5817GCTCAAGCACAGACCGGCTGG5818169
619SNLHTST5819AGTAATCTTCATACTTCGACT5820303
620RVDSGLL5821AGGGTTGATAGTGGGCTGCTT5822 86
621SGGRITD5823AGCGGAGGGCGCATCACCGAC5824179
622LGIGQGP5825TTGGGTATTGGTCAGGGTCCT5826184
623MGGVTSV5827ATGGGGGGGGTTACTTCGGTG5828195
624SIYDNVK5829TCGATATACGACAACGTCAAA5830339
625VTSDAGW5831GTCACCTCTGACGCAGGGTGG5832309
626AHTEMSH5833GCCCACACCGAAATGTCTCAC5834261
627LLTQDAR5835TTGCTTACTCAGGATGCTCGG5836193
628YVGSPLV5837TATGTTGGTTCTCCGTTGGTG5838111
629ENAGTDV5839GAAAACGCCGGAACTGACGTC5840 29
630SIYDNDT5841TCCATCTACGACAACGACACC5842289
631AATSGGP5843GCAGCCACCAGTGGCGGGCCG5844 62
632STIPTLM5845TCGACGATACCAACCTTGATG5846308
633SGMQAEA5847TCGGGTATGCAGGCGGAGGCT5848192
634GNGDMFA5849GGGAATGGGGATATGTTTGCT5850264
635LNGGIGV5851CTTAATGGGGGTATTGGGGTT5852164
636TAVERAW5853ACGGCTGTTGAGCGGGCGTGG5854205
637NNGIVIA5855AATAATGGGATTGTGATTGCG5856305
638GPDTGAM5857GGCCCCGACACAGGCGCGATG5858315
639PSRGIPL5859CCGAGTCGTGGTATTCCTCTT5860341
640VGGAGEI5861GTTGGTGGGGCGGGTGAGATT5862101
641VLQLAAL5863GTTCTTCAACTCGCTGCCCTC5864 66
642LSDGGPL5865CTCTCGGACGGAGGCCCCCTC5866286
643VSGGVLD5867GTATCCGGCGGAGTACTAGAC5868161
644MSITEPR5869ATGTCTATTACTGAGCCGCGG5870207
645SGSNTGP5871AGCGGCTCCAACACTGGCCCG5872184
646SLRDTHY5873AGTCTTCGGGATACTCATTAT5874318
647MGDAGLR5875ATGGGGGATGCGGGGCTGCGG5876217
648IVMSSHI5877ATCGTCATGAGCTCCCACATC5878189
649ASPLPQT5879GCTAGTCCCTTGCCCCAAACC5880122
650SEISILR5881AGTGAGATTAGTATTCTGCGG5882188
651SEGLSRD5883TCGGAGGGTCTTTCGCGTGAT5884306
652LGSLVVH5885CTGGGAAGCTTAGTCGTTCAC5886114
653ETRLDSK5887GAAACCCGACTCGACTCGAAA5888 54
654ANQLAPV5889GCCAACCAATTGGCCCCCGTG5890272
655LFGPSAY5891TTATTCGGACCTTCCGCCTAC5892317
656SMTSESS5893TCAATGACTTCGGAATCGTCT5894 65
657MTDSGTV5895ATGACTGATAGTGGGACTGTG5896187
658FQVEQIM5897TTTCAGGTTGAGCAGATTATG5898329
659RVDSEQL5899AGGGTGGATTCGGAGCAGCTT5900146
660SNTGVTV5901TCGAATACTGGTGTTACGGTG5902180
661ITQAVYI5903ATCACACAAGCGGTATACATC5904300
662GALSSTK5905GGTGCTCTTTCTTCGACTAAG5906 63
663IMVDAHS5907ATTATGGTTGATGCTCATTCG5908175
664GSGVQPV5909GGGTCCGGCGTACAACCGGTA5910264
665GSGPGVA5911GGTTCTGGGCCGGGGGTGGCT5912163
666KHSSEMT5913AAACACAGCTCAGAAATGACC5914346
667TRTEDYT5915ACGCGTACGGAGGATTATACT5916349
668ILNPTAV5917ATTCTTAATCCGACGGCGGTG5918  5
669IGSSLSP5919ATTGGGTCGTCGCTTAGTCCT5920184
670SGFVVPV5921TCTGGGTTTGTTGTGCCGGTG5922114
671RTTPDVT5923CGCACGACCCCCGACGTAACA5924107
672APTATLR5925GCGCCTACTGCTACTCTTCGG5926183
673YDRIMSS5927TACGACCGCATAATGTCATCT5928168
674YGSNDLS5929TATGGGAGTAATGATCTGAGT5930110
675GDRGVVA5931GGTGATAGGGGGGTTGTGGCT5932 53
676QAALSDR5933CAAGCGGCACTATCAGACCGG5934182
677AADSSGR5935GCGGCGGATAGTTCTGGGCGG5936 62
678PTLGTLR5937CCTACTCTGGGGACGCTTCGG5938128
679PNLGNPS5939CCCAACCTCGGAAACCCATCT5940241
680PTQGTNR5941CCAACACAAGGTACAAACAGG5942128
681SRGVISS5943AGCCGAGGCGTAATCTCGTCA5944310
682ASVSSLR5945GCTAGTGTGTCTTCGCTGCGT5946 61
683LRVTEDL5947TTGCGTGTGACGGAGGATCTG5948237
684MTGLDDV5949ATGACGGGCCTAGACGACGTA5950142
685TSLGPMV5951ACTTCGTTAGGCCCGATGGTC5952323
686MAGGVQV5953ATGGCGGGTGGGGTGCAGGTT5954177
687NGASLAS5955AACGGAGCTTCCCTCGCAAGC5956 59
688TNGVLYT5957ACAAACGGCGTCCTTTACACG5958 93
689MNGGHVQ5959ATGAACGGAGGGCACGTGCAA5960268
690VMASTGP5961GTAATGGCGTCAACAGGACCG5962176
691VLASLGD5963GTACTCGCGTCGTTGGGCGAC5964212
692ILVDALA5965ATTCTGGTTGATGCTCTTGCG5966175
693SADSSVR5967TCGGCGGATAGTTCTGTGCGG5968 58
694NRELALG5969AACCGCGAACTCGCACTCGGG5970263
695SHASDSK5971TCGCACGCATCAGACTCTAAA5972174
696AGHSNAV5973GCTGGGCATTCTAATGCGGTT5974158
697MVTPTNS5975ATGGTGACGCCGACCAACAGT5976 77
698TIDRFGS5977ACAATAGACCGATTCGGAAGT5978204
699RGAEVLL5979CGGGGTGCGGAGGTGCTGCTG5980313
700TFAISDR5981ACTTTTGCGATTTCTGATCGG5982262
701SQGSDSK5983AGTCAAGGCTCCGACTCAAAA5984 54
702ASGVRPV5985GCGTCAGGTGTTAGACCGGTA5986272
703SDATGVL5987TCCGACGCTACCGGTGTGCTA5988 44
704LTLSNGV5989CTTACGCTGAGTAATGGGGTG5990171
705DSDSGRR5991GATTCTGATAGTGGGCGGCGG5992217
706LYTSDRV5993CTATACACATCTGACCGAGTG5994248
707QDAHVAI5995CAGGATGCGCATGTGGCTATT5996  0
708IVDSGLL5997ATTGTTGATAGTGGGCTGCTT5998170
709LYGGSSA5999CTCTACGGAGGGTCCTCGGCT6000317
710NFGRDTL6001AATTTTGGTCGTGATACTCTG6002174
711TPVYTVK6003ACCCCCGTCTACACCGTAAAA6004145
712AAVVPRY6005GCAGCAGTAGTACCACGATAC6006122
713SNVALTG6007AGCAACGTTGCACTGACCGGC6008 64
714ASMGTVA6009GCGTCCATGGGAACCGTAGCC6010 53
715TIGVVAN6011ACGATTGGGGTTGTGGCGAAT6012137
716TSVLPQT6013ACGTCTGTGCTTCCTCAGACT6014292
717LHAGESR6015CTTCATGCTGGTGAGTCTAGG6016134
718STTSSPR6017TCTACGACGAGTTCGCCGCGT6018 52
719PGHGPVR6019CCCGGGCACGGACCTGTACGC6020128
720FTSGTGN6021TTCACAAGCGGGACCGGAAAC6022112
721QILGASS6023CAAATCTTAGGGGCCTCGAGT6024 51
722EVRDTKT6025GAAGTTCGGGACACAAAAACG6026246
723VLPSPGP6027GTTCTGCCTTCGCCTGGTCCT6028176
724INNFAPP6029ATAAACAACTTCGCACCGCCC6030139
725ELRPQSS6031GAACTCCGGCCCCAATCATCT6032 65
726LTDKMTS6033TTGACTGATAAGATGACGTCG6034334
727IYPQSST6035ATATACCCACAAAGCTCCACC6036317
728VVSGLLH6037GTTGTCTCCGGGTTGCTACAC6038238
729ATVAGQY6039GCTACCGTGGCAGGCCAATAC6040101
730NLGGVQL6041AACTTAGGAGGCGTCCAATTG6042177
731SEPSGTL6043AGCGAACCCTCCGGAACTTTA6044 25
732MNGGHVI6045ATGAATGGGGGTCATGTTATT6046268
733VVNVGQT6047GTAGTGAACGTCGGACAAACT6048198
734GLTEYTA6049GGTCTAACCGAATACACAGCT6050331
735ILASPGP6051ATACTTGCGTCACCCGGACCG6052176
736VGSVMAS6053GTGGGGTCGGTTATGGCTAGT6054168
737SPQGVLA6055TCGCCGCAGGGGGTTCTTGCT6056274
738VGPSVLQ6057GTAGGTCCATCCGTACTACAA6058161
739GVRDTNI6059GGAGTTCGAGACACAAACATA6060132
740ALQSAQV6061GCACTACAATCTGCACAAGTT6062 50
741GGVSATA6063GGAGGAGTCAGCGCAACGGCT6064165
742NPSPTET6065AACCCTAGCCCGACCGAAACC6066311
743AIVSIAR6067GCGATTGTGTCGATTGCTCGG6068 48
744AVPREGM6069GCCGTCCCGCGCGAAGGAATG6070 34
745PGAHYQA6071CCGGGTGCGCATTATCAGGCT6072196
746SPPSSQR6073TCACCCCCTTCATCCCAACGC6074 58
747RSNTGEW6075CGGTCAAACACCGGCGAATGG6076163
748HLYTGTG6077CACTTATACACTGGCACCGGA6078 44
749NGPMKAD6079AACGGTCCAATGAAAGCAGAC6080 59
750AADTSVR6081GCGGCGGATACTTCTGTGCGG6082 47
751GLEKMTS6083GGTCTGGAGAAGATGACTTCT6084326
752IIISSAN6085ATAATCATATCCTCGGCCAAC6086 45
753GLVKMPT6087GGTCTGGTGAAGATGCCTACT6088326
754SLPPYGR6089AGCCTGCCCCCCTACGGCCGT6090279
755TSLGLMQ6091ACTAGCCTTGGCTTAATGCAA6092323
756LSRGAEN6093CTTTCGAGGGGTGCGGAGAAT6094324
757AVKEYEL6095GCCGTTAAAGAATACGAACTC6096277
758TTPSPRT6097ACGACCCCTAGCCCACGAACA6098292
759HGTLVSK6099CACGGCACCCTTGTTTCCAAA6100178
760VTELTQV6101GTCACCGAACTCACACAAGTC6102162
761NGNMATF6103AATGGGAATATGGCGACTTTT6104336
762EGGDSGG6105GAAGGCGGAGACAGCGGTGGA6106 49
763MGDIVTL6107ATGGGGGATATTGTTACGCTT6108 95
764LIVTENQ6109TTGATTGTGACGGAGAATCAG6110237
765SVATGVL6111AGCGTGGCTACAGGCGTGCTC6112 44
766VSPSVLQ6113GTTAGTCCTTCGGTGCTTCAG6114161
767VTGLTVQ6115GTTACCGGGCTGACAGTACAA6116160
768ASQDRGS6117GCATCTCAAGACCGGGGCTCT6118327
769SSVSSPR6119TCCAGCGTCTCCTCTCCTCGC6120 43
770PILGAST6121CCGATTCTTGGTGCTAGTACG6122328
771SQLSVML6123AGCCAACTTTCAGTAATGCTT6124 42
772TDALTSK6125ACAGACGCACTCACCAGTAAA6126157
773YLEGTLL6127TACCTGGAAGGGACATTGCTC6128113
774YQRTESL6129TATCAGAGGACGGAGTCTCTG6130320
775QGGGSLN6131CAGGGGGGGGGTAGTCTGAAT6132 49
776PGSEIRG6133CCTGGCTCCGAAATAAGAGGC6134 33
777PSRGITL6135CCGAGTCGTGGTATTACTCTT6136341
778GVAGTDS6137GGGGTGGCTGGGACGGATTCT6138156
779MNGGHVM6139ATGAACGGTGGACACGTGATG6140268
780TVPNTDL6141ACTGTGCCTAATACTGATTTG6142 60
781GHQALNA6143GGCCACCAAGCATTAAACGCC6144235
782DPKTGWR6145GATCCGAAGACTGGGTGGCGT6146316
783VTQAVYV6147GTTACGCAGGCTGTTTATGTT6148300
784FETGGVS6149TTCGAAACCGGAGGCGTTTCC6150240
785IADMGGN6151ATTGCTGATATGGGTGGTAAT6152 62
786PGYSSQT6153CCGGGGTATAGTTCTCAGACG6154158
787LLLGVQS6155CTCCTATTAGGAGTACAATCG6156155
788AVDSSVR6157GCTGTTGACTCCAGCGTTAGA6158 40
789YESTRGQ6159TATGAGTCGACGAGGGGTCAG6160151
790LNSPLHV6161CTGAATAGTCCGCTGCATGTT6162112
791ADTAHPV6163GCCGACACCGCCCACCCCGTT6164245
792LPKGGGF6165CTGCCGAAGGGGGGGGGGTTT6166 30
793EGVSALL6167GAGGGTGTTTCTGCGTTGTTG6168154
794PNERLTL6169CCAAACGAACGTTTGACCTTA6170144
795SGGLMTG6171AGTGGTGGTCTTATGACTGGT6172179
796VIETRLS6173GTCATCGAAACTCGCCTTTCC6174152
797LANMLQV6175TTGGCAAACATGCTTCAAGTG6176167
798SPTSSPH6177TCACCTACATCCTCACCACAC6178 52
799GVGGTYS6179GGAGTTGGGGGCACATACAGT6180234
800AAESSVR6181GCGGCGGAGAGTTCTGTGCGG6182 38
801MNDAGRD6183ATGAATGATGCTGGGCGTGAT6184286
802GISGEVS6185GGTATTTCGGGGGAGGTGAGT6186149
803PQLIVPK6187CCTCAGCTTATTGTTCCTAAG6188244
804LRVTENQ6189TTGCGTGTGACGGAGAATCAG6190237
805TSPGLMV6191ACATCACCCGGCCTGATGGTT6192323
806TTAAIDR6193ACCACTGCAGCCATCGACCGA6194148
807HGNGYLS6195CACGGAAACGGGTACCTTTCA6196110
808VVSDYTV6197GTTGTTAGTGATTATACTGTG6198330
809RLAITER6199AGATTGGCGATTACTGAGCGG6200147
810PGVDTGV6201CCTGGTGTTGATACTGGTGTT6202 99
811DTSASST6203GATACGTCGGCGTCGTCGACT6204 37
812ANEHNIA6205GCTAATGAGCATAATATTGCG6206338
813IAHGYST6207ATCGCCCACGGATACAGCACA6208222
814SLAISER6209AGCTTAGCCATCAGCGAAAGG6210 36
815RDLTTDL6211CGTGATCTGACGACTGATCTG6212159
816EASSRLL6213GAAGCTTCGTCGCGACTTCTC6214 35
817AVKEYQS6215GCTGTTAAAGAATACCAATCT6216277
818GIAVGEV6217GGTATTGCTGTGGGGGAGGTT6218 44
819RSITIGP6219CGTTCGATTACTATTGGGCCG6220133
820LGDGTTR6221CTGGGGGATGGTACGACTCGG6222255
821ALMSSGV6223GCGTTGATGTCCTCGGGGGTT6224 34
822TYSDGTT6225ACTTATAGTGATGGGACGACT6226 90
823ASGEVQS6227GCGTCGGGGGAGGTTCAGTCT6228 33
824FAGVQQA6229TTCGCAGGAGTCCAACAAGCT6230287
825ESSRLQI6231GAGAGTTCGCGTCTTCAGATT6232239
826DSGKDRT6233GATTCTGGTAAGGATCGTACG6234 37
827MLALAVT6235ATGTTGGCGCTGGCTGTGACG6236 32
828GERMGMT6237GGTGAGCGGATGGGTATGACT6238270
829MADGASM6239ATGGCGGATGGTGCGTCTATG6240255
830RHLTSDV6241CGACACCTCACATCCGACGTC6242159
831EVLSLAP6243GAGGTGCTGTCTCTTGCTCCG6244109
832DIAVSMR6245GACATCGCGGTATCGATGAGA6246143
833RSAGTST6247AGGTCTGCAGGAACCTCCACA6248 29
834TSYDTVV6249ACATCATACGACACCGTCGTG6250339
835VGASTAW6251GTGGGCGCCAGCACCGCGTGG6252 88
836RVELTGT6253CGCGTAGAATTGACCGGCACG6254295
837VQGPLTG6255GTGCAGGGTCCGCTGACTGGT6256 14
838DRVISSL6257GATCGGGTTATTAGTTCTTTG6258 37
839PLILSPS6259CCCTTGATCTTATCTCCAAGT6260117
840VRQLDSR6261GTGAGGCAGCTGGATTCGCGG6262 27
841LLAGADR6263TTGCTTGCTGGTGCTGATCGT6264140
842YSTERSV6265TATTCGACTGAGAGGTCTGTT6266320
843GPMASVV6267GGGCCGATGGCGTCTGTGGTT6268216
844MLGGGAS6269ATGCTCGGCGGAGGTGCCTCC6270298
845LRGQPGV6271CTGCGCGGCCAACCCGGCGTG6272164
846GNGTRVL6273GGAAACGGCACCAGGGTCCTA6274172
847GLVQIVA6275GGGCTTGTTCAGATTGTTGCG6276326
848SFRDTVP6277AGTTTTAGGGATACGGTGCCT6278318
849SLNSVKV6279TCCCTAAACTCGGTCAAAGTG6280 28
850HLSRDHS6281CACCTGTCACGTGACCACTCA6282127
851SGDRDQN6283TCTGGTGATCGGGATCAGAAT6284251
852SGPMKAV6285AGTGGGCCGATGAAGGCGGTT6286304
853AGGGTPR6287GCGGGGGGTGGGACTCCGAGG6288 49
854NLRGEHT6289AATTTGCGTGGGGAGCATACG6290131
855GGTGEGP6291GGTGGGACGGGTGAGGGTCCG6292149
856LRVPENQ6293TTGCGTGTGCCGGAGAATCAG6294237
857AAGLILN6295GCCGCAGGCCTCATCCTTAAC6296179
858TGERDQN6297ACTGGTGAACGGGATCAGAAT6298251
859TIAAHVP6299ACCATAGCAGCCCACGTACCC6300 91
860SFAITER6301AGTTTTGCGATTACTGAGCGG6302322
861FTIKDNR6303TTCACCATAAAAGACAACAGA6304237
862ESRENVR6305GAATCCCGTGAAAACGTCAGA6306316
863LPRLGGL6307CTTCCGCGTTTGGGGGGGCTT6308 30
864GPDTGAK6309GGTCCAGACACAGGAGCCAAA6310 10
865TGGLLYS6311ACTGGTGGGCTTCTTTATAGT6312179
866RSGSGVA6313CGGTCGGGCTCCGGAGTCGCC6314163
867LTGSIGL6315TTAACTGGGTCAATTGGACTC6316164
868TLPHAGL6317ACCCTCCCCCACGCAGGGTTA6318 34
869VDHGMGL6319GTTGATCATGGTATGGGTTTG6320212
870TVELNHV6321ACGGTTGAGCTGAATCATGTT6322162
871VLSSDLR6323GTGCTTTCGAGTGATCTTCGT6324118
872TLTYTET6325ACATTGACATACACTGAAACC6326 79
873FIDSQLG6327TTTATTGATAGTCAGCTGGGT6328152
874FSTNSNH6329TTCTCGACCAACAGCAACCAC6330138
875MGASDTL6331ATGGGGGCTAGTGATACGCTT6332 26
876TTGKVSG6333ACCACGGGTAAAGTGTCGGGG6334236
877MDELRGR6335ATGGACGAATTACGCGGCAGA6336157
878SVDNGLL6337TCCGTCGACAACGGCTTACTG6338210
879TGMQVSI6339ACCGGTATGCAAGTGTCGATC6340190
880SVVSGLL6341TCTGTGGTGTCAGGTCTTTTG6342 25
881EQYLGSP6343GAGCAGTATCTGGGTTCTCCG6344 74
882LSHTEGD6345TTATCACACACCGAAGGGGAC6346259
883DFSVAHT6347GACTTCTCTGTAGCGCACACT6348 37
884MATPTNT6349ATGGCAACGCCAACTAACACC6350 77
885NTEDRRV6351AACACAGAAGACCGGCGAGTT6352242
886AGVLKAL6353GCAGGAGTATTAAAAGCCCTC6354304
887NAHALMV6355AACGCCCACGCACTCATGGTC6356 89
888VHVDNSN6357GTGCATGTTGATAATAGTAAT6358 45
889LSIRQGP6359TTGAGTATTCGTCAGGGTCCT6360259
890GVNHAVA6361GGAGTCAACCACGCCGTCGCC6362  4
891AYVTQGG6363GCCTACGTAACACAAGGCGGC6364345
892WDDQTSG6365TGGGATGATCAGACTTCGGGG6366204
893LDLTSDV6367CTTGATCTGACGTCTGATGTG6368159
894IPSDFPN6369ATACCATCCGACTTCCCGAAC6370271
895SLVRGLL6371AGTCTTGTTCGGGGTTTGCTG6372 25
896IVYAVGE6373ATAGTCTACGCTGTTGGAGAA6374133
897LAGLGGM6375CTAGCTGGCCTCGGTGGAATG6376164
898SDEAYRA6377AGCGATGAGGCTTATCGTGCG6378245
899VGQVPGR6379GTGGGGCAAGTCCCGGGTAGG6380168
900SVDSALL6381TCCGTGGACTCTGCTTTGCTG6382 24
901LSLRDGV6383CTATCCCTTAGGGACGGAGTC6384259
902STIPTPM6385TCCACAATCCCAACCCCCATG6386308
903IPRIHSL6387ATTCCTCGGATTCATTCTCTT6388245
904LSGIMVS6389TTGTCGGGGATTATGGTTTCG6390305
905GFVQSRM6391GGGTTTGTTCAGAGTCGGATG6392326
906SSQGTTK6393TCTTCGCAGGGTACGACTAAG6394 23
907NHVGDRL6395AATCATGTTGGTGATCGTTTG6396226
908VESTAFT6397GTTGAGAGTACGGCTTTTACG6398214
909LVAGQAM6399CTGGTGGCGGGGCAGGCTATG6400 21
910GLVRIQD6401GGACTGGTTCGGATCCAAGAC6402326
911TNTDSSL6403ACGAATACGGATTCTAGTCTG6404 20
912TGLQVST6405ACTGGGCTGCAGGTTAGTACT6406227
913AHGDKDL6407GCACACGGCGACAAAGACCTT6408291
914SSANLSN6409TCGTCCGCCAACCTTTCGAAC6410 19
915EIAFTVP6411GAAATAGCATTCACCGTACCT6412120
916SGEPLGL6413TCTGGGGAGCCGCTTGGGCTT6414279
917QNVGVTK6415CAAAACGTAGGAGTTACGAAA6416 64
918LSNLSNG6417CTTAGTAATCTGTCGAATGGT6418247
919LSTGEEM6419CTTTCGACGGGTGAGGAGATG6420232
920EGGGAQR6421GAGGGGGGTGGGGCTCAGAGG6422 49
921DVRGSVN6423GACGTCCGGGGGTCTGTCAAC6424149
922LNGDTGY6425CTTAATGGTGATACGGGGTAT6426164
923MGDNYDR6427ATGGGCGACAACTACGACCGC6428228
924QLRPLQT6429CAACTGCGTCCTTTGCAAACG6430318
925AGVMNDL6431GCCGGTGTTATGAACGACCTT6432304
926LLENARV6433CTGCTGGAGAATGCGAGGGTG6434243
927LVVDASR6435TTGGTAGTAGACGCAAGTCGC6436 18
928LATHDAR6437CTCGCAACGCACGACGCACGA6438193
929VRQLDSN6439GTAAGACAACTTGACTCTAAC6440 66
930QARDTKT6441CAAGCTCGAGACACCAAAACA6442246
931TGDREQN6443ACTGGTGATCGGGAACAGAAT6444251
932SGAAAAT6445AGCGGGGCCGCAGCCGCCACC6446 17
933GRKGEGP6447GGTAGGAAGGGTGAGGGTCCG6448149
934QNVGVTQ6449CAGAATGTGGGGGTGACTCAG645064
935DSAPAAR6451GATTCGGCTCCGGCGGCTCGG6452  16
936ANQNVII6453GCAAACCAAAACGTAATAATA6454265
937GAHIVSA6455GGGGCGCACATAGTCTCCGCA6456106
938NSDLASP6457AATAGTGATTTGGCGTCTCCT6458242
939AASMVVG6459GCTGCGAGTATGGTTGTTGGG6460269
940VVSEIPL6461GTCGTTAGCGAAATCCCCCTC6462271
941QAESAAR6463CAAGCGGAATCAGCGGCTAGA6464  16
942LSKEHAH6465TTATCGAAAGAACACGCCCAC646657
943TNLADTA6467ACTAATCTGGCTGATACTGCG6468273
944ADREVRY6469GCGGATCGGGAGGTGCGTTAT6470 15
945NISVTPV6471AATATTAGTGTTACGCCGGTT6472276
946PSRGNEG6473CCCAGTCGCGGGAACGAAGGC6474294
947MMLNQGS6475ATGATGCTTAACCAAGGCAGC6476259
948VHSQDVS6477GTGCATTCGCAGGATGTGTCT6478346
949ANAEVQR6479GCGAATGCGGAGGTTCAGCGT6480 15
950LGPGITL6481TTAGGCCCCGGTATCACCCTC6482 95
951NVAELVA6483AACGTCGCAGAATTGGTGGCA6484288
952ILSGLTS6485ATTCTGAGTGGGTTGACTTCT6486 14
953VNVSPTT6487GTGAATGTTAGTCCTACTACT6488 13
954GERDARI6489GGTGAGAGGGATGCTAGGATT6490315
955IGMSAST6491ATAGGTATGAGCGCGTCCACC6492 13
956LSRGEEK6493CTTTCGAGGGGTGAGGAGAAG6494294
957KNKGVDP6495AAAAACAAAGGCGTCGACCCA6496337
958HQDRTTL6497CATCAGGATAGGACGACGCTT6498144
959RISTEGT6499CGCATCAGCACAGAAGGCACT6500295
960ALSGLAK6501GCACTGTCCGGACTCGCAAAA6502 12
961IGASVKL6503ATCGGTGCATCGGTAAAACTG6504 11
962ISLNAAE6505ATTTCGCTGAATGCGGCGGAG6506  9
963SHGSDTK6507TCCCACGGAAGTGACACCAAA6508174
964HGRDALV6509CATGGGCGGGATGCTCTTGTG6510154
965GVAGTYL6511GGGGTGGCTGGGACGTATCTG6512234
966STEGAAL6513AGTACGGAGGGGGCGGCTCTG6514  8
967SVMGVVR6515TCCGTCATGGGAGTAGTTCGT6516  7
968SVVVTAR6517TCGGTCGTCGTAACAGCTCGG6518  6
969PLVGAPV6519CCGCTGGTTGGGGCTCCGGTT6520328
970MGGATNP6521ATGGGGGGGGCTACTAATCCG6522195
971NGPMEAV6523AACGGACCAATGGAAGCAGTC6524333
972QVTDTKT6525CAGGTGACTGATACTAAGACT6526246
973NSKDVQR6527AACTCCAAAGACGTACAAAGA6528 15
974RTTEPRF6529CGTACTACGGAGCCTCGTTTT6530248
975VVGLTAA6531GTTGTCGGCTTAACCGCAGCG6532  5
976PGEHYQV6533CCTGGCGAACACTACCAAGTG6534196
977RAVENMG6535CGCGCAGTAGAAAACATGGGC6536349
978RVMGEEV6537CGTGTGATGGGGGAGGAGGTT6538312
979KYSGAES6539AAATACTCTGGCGCGGAATCT6540348
980MLVTETV6541ATGTTGGTCACTGAAACGGTA6542259
981VFVEKSA6543GTTTTTGTTGAGAAGAGTGCG6544344
982SVNQAVT6545TCTGTGAATCAGGCGGTTACG6546  4
983GHSATAA6547GGACACTCCGCTACCGCCGCA6548235
984HDTSASV6549CATGATACTAGTGCTAGTGTT6550223
985SVDSGLI6551TCCGTAGACTCCGGACTTATC6552220
986VGKVMDV6553GTCGGAAAAGTCATGGACGTC6554206
987AIVSIAK6555GCCATCGTTTCAATAGCAAAA6556  3
988PAEHYQA6557CCGGCTGAGCATTATCAGGCT6558196
989VSLTDGL6559GTGAGTTTGACTGATGGGCTT6560259
990SVTDVNH6561TCTGTTACTGATGTTAATCAT6562261
991GLNEHEA6563GGTCTGAATGAGCATGAGGCG6564331
992ANVGRDD6565GCAAACGTTGGCCGCGACGAC6566218
993AGYSSLT6567GCTGGATACTCGTCACTCACA6568158
994QLQPQQT6569CAACTCCAACCCCAACAAACC6570 50
995MRVTENQ6571ATGAGGGTCACTGAAAACCAA6572237
996ITEQTTI6573ATTACTGAGCAGACTACTATT6574  2
997VVDSDNL6575GTTGTTGATTCGGATAATCTG6576141
998FSTDTSS6577TTTTCGACGGATACGTCGTCT6578138
Macaque all CNS_sequence Rank
SEQ
ID
RankPeptideNO:SequenceSEQ ID NO:
1PSQGTLR6579CCTTCTCAGGGGACGCTTCGG6580
2PTQGTVR6581CCCACACAAGGCACAGTCCGT6582
3TDALTTK6583ACTGATGCGCTTACGACTAAG6584
4TDAGDGK6585ACAGACGCGGGGGACGGCAAA6586
5MTGISIV6587ATGACAGGCATCTCTATCGTA6588
6NGYTEGR6589AATGGGTATACGGAGGGGCGT6590
7PTQGTVR6591CCTACTCAGGGGACGGTTCGG6592
8SLVTSST6593TCGCTTGTTACTTCTAGTACG6594
9PTQGTFR6595CCGACACAAGGAACATTCAGG6596
10PTQGTIR6597CCTACTCAGGGGACGATTCGG6598
11AIVSIAQ6599GCGATTGTGTCGATTGCTCAG6600
12LTSGLAA6601TTGACGTCTGGTTTGGCGGCG6602
13PTQGTFR6603CCTACTCAGGGGACGTTTCGG6604
14STIPTMK6605AGTACTATTCCTACTATGAAG6606
15GTVGSMV6607GGTACGGTGGGTTCTATGGTT6608
16ELMASTI6609GAGCTTATGGCTTCTACTATT6610
17SPVGIIA6611TCTCCTGTGGGTATTATTGCG6612
18TSREEQW6613ACTTCTCGTGAGGAGCAGTGG6614
19RASADVV6615AGGGCGAGTGCGGATGTTGTG6616
20RYNDEST6617AGATACAACGACGAATCCACT6618
21HTIAASM6619CACACCATAGCCGCAAGTATG6620
22HTQGTLR6621CATACTCAGGGGACGCTTCGG6622
23DGRAELR6623GATGGGCGGGCGGAGTTGCGT6624
24AADSSAR6625GCCGCTGACTCATCGGCCCGT6626
25NLGAALS6627AACCTTGGGGCTGCCCTATCG6628
26ALNEHVA6629GCTCTGAATGAGCATGTGGCG6630
27IMVDAHS6631ATTATGGTTGATGCTCATTCG6632
28PAQGTLR6633CCGGCGCAAGGAACACTACGA6634
29SIGDLGK6635AGTATCGGTGACCTAGGTAAA6636
30PSQGTLR6637CCGTCCCAAGGAACACTCAGG6638
31PTHGTLR6639CCGACCCACGGTACACTGCGA6640
32NRELALG6641AACCGCGAACTCGCACTCGGG6642
33AGGGDPR6643GCTGGTGGAGGTGACCCCCGA6644
34PNERPTV6645CCCAACGAACGTCCAACGGTC6646
35AGVSASL6647GCGGGTGTTTCTGCGTCGTTG6648
36VIAGVGI6649GTAATCGCGGGAGTAGGCATC6650
37AIQTNDA6651GCAATCCAAACCAACGACGCG6652
38RTMGDST6653CGGACAATGGGTGACAGTACG6654
39KNQDMQV6655AAGAATCAGGATATGCAGGTG6656
40NGNMATF6657AATGGGAATATGGCGACTTTT6658
41MVTHTNK6659ATGGTAACACACACAAACAAA6660
42SLLLTTP6661TCATTACTATTGACGACACCC6662
43SSVQGIL6663TCCTCAGTCCAAGGAATACTA6664
44RIVDSVP6665AGGATTGTGGATAGTGTTCCG6666
45PTEGTLR6667CCGACAGAAGGCACACTGCGA6668
46YLVTTEN6669TATTTGGTTACTACTGAGAAT6670
47PTQGTLR6671CCTACTCAGGGGACGCTTCGG6672
48LLAGADR6673TTGCTTGCTGGTGCTGATCGT6674
49HSKGFDY6675CACAGTAAAGGTTTCGACTAC6676
50FQVEQVK6677TTTCAGGTTGAGCAGGTTAAG6678
51TGGRDQY6679ACTGGTGGTCGGGATCAGTAT6680
52PTPGTLR6681CCTACTCCGGGGACGCTTCGG6682
53LGDSAEA6683CTTGGGGATTCTGCTGAGGCG6684
54RVDSEQH6685AGGGTGGATTCGGAGCAGCAT6686
55SVDSGML6687AGTGTTGATAGTGGGATGCTT6688
56NTENASR6689AATACTGAGAATGCGTCGCGG6690
57PTLGTLR6691CCTACTCTGGGGACGCTTCGG6692
58RVALDLP6693AGGGTGGCGCTGGATTTGCCG6694
59AGDRDQY6695GCTGGTGATCGGGATCAGTAT6696
60VIALTEA6697GTGATTGCGTTGACGGAGGCT6698
61STIHTLK6699AGTACTATTCATACTCTGAAG6700
62REALALT6701AGGGAGGCGCTGGCTCTGACG6702
63TTSGNLM6703ACGACGTCGGGGAATCTTATG6704
64LTLSNGV6705CTTACGCTGAGTAATGGGGTG6706
65GVVNDER6707GGCGTTGTCAACGACGAACGG6708
66DKVVDEV6709GATAAGGTGGTTGATGAGGTG6710
67MAASVTL6711ATGGCGGCTAGTGTTACGCTT6712
68LTGERIL6713CTAACCGGTGAACGCATACTT6714
69NTVVNDP6715AACACAGTCGTGAACGACCCT6716
70VNAALGI6717GTTAATGCTGCGCTTGGGATT6718
71SRELTGS6719TCGAGGGAGTTGACTGGGTCG6720
72TVDSPMR6721ACCGTCGACAGCCCTATGCGA6722
73ATDSSVR6723GCCACCGACAGCAGTGTCCGT6724
74LESLSHH6725CTGGAGTCGCTTTCTCATCAT6726
75ALGTQGS6727GCTCTGGGTACGCAGGGTTCT6728
76RTTPDVP6729CGTACGACTCCCGACGTACCT6730
77VLNDNLA6731GTGTTAAACGACAACTTAGCT6732
78CNAAGCP6733TGTAATGCTGCGGGGTGTCCG6734
79LHAGESR6735CTTCATGCTGGTGAGTCTAGG6736
80HAGLGVT6737CATGCTGGTCTTGGTGTTACT6738
81KIGENAS6739AAGATTGGTGAGAATGCTTCT6740
82NAHDTET6741AATGCGCATGATACTGAGACT6742
83TGLQDSN6743ACAGGATTGCAAGACTCGAAC6744
84ASNPGRW6745GCGAGTAACCCTGGAAGGTGG6746
85VVGTQDR6747GTTGTGGGGACTCAGGATAGG6748
86VHASSPT6749GTGCATGCTTCTAGTCCGACT6750
87NGLQVSI6751AATGGGCTGCAGGTTAGTATT6752
88TVPNTDL6753ACTGTGCCTAATACTGATTTG6754
89FLLGHTD6755TTCCTTCTGGGGCACACGGAC6756
90REISILS6757CGCGAAATATCGATACTATCT6758
91DSLLPQT6759GATTCTCTGCTTCCTCAGACT6760
92RGGVTTE6761CGTGGAGGCGTAACCACCGAA6762
93HVASAGA6763CATGTTGCTTCGGCGGGGGCG6764
94FGGVINA6765TTCGGGGGAGTAATAAACGCT6766
95NGMGDVT6767AACGGCATGGGGGACGTTACT6768
96TTVEVSG6769ACAACCGTAGAAGTAAGCGGC6770
97VGGNVVH6771GTTGGTGGTAATGTTGTTCAT6772
98PMRPGVA6773CCGATGCGGCCGGGTGTGGCT6774
99RESGEQA6775AGGGAGAGTGGGGAGCAGGCT6776
100LSLTEGV6777CTGAGTTTGACTGAGGGGGTT6778
101PKPSHGE6779CCTAAACCATCTCACGGAGAA6780
102YQRTESL6781TATCAGAGGACGGAGTCTCTG6782
103PGEHYRL6783CCTGGAGAACACTACAGATTG6784
104TSVESNL6785ACGTCGGTGGAGTCGAATCTT6786
105GSGAGLH6787GGAAGTGGAGCTGGCCTTCAC6788
106MAGGTNP6789ATGGCAGGTGGCACAAACCCT6790
107RESLEAL6791AGGGAGAGTCTTGAGGCGTTG6792
108KGYDTPM6793AAAGGCTACGACACACCCATG6794
109DQLNDGR6795GATCAGCTGAATGATGGGCGG6796
110TIGVVAN6797ACGATTGGGGTTGTGGCGAAT6798
111LSTGSQL6799CTGTCTACGGGGTCGCAGCTG6800
112VPGSTTT6801GTTCCAGGCTCAACGACTACC6802
113MGDAGLR6803ATGGGGGATGCGGGGCTGCGG6804
114SPTSSPH6805TCACCTACATCCTCACCACAC6806
115ESLAGVR6807GAATCGTTGGCAGGGGTGCGT6808
116KLAEGVR6809AAACTAGCCGAAGGAGTGCGG6810
117GNSGDHF6811GGTAACTCTGGTGACCACTTC6812
118AVITEPK6813GCGGTGATTACTGAGCCTAAG6814
119APTATLR6815GCGCCTACTGCTACTCTTCGG6816
120STFSTVM6817AGCACATTCTCCACTGTTATG6818
121VLASLGD6819GTACTCGCGTCGTTGGGCGAC6820
122AAGVIPN6821GCCGCCGGAGTGATACCTAAC6822
123PGEPLRL6823CCGGGAGAACCCTTGCGACTC6824
124VTSDAGW6825GTCACCTCTGACGCAGGGTGG6826
125STLISET6827TCCACGTTGATATCAGAAACC6828
126VGGAGEI6829GTTGGTGGGGCGGGTGAGATT6830
127KTAQVQP6831AAGACGGCGCAGGTGCAGCCG6832
128SMNGTSL6833AGTATGAATGGGACTAGTCTT6834
129MVGLMGA6835ATGGTGGGTCTGATGGGGGCT6836
130LNVVDLQ6837TTGAACGTTGTGGACTTGCAA6838
131SVVSGLL6839TCTGTGGTGTCAGGTCTTTTG6840
132MAGGVQV6841ATGGCGGGTGGGGTGCAGGTT6842
133SVTERSG6843TCTGTGACGGAGAGGAGTGGT6844
134PNQGTLR6845CCAAACCAAGGTACTCTACGA6846
135GLNEHEA6847GGTCTGAATGAGCATGAGGCG6848
136RVENGGT6849CGAGTGGAAAACGGCGGGACC6850
137VSLTDGL6851GTGAGTTTGACTGATGGGCTT6852
138SRLENIS6853TCGCGTCTTGAAAACATCTCC6854
139SVVPNVQ6855TCGGTGGTGCCGAATGTGCAG6856
140TSMGIMV6857ACGAGTATGGGTATTATGGTG6858
141TGLGDRA6859ACCGGCTTGGGAGACAGGGCT6860
142VGSVTDS6861GTTGGTAGCGTAACCGACTCC6862
143RTDGADH6863CGCACAGACGGAGCAGACCAC6864
144MSISEPR6865ATGTCTATTAGTGAGCCGCGG6866
145MGGVTNP6867ATGGGAGGTGTCACCAACCCC6868
146PSRGNEG6869CCCAGTCGCGGGAACGAAGGC6870
147SAGGSLQ6871AGTGCTGGTGGGAGTCTTCAG6872
148GPRNSID6873GGCCCACGTAACTCTATCGAC6874
149VLSGEEL6875GTTCTTAGTGGGGAGGAGTTG6876
150TRTEDYT6877ACGCGTACGGAGGATTATACT6878
151VTGHPTL6879GTTACGGGTCATCCGACTCTT6880
152LETVGSP6881CTGGAGACGGTTGGTTCTCCG6882
153VGRDFPA6883GTGGGTCGGGATTTTCCGGCT6884
154STQGGLA6885AGTACCCAAGGCGGATTAGCG6886
155TAVERAW6887ACGGCTGTTGAGCGGGCGTGG6888
156PLVGAPV6889CCGCTGGTTGGGGCTCCGGTT6890
157ITQAAYV6891ATCACACAAGCGGCGTACGTG6892
158LGGDVVA6893TTGGGTGGTGATGTGGTGGCG6894
159NGSSIGV6895AACGGCTCATCTATCGGCGTG6896
160TGSIPSP6897ACCGGTTCAATCCCTTCCCCC6898
161GLEKMTS6899GGTCTGGAGAAGATGACTTCT6900
162QADDHGR6901CAGGCGGATGATCATGGTAGG6902
163TMLAGSI6903ACCATGCTAGCAGGCAGCATC6904
164ASIPTLN6905GCATCCATACCAACGCTAAAC6906
165LHNLTQP6907CTTCATAATCTTACGCAGCCT6908
166LTAISDH6909CTTACGGCGATTAGTGATCAT6910
167VAALGMT6911GTTGCTGCTTTGGGTATGACT6912
168ALGDALR6913GCACTAGGCGACGCATTACGC6914
169GSGNGGS6915GGTAGTGGGAATGGTGGGAGT6916
170SADSSVR6917TCGGCGGATAGTTCTGTGCGG6918
171SVATGVL6919AGCGTGGCTACAGGCGTGCTC6920
172LVTGMSS6921CTTGTTACTGGGATGAGTTCT6922
173MVTSGLT6923ATGGTTACGTCGGGGTTGACG6924
174PREHNQA6925CCGCGTGAGCATAATCAGGCT6926
175TSPGLMV6927ACATCACCCGGCCTGATGGTT6928
176PQHIDPE6929CCTCAGCATATTGATCCTGAG6930
177GGVSATA6931GGAGGAGTCAGCGCAACGGCT6932
178SLVQGTV6933AGTCTTGTGCAGGGGACTGTT6934
179SGEPLGL6935TCTGGGGAGCCGCTTGGGCTT6936
180SQLSVML6937AGCCAACTTTCAGTAATGCTT6938
181TLTDVVH6939ACTCTTACTGATGTGGTGCAT6940
182QDGPAVK6941CAGGATGGGCCTGCGGTGAAG6942
183RGQSDPL6943CGGGGGCAGTCTGATCCGTTG6944
184IMVGTTT6945ATAATGGTAGGTACGACTACG6946
185LGSDESR6947CTGGGGTCGGATGAGAGTCGG6948
186NLGGVQL6949AACTTAGGAGGCGTCCAATTG6950
187SEQNKVW6951TCCGAACAAAACAAAGTATGG6952
188ANSHTNS6953GCAAACAGTCACACCAACTCT6954
189ATVKDSG6955GCAACCGTAAAAGACTCGGGG6956
190NGLSAST6957AATGGGCTGTCTGCTTCTACT6958
191VMASTGP6959GTAATGGCGTCAACAGGACCG6960
192VTTHSPV6961GTTACCACCCACAGTCCAGTT6962
193LSLNDVV6963CTGAGTTTGAATGATGTGGTT6964
194AHTEMSH6965GCCCACACCGAAATGTCTCAC6966
195DVAVSMI6967GATGTTGCTGTTTCTATGATT6968
196FPAGVGQ6969TTTCCGGCTGGTGTTGGGCAG6970
197VTTLSPV6971GTCACGACTTTGAGTCCAGTT6972
198LIGAALD6973CTAATCGGCGCAGCACTCGAC6974
199VLSSDLR6975GTGCTTTCGAGTGATCTTCGT6976
200QHGAEAR6977CAGCATGGGGCGGAGGCGAGG6978
201SHGSDPK6979TCTCATGGTTCTGATCCGAAG6980
202MNGGNVL6981ATGAACGGCGGCAACGTGCTC6982
203RVDSGLL6983AGGGTTGATAGTGGGCTGCTT6984
204IPSTGAQ6985ATTCCGAGTACGGGGGCGCAG6986
205VIAGLGF6987GTAATCGCAGGCTTAGGTTTC6988
206FGVSALS6989TTTGGTGTTAGTGCTCTTTCT6990
207SPAGLLA6991TCGCCGGCGGGGTTGCTTGCG6992
208STIPTPM6993TCCACAATCCCAACCCCCATG6994
209PKPSHGE6995CCGAAGCCTAGTCATGGTGAG6996
210DALSSLR6997GACGCTTTATCCAGCTTGCGA6998
211STDMRSP6999AGTACGGATATGAGGTCGCCG7000
212AVFSSQK7001GCTGTATTCTCCAGTCAAAAA7002
213RSEVNGV7003CGGAGTGAGGTGAATGGGGTT7004
214ATVAGQY7005GCTACCGTGGCAGGCCAATAC7006
215SVVVTAR7007TCGGTCGTCGTAACAGCTCGG7008
216GIDTSQP7009GGCATAGACACATCCCAACCC7010
217VVQVPGR7011GTAGTGCAAGTACCAGGACGC7012
218ITGVYDK7013ATAACTGGCGTTTACGACAAA7014
219VVDSYNL7015GTGGTAGACTCTTACAACTTA7016
220SLGEGRH7017AGTTTAGGCGAAGGGCGTCAC7018
221SIGLPAQ7019AGTATTGGGCTTCCTGCGCAG7020
222ASTVSTV7021GCCTCCACAGTAAGTACGGTC7022
223MEALAVT7023ATGGAAGCTTTGGCGGTAACA7024
224REISNLR7025CGGGAAATAAGCAACCTACGT7026
225VLDTVGN7027GTTCTGGATACGGTTGGTAAT7028
226AGLGSTS7029GCTGGTCTGGGGTCGACTAGT7030
227VEVPSTN7031GTTGAAGTCCCTTCTACGAAC7032
228VDHGGVV7033GTGGATCATGGTGGTGTGGTT7034
229GAHIVSA7035GGGGCGCACATAGTCTCCGCA7036
230TSREELR7037ACAAGTAGGGAAGAATTGCGA7038
231NGSDTTM7039AATGGTTCTGATACTACTATG7040
232KNPGVDT7041AAAAACCCTGGAGTTGACACG7042
233THDKLSV7043ACTCATGATAAGCTTAGTGTT7044
234NADYGGD7045AATGCTGATTATGGGGGTGAT7046
235ILATETS7047ATCCTAGCCACAGAAACCAGC7048
236TTMADPA7049ACAACAATGGCGGACCCCGCC7050
237VTEHTQF7051GTGACTGAGCATACGCAGTTT7052
238LTGISNV7053TTAACCGGCATCTCAAACGTA7054
239PVLAAAN7055CCTGTTCTTGCGGCAGCGAAC7056
240HATVVNS7057CATGCGACTGTTGTTAATTCG7058
241AMDNGAF7059GCTATGGATAATGGTGCTTTT7060
242SVNSIPV7061TCGGTCAACAGTATACCAGTC7062
243NLGVVPL7063AATTTGGGTGTGGTTCCGCTG7064
244QNSNGLL7065CAGAATAGTAATGGGCTTTTG7066
245ESSRLQI7067GAGAGTTCGCGTCTTCAGATT7068
246ERQLDSH7069GAGAGGCAGCTGGATTCGCAT7070
247ATTVSPV7071GCAACCACTGTGAGCCCCGTA7072
248TPPPNGR7073ACGCCTCCTCCTAATGGTAGG7074
249QDGPAVK7075CAAGACGGCCCGGCAGTTAAA7076
250VGVNGSH7077GTGGGTGTGAATGGTTCTCAT7078
251TVPNTVL7079ACAGTACCCAACACAGTCCTT7080
252HGVSIEL7081CATGGTGTTTCGATTGAGCTG7082
253SSQGTTK7083TCTTCGCAGGGTACGACTAAG7084
254TVGHDNK7085ACCGTAGGACACGACAACAAA7086
255QGGHSGG7087CAGGGTGGTCATAGTGGGGGT7088
256LTDGTVV7089CTTACTGATGGGACTGTTGTT7090
257VGASTAW7091GTGGGCGCCAGCACCGCGTGG7092
258VSRGEEM7093GTAAGCCGCGGCGAAGAAATG7094
259SIYDNDT7095TCCATCTACGACAACGACACC7096
260RETVDST7097CGTGAGACTGTGGATAGTACT7098
261STEGAAL7099AGTACGGAGGGGGCGGCTCTG7100
262ASREVIY7101GCATCGAGAGAAGTCATCTAC7102
263TEALAVK7103ACAGAAGCACTTGCGGTAAAA7104
264PTPGTLR7105CCGACACCAGGAACTTTAAGA7106
265GSGGVSV7107GGTTCGGGTGGTGTTAGTGTG7108
266PTQGVSM7109CCAACCCAAGGAGTTTCGATG7110
267VRQLDSN7111GTAAGACAACTTGACTCTAAC7112
268MGVLTTV7113ATGGGGGTGTTGACTACGGTG7114
269SLSDGSL7115TCTCTGTCTGATGGTTCTCTT7116
270HSEGVGR7117CACTCGGAAGGAGTCGGACGC7118
271WNLDMNN7119TGGAACCTAGACATGAACAAC7120
272LGHKAGD7121TTGGGGCATAAGGCTGGTGAT7122
273NLGVVNL7123AACTTAGGCGTCGTCAACCTT7124
274SGGRITD7125AGCGGAGGGCGCATCACCGAC7126
275TVITGAP7127ACTGTGATCACTGGCGCCCCC7128
276SQLAELV7129AGTCAGTTGGCGGAGCTGGTT7130
277SVTDVRH7131TCTGTTACTGATGTTAGGCAT7132
278DVAGSMR7133GATGTTGCTGGTTCTATGCGT7134
279VVQAPGR7135GTTGTTCAGGCTCCTGGGCGT7136
280NLGAALS7137AATCTGGGTGCGGCGCTTTCT7138
281KYSGAES7139AAATACTCTGGCGCGGAATCT7140
282VSATLGQ7141GTATCAGCCACACTAGGCCAA7142
283AGVSELL7143GCGGGTGTTTCTGAGTTGTTG7144
284PLLGNTI7145CCGCTTTTGGGGAATACGATT7146
285TAGLSHP7147ACCGCAGGATTGTCACACCCT7148
286RSNSAEW7149AGATCGAACTCCGCGGAATGG7150
287DTGVGTR7151GATACTGGGGTTGGTACGCGT7152
288PLILSPS7153CCCTTGATCTTATCTCCAAGT7154
289SVNQAVT7155TCTGTGAATCAGGCGGTTACG7156
290AHGERLS7157GCTCACGGAGAAAGACTTAGC7158
291TLASSER7159ACTTTGGCGAGTTCTGAGCGG7160
292VHDSTPL7161GTGCATGATTCGACTCCGTTG7162
293HAAGASS7163CATGCGGCGGGTGCTAGTAGT7164
294GNGTGVL7165GGAAACGGCACCGGGGTCCTA7166
295VLTSPGP7167GTGCTCACAAGCCCGGGACCG7168
296LSTGAQM7169TTATCAACCGGAGCTCAAATG7170
297MIASGLS7171ATGATTGCGTCGGGTTTGTCG7172
298RYDVEST7173CGGTATGATGTTGAGTCTACG7174
299VLVGTSL7175GTTCTTGTTGGGACGAGTTTG7176
300LNTTESK7177CTTAACACCACCGAAAGCAAA7178
301SNIPTLM7179TCGAACATCCCGACATTAATG7180
302VLGGPAV7181GTGCTGGGTGGTCCTGCGGTG7182
303VIGGLGI7183GTTATTGGTGGGCTTGGGATT7184
304DLASAGH7185GATCTGGCGAGTGCTGGGCAT7186
305SVTDIKH7187TCGGTGACGGACATAAAACAC7188
306TNHQEPN7189ACGAATCATCAGGAGCCTAAT7190
307VLNEHVA7191GTCCTTAACGAACACGTAGCT7192
308HNTGMDM7193CATAATACGGGGATGGATATG7194
309GRSQLPM7195GGCCGATCACAACTTCCAATG7196
310DSGKDRT7197GATTCTGGTAAGGATCGTACG7198
311LLAGISI7199TTGCTTGCTGGGATTAGTATT7200
312TDVVLHK7201ACTGACGTCGTATTACACAAA7202
313VIETRLS7203GTCATCGAAACTCGCCTTTCC7204
314FGAELHK7205TTTGGGGCTGAGTTGCATAAG7206
315SHGTDSK7207AGTCACGGCACGGACTCTAAA7208
316AVDSSVR7209GCTGTTGACTCCAGCGTTAGA7210
317PNQGTLR7211CCTAATCAGGGGACGCTTCGG7212
318VVSVTAS7213GTGGTGTCGGTTACGGCTAGT7214
319VNVSYGD7215GTGAATGTTTCGTATGGTGAT7216
320VTTVYPV7217GTTACGACAGTATACCCGGTA7218
321QYVVSGA7219CAGTATGTTGTTAGTGGTGCG7220
322VLSGEVL7221GTCTTGTCTGGAGAAGTCCTT7222
323AAGVILN7223GCGGCGGGTGTTATTCTGAAT7224
324SMTSESS7225TCAATGACTTCGGAATCGTCT7226
325ILVDTHA7227ATTCTGGTTGATACTCATGCG7228
326YVTFGEN7229TACGTAACCTTCGGTGAAAAC7230
327NSDLMGR7231AACAGTGACCTAATGGGCCGA7232
328IVDYQGK7233ATCGTAGACTACCAAGGCAAA7234
329PHQGSES7235CCTCATCAGGGTAGTGAGAGT7236
330LSRGEEK7237CTTTCGAGGGGTGAGGAGAAG7238
331LSRDVAV7239TTGTCGAGGGATGTGGCGGTT7240
332PTQGTLR7241CCAACGCAAGGTACCTTGCGA7242
333REQQKYW7243CGGGAACAACAAAAATACTGG7244
334EQSMGSP7245GAGCAGTCTATGGGTTCTCCG7246
335KGSETPM7247AAAGGGTCAGAAACACCGATG7248
336KEYITAV7249AAAGAATACATAACAGCGGTA7250
337HGTLESQ7251CACGGCACCCTCGAATCGCAA7252
338ESLAGVR7253GAGAGTCTTGCTGGTGTTAGG7254
339NDRNTSS7255AATGATAGGAATACGTCTTCG7256
340AAVSALL7257GCCGCAGTATCCGCACTATTA7258
341LRVTENP7259CTTCGGGTCACCGAAAACCCC7260
342IAILAAS7261ATTGCGATTCTTGCTGCTTCG7262
343MLTGIAT7263ATGTTGACGGGGATTGCTACT7264
344STIPALM7265AGTACTATTCCTGCTCTGATG7266
345AAREVIN7267GCGGCTCGGGAGGTGATTAAT7268
346IVMAEVH7269ATCGTAATGGCCGAAGTACAC7270
347LVVDASR7271TTGGTAGTAGACGCAAGTCGC7272
348HQHMVEG7273CACCAACACATGGTTGAAGGA7274
349ENSGGHF7275GAGAATAGTGGGGGTCATTTT7276
350YSMTVTT7277TATAGTATGACGGTTACGACT7278
351SHASDSK7279TCGCACGCATCAGACTCTAAA7280
352AVSDYTV7281GCCGTGAGCGACTACACAGTC7282
353STIPTLL7283AGTACTATTCCTACTCTGTTG7284
354SPSAFPK7285TCCCCTTCAGCATTCCCAAAA7286
355NFGEVQL7287AATTTTGGTGAGGTTCAGCTG7288
356GIETRGL7289GGAATCGAAACACGCGGTCTC7290
357NVNQDSL7291AACGTAAACCAAGACTCACTC7292
358AGLLTKV7293GCAGGACTCCTTACAAAAGTA7294
359LSIRQGP7295TTGAGTATTCGTCAGGGTCCT7296
360PALQGNF7297CCGGCTCTTCAGGGTAATTTT7298
361LDSGIPR7299CTCGACTCTGGTATCCCCAGA7300
362SYSDGSS7301TCATACTCGGACGGCAGCAGC7302
363FQDTIGV7303TTTCAGGATACGATTGGGGTG7304
364VLLGIDR7305GTTTTGCTAGGAATCGACCGT7306
365SHGYDSK7307TCTCATGGTTATGATTCGAAG7308
366SVDTGLL7309AGCGTCGACACGGGCCTCTTA7310
367EGGGAQR7311GAGGGGGGTGGGGCTCAGAGG7312
368AALSQEF7313GCGGCCCTGTCTCAAGAATTC7314
369NSISLIN7315AACTCTATCAGCCTCATAAAC7316
370LSRGEEM7317CTTTCGAGGGGTGAGGAGATG7318
371IGMSAST7319ATAGGTATGAGCGCGTCCACC7320
372MGSDTTM7321ATGGGTTCTGATACTACTATG7322
373SLVLTSH7323TCATTAGTCCTTACGAGCCAC7324
374LGGDAVA7325CTAGGAGGAGACGCAGTTGCA7326
375PIQGTLR7327CCTATTCAGGGGACGCTTCGG7328
376RVELALT7329AGAGTCGAACTTGCCTTAACA7330
377VDHGGVH7331GTTGACCACGGAGGGGTCCAC7332
378IASDIGR7333ATTGCTTCGGATATTGGTCGG7334
379PNERLAV7335CCTAACGAACGATTGGCAGTC7336
380DLSTFPV7337GACCTCTCGACATTCCCTGTA7338
381DSSKAEW7339GATAGTAGTAAGGCTGAGTGG7340
382GAFAPAT7341GGCGCATTCGCACCAGCAACA7342
383TMSLSLR7343ACTATGTCTCTGTCGTTGCGT7344
384VDDIKSW7345GTTGACGACATAAAATCCTGG7346
385TTLADHA7347ACTACTCTGGCTGATCATGCG7348
386KSDVEYL7349AAATCAGACGTCGAATACCTA7350
387MNGGYVL7351ATGAACGGCGGATACGTACTT7352
388MAVDVTK7353ATGGCAGTCGACGTAACCAAA7354
389TDALTSK7355ACAGACGCACTCACCAGTAAA7356
390AAGGILN7357GCGGCGGGTGGTATTCTGAAT7358
391LSEGRAY7359CTTAGTGAGGGTCGTGCGTAT7360
392SVSHVVV7361TCGGTCTCTCACGTCGTCGTA7362
393FISGALT7363TTCATATCCGGCGCCTTAACT7364
394VTGLTVQ7365GTTACCGGGCTGACAGTACAA7366
395HSASLIE7367CACTCAGCATCCCTCATAGAA7368
396ITQAVYI7369ATCACACAAGCGGTATACATC7370
397ASMSAEH7371GCTAGTATGTCTGCGGAGCAT7372
398SVTDVNH7373TCTGTTACTGATGTTAATCAT7374
399LGTSDVR7375TTGGGGACGAGTGATGTGCGT7376
400QSNHAPV7377CAATCAAACCACGCCCCGGTC7378
401SMAVTAK7379AGTATGGCGGTGACGGCGAAG7380
402RMTGDLT7381CGTATGACTGGAGACCTAACC7382
403SHGSDPK7383AGCCACGGGTCAGACCCTAAA7384
404GLGDSGE7385GGGTTGGGGGATTCGGGTGAG7386
405SVTLLGV7387AGTGTGACTCTGTTGGGTGTG7388
406PNDGPSK7389CCTAATGATGGGCCTAGTAAG7390
407FDSAPRY7391TTTGATTCTGCGCCGCGGTAT7392
408VVDAYNL7393GTCGTAGACGCTTACAACTTA7394
409NEAVNVR7395AATGAGGCTGTTAATGTTCGG7396
410TLALSER7397ACCTTAGCCTTATCAGAACGA7398
411LRDSAEP7399CTCCGAGACTCAGCGGAACCA7400
412PDNNPRN7401CCTGATAATAATCCGCGGAAT7402
413YHASDSK7403TATCATGCTTCTGATTCGAAG7404
414KGYDTNM7405AAAGGCTACGACACAAACATG7406
415HTTGAEM7407CACACTACTGGGGCCGAAATG7408
416GLNDNVA7409GGTCTGAATGATAATGTGGCG7410
417PQLIVPK7411CCTCAGCTTATTGTTCCTAAG7412
418IIVDNGS7413ATAATAGTCGACAACGGATCA7414
419QNESGMK7415CAAAACGAAAGCGGGATGAAA7416
420NQLGELV7417AACCAACTCGGCGAACTAGTG7418
421RDLTSDM7419AGAGACTTGACTTCGGACATG7420
422GVSVLNV7421GGGGTGAGTGTGCTGAATGTT7422
423AADSSGR7423GCGGCGGATAGTTCTGGGCGG7424
424ALNEHEA7425GCTCTGAATGAGCATGAGGCG7426
425LRVTENQ7427TTGCGTGTGACGGAGAATCAG7428
426MTVPGSP7429ATGACGGTTCCGGGTAGTCCG7430
427TLAITER7431ACTTTGGCGATTACTGAGCGG7432
428KNPGVDT7433AAGAATCCGGGGGTGGATACT7434
429RVALDET7435AGGGTGGCGCTGGATGAGACG7436
430MNVGHVL7437ATGAATGTGGGTCATGTTCTG7438
431LRVTENK7439TTGCGTGTGACGGAGAATAAG7440
432TLGMSTR7441ACGTTGGGGATGTCTACTCGT7442
433REHSAQL7443AGGGAGCATTCGGCGCAGCTT7444
434HGNLVSQ7445CATGGGAATTTGGTGTCTCAG7446
435VQGPQTG7447GTGCAGGGTCCGCAGACTGGT7448
436VTTLTPV7449GTGACTACGCTTACTCCTGTG7450
437DSHVSGM7451GATAGTCATGTGTCGGGGATG7452
438LVTPMHM7453CTCGTAACTCCCATGCACATG7454
439RVDSEKL7455AGGGTGGATTCGGAGAAGCTT7456
440RPEIEVR7457CGGCCGGAGATTGAGGTTAGG7458
441QDGPAEK7459CAGGATGGGCCTGCGGAGAAG7460
442MVTPTNT7461ATGGTTACTCCTACGAATACG7462
443LSKGSQL7463CTGTCTAAGGGGTCGCAGCTG7464
444VIVLTEA7465GTGATTGTGTTGACGGAGGCT7466
445QSLTDGV7467CAGAGTTTGACTGATGGGGTT7468
446MNGAHVL7469ATGAATGGGGCTCATGTTCTG7470
447RVALDLT7471AGGGTGGCGCTGGATTTGACG7472
448SQSAFPN7473AGTCAGTCGGCTTTTCCTAAT7474
449LTRGEEK7475CTTACGAGGGGTGAGGAGAAG7476
450MGASDTL7477ATGGGGGCTAGTGATACGCTT7478
451NQLAELV7479AATCAGTTGGCGGAGCTGGTT7480
452LGDSADQ7481CTTGGGGATTCTGCTGATCAG7482
453GVSVLND7483GGGGTGAGTGTGCTGAATGAT7484
454TTAAIVK7485ACGACGGCGGCTATTGTTAAG7486
455MGASDTH7487ATGGGGGCTAGTGATACGCAT7488
456DLNEHVA7489GATCTGAATGAGCATGTGGCG7490
457MNGGHAL7491ATGAATGGGGGTCATGCTCTG7492
458SNGLPAQ7493AGTAATGGGCTTCCTGCGCAG7494
459TTTGNLM7495ACGACGACGGGGAATCTTATG7496
460LAGSTGP7497TTGGCGGGGTCGACGGGTCCG7498
461AVKEYEL7499GCCGTTAAAGAATACGAACTC7500
462VIAGHGN7501GTTATTGCTGGGCATGGGAAT7502
463LGDSAET7503CTTGGGGATTCTGCTGAGACG7504
464MVTPTNK7505ATGGTTACTCCTACGAATAAG7506
465AIVSIAR7507GCGATTGTGTCGATTGCTCGG7508
466SPTSSPT7509TCTCCGACGAGTTCGCCGACT7510
467ESRNDVV7511GAGTCGAGGAATGATGTTGTT7512
468TGSSAML7513ACGGGGAGTTCGGCGATGCTT7514
469TVNSIPV7515ACGGTCAACAGTATACCAGTC7516
470ITENASR7517ATTACTGAGAATGCGTCGCGG7518
471PILGAST7519CCGATTCTTGGTGCTAGTACG7520
472GGKGEGP7521GGTGGGAAGGGTGAGGGTCCG7522
473IVMDENH7523ATCGTAATGGACGAAAACCAC7524
474MGASVTL7525ATGGGGGCTAGTGTTACGCTT7526
475AVKEYEA7527GCGGTGAAGGAGTATGAGGCG7528
476TVGLSIA7529ACTGTGGGTTTGTCGATTGCG7530
477QVIDTKT7531CAGGTGATTGATACTAAGACT7532
478DVVLLTR7533GATGTTGTTTTGTTGACTAGG7534
479DVRGSDI7535GACGTACGGGGGTCTGACATC7536
480FAEVAQA7537TTTGCGGAGGTTGCGCAGGCG7538
481TSLLPQT7539ACGTCTCTGCTTCCTCAGACT7540
482AADSSAR7541GCGGCGGATAGTTCTGCGCGG7542
483LGDSAES7543CTTGGGGATTCTGCTGAGTCG7544
484SVDSGLL7545AGTGTTGATAGTGGGCTGCTT7546
485GRDLTPA7547GGTCGGGATCTTACGCCTGCT7548
486PAREVLY7549CCGGCTCGGGAGGTGCTTTAT7550
487GSDIKHE7551GGTTCTGATATTAAGCATGAG7552
488LAGSPGP7553TTGGCGGGGTCGCCGGGTCCG7554
489DVTVSMR7555GATGTTACTGTTTCTATGCGT7556
490RVDSGQL7557AGGGTGGATTCGGGGCAGCTT7558
491MNGGNVM7559ATGAATGGGGGTAATGTTATG7560
492PQLIVPA7561CCTCAGCTTATTGTTCCTGCG7562
493VTTHTPV7563GTGACTACGCATACTCCTGTG7564
494MQITGLH7565ATGCAGATTACTGGTCTTCAT7566
495AAREELN7567GCGGCTCGGGAGGAGCTTAAT7568
496AAREVLM7569GCGGCTCGGGAGGTGCTTATG7570
497PGGHYQA7571CCGGGTGGGCATTATCAGGCT7572
498EVAGTYS7573GAGGTGGCTGGGACGTATTCT7574
499VVDSNNL7575GTTGTTGATTCGAATAATCTG7576
500VRQLDSL7577GTGAGGCAGCTGGATTCGCTG7578
501QVTDTKT7579CAGGTGACTGATACTAAGACT7580
502VNDGLGI7581GTTAATGATGGGCTTGGGATT7582
503TTSANLM7583ACGACGTCGGCGAATCTTATG7584
504GSHVSGD7585GGTAGTCATGTGTCGGGGGAT7586
505GRSQLPM7587GGGCGGTCGCAGTTGCCGATG7588
506TVLAASH7589ACGGTGTTGGCTGCGTCTCAT7590
507SPSAFPN7591AGTCCGTCGGCTTTTCCTAAT7592
508QSMTDGV7593CAGAGTATGACTGATGGGGTT7594
509QSLSKDK7595CAGAGTCTTAGTAAGGATAAG7596
510REALSVT7597AGGGAGGCGCTGTCTGTGACG7598
511VIAGHGI7599GTTATTGCTGGGCATGGGATT7600
512MNGGHVI7601ATGAATGGGGGTCATGTTATT7602
513NQLAEQV7603AATCAGTTGGCGGAGCAGGTT7604
514PAREVHY7605CCGGCTCGGGAGGTGCATTAT7606
515VLASLGP7607GTTCTGGCTTCGCTTGGTCCT7608
516PARELHY7609CCGGCTCGGGAGCTGCATTAT7610
517MSITEPR7611ATGTCTATTACTGAGCCGCGG7612
518AAGVIPN7613GCGGCGGGTGTTATTCCGAAT7614
519VTRGTGN7615GTTACTCGTGGTACGGGTAAT7616
520GVGVLNV7617GGGGTGGGTGTGCTGAATGTT7618
521QSPGSQL7619CAGTCTCCGGGGTCGCAGCTG7620
522RPEIAGR7621CGGCCGGAGATTGCGGGTAGG7622
523PILGASS7623CCGATTCTTGGTGCTAGTTCG7624
524RVDSEQL7625AGGGTGGATTCGGAGCAGCTT7626
525TTYDTLV7627ACGACGTATGATACGTTGGTT7628
526VFVEKSA7629GTTTTTGTTGAGAAGAGTGCG7630
527VGSLTAS7631GTGGGGTCGCTTACGGCTAGT7632
528QNMGVTQ7633CAGAATATGGGGGTGACTCAG7634
529GVSVPNV7635GGGGTGAGTGTGCCGAATGTT7636
530GSRENAR7637GGGAGTAGGGAGAATGCGCGT7638
531PGEHYEA7639CCGGGTGAGCATTATGAGGCT7640
532AVGVILN7641GCGGTGGGTGTTATTCTGAAT7642
533TLAINER7643ACTTTGGCGATTAATGAGCGG7644
534MNGGHVL7645ATGAATGGGGGTCATGTTCTG7646
535LGGVSSE7647CTGGGGGGTGTGTCGTCTGAG7648
536TVGLTIA7649ACTGTGGGTTTGACGATTGCG7650
537WNGRETT7651TGGAATGGTCGGGAGACTACT7652
538RHVHVEG7653CGCCACGTACACGTCGAAGGC7654
539DSRVSGD7655GATAGTCGTGTGTCGGGGGAT7656
540RPEIAVR7657CGGCCGGAGATTGCGGTTAGG7658
541DVSVSMR7659GATGTTTCTGTTTCTATGCGT7660
542IVTPTNT7661ATTGTTACTCCTACGAATACG7662
543SPTSSPP7663TCTCCGACGAGTTCGCCGCCT7664
544NHGTDSK7665AACCACGGAACAGACTCTAAA7666
545KNPGVDS7667AAGAATCCGGGGGTGGATTCT7668
546PDRHGGL7669CCTGATCGGCATGGTGGGCTG7670
547VVDSDNL7671GTTGTTGATTCGGATAATCTG7672
548NQLGELV7673AATCAGTTGGGGGAGCTGGTT7674
549NQLAEPV7675AATCAGTTGGCGGAGCCGGTT7676
550WIGRETT7677TGGATTGGTCGGGAGACTACT7678
551SGAPLRL7679TCTGGGGCGCCGCTTAGGCTT7680
552VLLGINM7681GTGCTTTTGGGTATTAATATG7682
553SHGYDSK7683TCGCACGGCTACGACTCTAAA7684
554PGAHYQA7685CCGGGTGCGCATTATCAGGCT7686
555ASESSPP7687GCATCAGAATCATCACCACCC7688
556QNVGVTQ7689CAGAATGTGGGGGTGACTCAG7690
557GEQQKVW7691GGTGAGCAGCAGAAGGTTTGG7692
558AQAQTGW7693GCTCAAGCACAGACCGGCTGG7694
559STLHTTT7695AGTACTCTTCATACTACGACT7696
560AVLSQNL7697GCTGTGTTGTCTCAGAATCTT7698
561GAVSSTK7699GGTGCTGTTTCTTCGACTAAG7700
562PTQETLR7701CCTACTCAGGAGACGCTTCGG7702
563QYVVSGV7703CAGTATGTTGTTAGTGGTGTG7704
564LAGLGGP7705CTTGCGGGTTTGGGGGGGCCT7706
565QTMKDFY7707CAGACGATGAAGGATTTTTAT7708
566VGSVMAS7709GTGGGGTCGGTTATGGCTAGT7710
567AHIGTLT7711GCACACATCGGAACTCTCACC7712
568MVTPTIT7713ATGGTTACTCCTACGATTACG7714
569MLTPTNT7715ATGCTTACTCCTACGAATACG7716
570TGDRDQN7717ACTGGTGATCGGGATCAGAAT7718
571GGVSSTN7719GGTGGTGTTTCTTCGACTAAT7720
572LSNHGPI7721CTGAGTAATCATGGGCCTATT7722
573ALTNGQR7723GCACTAACCAACGGTCAACGT7724
574NQLSELV7725AATCAGTTGTCGGAGCTGGTT7726
575GALTSTK7727GGTGCTCTTACTTCGACTAAG7728
576VGSVTAS7729GTGGGGTCGGTTACGGCTAGT7730
577VLASHGT7731GTTCTGGCTTCGCATGGTACT7732
578AVKEYET7733GCGGTGAAGGAGTATGAGACG7734
579RGGVSTE7735CGGGGGGGTGTGTCGACTGAG7736
580SGGKEEM7737AGTGGGGGTAAGGAGGAGATG7738
581HGTLVSQ7739CATGGGACTTTGGTGTCTCAG7740
582LMNDLLS7741CTTATGAACGACTTACTCTCC7742
583DAPRDGA7743GACGCACCCCGCGACGGGGCT7744
584RTTEPRF7745CGTACTACGGAGCCTCGTTTT7746
585TLPELNL7747ACGTTGCCGGAGTTGAATCTT7748
586LTKSTEW7749CTCACCAAATCCACAGAATGG7750
587QVPDNKT7751CAGGTGCCTGATAATAAGACT7752
588QGGDSGG7753CAGGGTGGTGATAGTGGGGGT7754
589LSTGEEM7755CTTTCGACGGGTGAGGAGATG7756
590PEPRSSY7757CCTGAGCCGCGTAGTAGTTAT7758
591LISTTLR7759TTGATTTCTACTACGCTGCGT7760
592RVTPTNT7761CGCGTGACGCCAACTAACACT7762
593HKDRTTL7763CATAAGGATAGGACGACGCTT7764
594STEYAML7765TCTACTGAGTATGCGATGTTG7766
595NLGAELS7767AACTTGGGGGCAGAACTATCG7768
596QNGLQLL7769CAGAATGGGTTGCAGCTTTTG7770
597LISGTLR7771TTGATTTCTGGTACGCTGCGT7772
598ANQNVII7773GCAAACCAAAACGTAATAATA7774
599SPPPNAR7775AGCCCGCCGCCGAACGCGCGT7776
600SSADYQV7777AGTTCTGCGGATTATCAGGTT7778
601KQVSMES7779AAGCAGGTGTCGATGGAGTCG7780
602RVALDVT7781AGGGTGGCGCTGGATGTGACG7782
603LNMGPLH7783CTGAATATGGGTCCTTTGCAT7784
604IPRIHSL7785ATTCCTCGGATTCATTCTCTT7786
605IGSSLSP7787ATTGGGTCGTCGCTTAGTCCT7788
606LEKDPMT7789TTGGAAAAAGACCCTATGACT7790
607MVTNTNT7791ATGGTTACTAATACGAATACG7792
608RIASNLA7793AGGATTGCTTCTAATCTGGCG7794
609VVAGTNS7795GTCGTTGCAGGTACAAACTCG7796
610GVAATNS7797GGGGTGGCTGCGACGAATTCT7798
611KGSVTPM7799AAGGGTTCTGTTACTCCTATG7800
612HDTSASV7801CATGATACTAGTGCTAGTGTT7802
613SLAITER7803AGTTTGGCGATTACTGAGCGG7804
614HGRDALV7805CATGGGCGGGATGCTCTTGTG7806
615DIAGLGI7807GATATTGCCGGGCTTGGGATT7808
616ANQLAPV7809GCCAACCAATTGGCCCCCGTG7810
617NGASLAS7811AACGGAGCTTCCCTCGCAAGC7812
618*KMSAYV7813TGAAAGATGTCCGCTTATGTG7814
619GVAGRIL7815GGGGTGGCTGGGCGTATTCTG7816
620TLAISGR7817ACTTTGGCGATTTCTGGGCGG7818
621NLHTAEA7819AACCTCCACACTGCTGAAGCG7820
622SIAVGLS7821AGTATTGCGGTGGGTTTGTCG7822
623TGQQVSI7823ACTGGGCAGCAGGTTAGTATT7824
624LPRLGGL7825CTTCCGCGTTTGGGGGGGCTT7826
625DTASTQS7827GACACAGCATCTACTCAATCC7828
626PQLIVPV7829CCTCAGCTTATTGTTCCTGTG7830
627GLNDHVA7831GGTCTGAATGATCATGTGGCG7832
628RDEAYRA7833AGGGATGAGGCTTATCGTGCG7834
629RISPEGT7835CGTATATCACCGGAAGGCACT7836
630HSEGVGR7837CATAGTGAGGGTGTTGGGCGG7838
631KGSDNTM7839AAGGGTTCTGATAATACTATG7840
632LPNGGGF7841CTGCCGAATGGGGGGGGGTTT7842
633AVTNPLM7843GCGGTTACTAATCCTTTGATG7844
634VTVAGSV7845GTTACGGTGGCTGGTTCGGTG7846
635RDDQGIP7847CGGGATGATCAGGGGATTCCG7848
636HTLSTGV7849CACACCCTAAGCACGGGAGTA7850
637SGGTRGP7851TCTGGTGGGACTCGTGGTCCT7852
638RHIADAS7853AGACACATAGCGGACGCGTCG7854
639SGISFLA7855AGCGGAATCAGCTTCTTGGCT7856
640SALTQGY7857TCGGCGCTAACCCAAGGATAC7858
641LNGAPLL7859CTGAATGGTGCGCCGTTGCTG7860
642STVGINV7861AGTACGGTCGGGATCAACGTT7862
643QEQGTTT7863CAGGAGCAGGGTACGACTACT7864
644MIGGHVQ7865ATGATTGGGGGTCATGTTCAG7866
645RVLTSDV7867CGTGTTCTGACGTCTGATGTG7868
646GFGLTED7869GGGTTTGGGTTGACGGAGGAT7870
647DAQSRLA7871GATGCTCAGTCGCGGTTGGCG7872
648RESANAD7873CGTGAGTCTGCGAATGCTGAT7874
649LLHGIIA7875CTTTTACACGGAATAATCGCC7876
650GMGASSK7877GGTATGGGGGCGTCTTCTAAG7878
651RNEGINQ7879CGTAATGAGGGTATTAATCAG7880
652PGVAMVT7881CCCGGGGTCGCTATGGTAACT7882
653ASQLTQT7883GCGTCTCAGCTTACTCAGACT7884
654ALGDQAR7885GCGTTAGGGGACCAAGCGCGT7886
655LTDVTQM7887TTAACCGACGTCACACAAATG7888
656DVAISMR7889GACGTAGCGATATCCATGCGA7890
657YGSNVLS7891TACGGTTCTAACGTCCTCTCA7892
658VYHGGVD7893GTGTATCATGGTGGTGTGGAT7894
659SFDTYGA7895TCCTTCGACACTTACGGGGCC7896
660PTTNPLL7897CCGACTACTAATCCGCTTCTG7898
661RVAMSVT7899AGGGTGGCGATGTCTGTGACG7900
662HIVLSHA7901CATATTGTGCTGAGTCATGCT7902
663TLQELQL7903ACGTTGCAGGAGTTGCAGCTT7904
664DPSLGSP7905GATCCGTCTCTGGGTTCTCCG7906
665LAGSVVV7907CTGGCGGGTTCGGTTGTTGTG7908
666ILVDAYA7909ATACTAGTAGACGCGTACGCT7910
667GVANVSP7911GGAGTTGCTAACGTCAGCCCA7912
668RMTLTGD7913CGTATGACTTTGACTGGTGAT7914
669SGGVESK7915TCTGGTGGTGTTGAGTCGAAG7916
670QIHDTAL7917CAAATCCACGACACAGCGCTC7918
671FQVEQIM7919TTTCAGGTTGAGCAGATTATG7920
672GLVQMSS7921GGTCTGGTGCAGATGTCTTCT7922
673FPSMSGK7923TTCCCAAGCATGTCGGGGAAA7924
674VSNGHFV7925GTTAGTAATGGGCATTTTGTT7926
675STVGSSP7927AGTACGGTGGGGTCGTCGCCG7928
676YLVTADN7929TATTTGGTTACTGCTGATAAT7930
677TTRADPA7931ACTACTCGGGCTGATCCTGCG7932
678PLVPQGG7933CCCTTAGTACCTCAAGGCGGT7934
679GARMVMT7935GGTGCGCGGATGGTTATGACT7936
680MKTQIEL7937ATGAAAACGCAAATAGAACTC7938
681VNHGGVD7939GTAAACCACGGAGGAGTTGAC7940
682MVQSGLT7941ATGGTTCAGTCGGGGTTGACG7942
683QYAVSGG7943CAATACGCAGTGAGCGGCGGT7944
684MTSGNLM7945ATGACCTCTGGCAACCTCATG7946
685TTLAHPA7947ACTACTCTGGCTCATCCTGCG7948
686REQQKAW7949CGAGAACAACAAAAAGCCTGG7950
687VTTLSPV7951GTGACTACGCTTTCTCCTGTG7952
688LQDRTTL7953CTCCAAGACCGCACTACTCTC7954
689SLGALVA7955TCGCTGGGTGCTCTGGTTGCT7956
690GADDAAL7957GGAGCCGACGACGCAGCCCTC7958
691IGPRREV7959ATAGGACCTCGCCGTGAAGTA7960
692QSQTAVA7961CAGTCTCAGACGGCTGTTGCT7962
693VDFGDHT7963GTAGACTTCGGCGACCACACC7964
694SLRDTHY7965AGTCTTCGGGATACTCATTAT7966
695YEHSGLL7967TATGAGCATTCTGGTCTTTTG7968
696VTELTRF7969GTGACTGAGCTTACGCGGTTT7970
697LTHLRVS7971CTGACTCACCTTCGTGTCAGC7972
698QRSDSVM7973CAGCGGTCGGATAGTGTGATG7974
699LSKEHAP7975TTGAGTAAGGAGCATGCTCCT7976
700PDGAAPM7977CCTGATGGTGCGGCTCCTATG7978
701LTTPIEL7979CTAACTACCCCTATAGAACTC7980
702AAVVPRY7981GCAGCAGTAGTACCACGATAC7982
703VVLSLAT7983GTTGTCTTAAGTCTAGCCACT7984
704QDAHVAI7985CAGGATGCGCATGTGGCTATT7986
705LGHANGL7987TTAGGGCACGCAAACGGACTT7988
706SPQGVLA7989TCGCCGCAGGGGGTTCTTGCT7990
707LSLTMPA7991CTCTCGCTTACAATGCCTGCC7992
708YVGSPLV7993TATGTTGGTTCTCCGTTGGTG7994
709QILGASS7995CAAATCTTAGGGGCCTCGAGT7996
710NSGSMHT7997AACTCAGGAAGCATGCACACT7998
711GVLGQTD7999GGTGTGTTGGGGCAGACTGAT8000
712MNVGHVL8001ATGAACGTAGGGCACGTCCTC8002
713PNTRDPI8003CCTAATACGCGGGATCCGATT8004
714VEKRHMV8005GTGGAGAAGAGGCATATGGTG8006
715VVSGIPN8007GTGGTGTCTGGTATTCCGAAT8008
716KNGGHDL8009AAAAACGGTGGGCACGACCTA8010
717YESTRGQ8011TATGAGTCGACGAGGGGTCAG8012
718NALGDGY8013AACGCGCTGGGCGACGGCTAC8014
719GLYDAAT8015GGGCTTTATGATGCGGCGACT8016
720LVAGQAM8017CTGGTGGCGGGGCAGGCTATG8018
721DSRTVDS8019GACTCTCGAACCGTCGACTCA8020
722GDRGVVA8021GGTGATAGGGGGGTTGTGGCT8022
723GLESSVP8023GGCCTTGAAAGCTCTGTACCC8024
724LSRGAEN8025CTTTCGAGGGGTGCGGAGAAT8026
725ISMTLLP8027ATTTCGATGACTCTGCTGCCG8028
726AVGNVLL8029GCTGTGGGGAATGTGCTTTTG8030
727QYAVSGG8031CAGTATGCTGTTAGTGGTGGG8032
728PAQGTLR8033CCTGCTCAGGGGACGCTTCGG8034
729DVAVYIR8035GATGTTGCTGTTTATATTCGT8036
730VLQLAAL8037GTTCTTCAACTCGCTGCCCTC8038
731DDAVSKR8039GATGATGCTGTTTCTAAGCGT8040
732LEDRSAS8041TTGGAGGATCGGTCGGCTAGT8042
733PSYQGNG8043CCGAGTTATCAGGGGAATGGT8044
734LGDSDET8045TTAGGAGACTCGGACGAAACC8046
735GNLLLTA8047GGTAATTTGCTGCTTACTGCT8048
736EGVSALL8049GAGGGTGTTTCTGCGTTGTTG8050
737GHQNGGI8051GGGCACCAAAACGGCGGAATC8052
738RSISGDW8053CGTTCCATAAGTGGCGACTGG8054
739YLALTGI8055TATCTTGCGCTTACGGGGATT8056
740LSDGGPL8057CTCTCGGACGGAGGCCCCCTC8058
741LEANVSH8059CTTGAGGCGAATGTTTCGCAT8060
742GLSERAQ8061GGCCTGTCCGAACGAGCACAA8062
743SGFVVPV8063TCTGGGTTTGTTGTGCCGGTG8064
744GVMLLTE8065GGGGTTATGTTGCTGACTGAG8066
745STTSSPS8067TCGACCACCTCATCCCCTAGC8068
746FNGLPAQ8069TTCAACGGTCTCCCCGCACAA8070
747HVSGASL8071CACGTGTCCGGCGCCAGCTTA8072
748GGDTSRS8073GGGGGTGATACGAGTCGTAGT8074
749AVAGTNS8075GCAGTTGCGGGTACAAACTCG8076
750VMSGTSH8077GTTATGTCGGGTACTAGTCAT8078
751YAGIAQG8079TATGCGGGGATTGCTCAGGGT8080
752MLALAVT8081ATGTTGGCGCTGGCTGTGACG8082
753AALTREI8083GCTGCTCTTACGCGGGAGATT8084
754AIVGMLS8085GCGATTGTGGGTATGCTGTCG8086
755MANMLSV8087ATGGCGAACATGTTATCGGTG8088
756LLADERV8089TTACTCGCAGACGAAAGGGTC8090
757LSSTDGV8091CTGAGTTCGACTGATGGGGTT8092
758VTQNLSE8093GTGACGCAGAATTTGAGTGAG8094
759PARYRLW8095CCGGCGCGGTATCGGCTTTGG8096
760GGDALNQ8097GGGGGGGACGCCCTTAACCAA8098
761VMASPGP8099GTTATGGCTTCGCCTGGTCCT8100
762PGDRDQY8101CCAGGCGACCGAGACCAATAC8102
763LGSLVVH8103CTGGGAAGCTTAGTCGTTCAC8104
764LEVGALR8105CTGGAAGTAGGCGCACTTCGT8106
765VSPSVLQ8107GTTAGTCCTTCGGTGCTTCAG8108
766GISGEVS8109GGTATTTCGGGGGAGGTGAGT8110
767RGAEVLL8111CGGGGTGCGGAGGTGCTGCTG8112
768GVAGTNS8113GGAGTTGCGGGAACAAACTCC8114
769LNGGIGV8115CTTAATGGGGGTATTGGGGTT8116
770TIAAHVP8117ACCATAGCAGCCCACGTACCC8118
771LNGISFV8119TTGAATGGGATTTCGTTTGTG8120
772MGVGGGS8121ATGGGGGTCGGTGGTGGATCC8122
773PLKGGGE8123CCGTTGAAAGGCGGGGGTGAA8124
774RVAQALT8125AGGGTGGCGCAGGCTCTGACG8126
775EASSRLL8127GAAGCTTCGTCGCGACTTCTC8128
776SQAEGSV8129TCCCAAGCGGAAGGCAGCGTG8130
777NSGPQLS8131AACTCGGGCCCACAACTTTCG8132
778VQSADPR8133GTCCAATCCGCGGACCCTCGC8134
779VSDSSIN8135GTGTCGGATTCGTCTATTAAT8136
780TVKEYEL8137ACCGTTAAAGAATACGAACTC8138
781MENAPGR8139ATGGAGAATGCTCCTGGGAGG8140
782GNGDMFA8141GGGAATGGGGATATGTTTGCT8142
783HTSGTSS8143CATACGAGTGGGACGTCGTCG8144
784VIASNEP8145GTTATAGCCTCCAACGAACCG8146
785GINEHVA8147GGGATCAACGAACACGTAGCC8148
786HNSHVLT8149CACAACTCCCACGTATTAACC8150
787QANMLTV8151CAGGCTAATATGTTGACTGTT8152
788VFTGTDP8153GTGTTCACCGGCACAGACCCT8154
789ASDAVLR8155GCATCCGACGCCGTCCTAAGG8156
790ASDAVLR8157GCTAGTGATGCGGTGTTGCGT8158
791RDLTNDV8159CGCGACTTAACTAACGACGTT8160
792RVHSAQL8161AGGGTGCATTCGGCGCAGCTT8162
793SGNAWDE8163AGTGGGAATGCTTGGGATGAG8164
794GHQALNA8165GGCCACCAAGCATTAAACGCC8166
795IADMGGN8167ATTGCTGATATGGGTGGTAAT8168
796SMDSTSR8169TCTATGGATTCGACGTCTAGG8170
797GVSLPMS8171GGCGTATCACTACCCATGAGC8172
798AALAGSR8173GCGGCTCTGGCGGGGTCTAGG8174
799ILGVYSD8175ATACTGGGCGTTTACTCCGAC8176
800APRDPGV8177GCGCCGCGTGATCCTGGTGTT8178
801NRHETLS8179AACCGCCACGAAACACTATCA8180
802LGDGTTR8181CTGGGGGATGGTACGACTCGG8182
803RNHDQTH8183AGAAACCACGACCAAACACAC8184
804MTDSGTV8185ATGACTGATAGTGGGACTGTG8186
805NHHGDRL8187AACCACCACGGAGACAGGCTG8188
806LANTVVT8189CTTGCTAATACGGTTGTGACG8190
807QFHENIR8191CAGTTTCATGAGAATATTCGT8192
808NFGRDTL8193AATTTTGGTCGTGATACTCTG8194
809SGSNTGP8195AGCGGCTCCAACACTGGCCCG8196
810EPAMGMR8197GAGCCGGCGATGGGGATGAGG8198
811ENAGTDV8199GAAAACGCCGGAACTGACGTC8200
812IIISSAN8201ATAATCATATCCTCGGCCAAC8202
813NHVGDRL8203AATCATGTTGGTGATCGTTTG8204
814SGGLMTG8205AGTGGTGGTCTTATGACTGGT8206
815GRGTNDH8207GGTCGGGGTACGAATGATCAT8208
816LANMLQV8209TTGGCAAACATGCTTCAAGTG8210
817TNTDSSL8211ACGAATACGGATTCTAGTCTG8212
818GSGPGVA8213GGTTCTGGGCCGGGGGTGGCT8214
819ADVLIRG8215GCGGACGTGCTCATACGCGGT8216
820TLQQLQL8217ACTCTCCAACAACTGCAATTG8218
821TMANSER8219ACGATGGCAAACTCGGAACGC8220
822WDDQTSG8221TGGGATGATCAGACTTCGGGG8222
823GTGSTNV8223GGAACTGGATCGACAAACGTT8224
824GPSGAGI8225GGGCCATCAGGGGCAGGCATC8226
825NAAVIYD8227AACGCTGCAGTGATATACGAC8228
826SNLGETV8229TCGAATTTGGGGGAGACGGTT8230
827EPSLGSR8231GAGCCGTCTCTGGGTTCTCGG8232
828IGASVKL8233ATCGGTGCATCGGTAAAACTG8234
829SRGVISS8235AGCCGAGGCGTAATCTCGTCA8236
830RVMGEEV8237CGTGTGATGGGGGAGGAGGTT8238
831YSTERSV8239TATTCGACTGAGAGGTCTGTT8240
832AGGGTPR8241GCGGGGGGTGGGACTCCGAGG8242
833VLPSPGP8243GTTCTGCCTTCGCCTGGTCCT8244
834TSVLPQT8245ACGTCTGTGCTTCCTCAGACT8246
835ILASPGP8247ATACTTGCGTCACCCGGACCG8248
836GEIDIAF8249GGAGAAATCGACATAGCCTTC8250
837GWADSVP8251GGTTGGGCTGATTCGGTTCCG8252
838GVAATNT8253GGAGTTGCAGCCACAAACACG8254
839LVGNPST8255CTCGTGGGCAACCCGAGTACG8256
840YGVTLST8257TACGGCGTAACCCTCTCTACC8258
841ASMGTVA8259GCGTCCATGGGAACCGTAGCC8260
842WSNSEQH8261TGGTCGAATTCGGAGCAGCAT8262
843REVSPLM8263CGAGAAGTAAGCCCCCTGATG8264
844QAESAAR8265CAAGCGGAATCAGCGGCTAGA8266
845ALQSAQV8267GCACTACAATCTGCACAAGTT8268
846PNDRLTV8269CCAAACGACCGGTTGACGGTT8270
847LIVTENQ8271TTGATTGTGACGGAGAATCAG8272
848GLVHMPS8273GGCTTAGTTCACATGCCCTCA8274
849MADGASM8275ATGGCGGATGGTGCGTCTATG8276
850RAVENMG8277CGCGCAGTAGAAAACATGGGC8278
851LNGVTIT8279CTCAACGGCGTCACCATCACC8280
852RYNVETA8281CGGTATAATGTTGAGACTGCG8282
853SLLHDGA8283AGTTTGTTGCATGATGGGGCG8284
854TRIGLSD8285ACACGAATAGGACTCAGTGAC8286
855NAHALMV8287AACGCCCACGCACTCATGGTC8288
856VEVQAGK8289GTGGAGGTTCAGGCTGGGAAG8290
857RGGVLSE8291CGAGGTGGGGTACTCAGTGAA8292
858KNQDTKM8293AAGAATCAGGATACGAAGATG8294
859QLRPLQT8295CAACTGCGTCCTTTGCAAACG8296
860LLENARV8297CTGCTGGAGAATGCGAGGGTG8298
861LFGPSAY8299TTATTCGGACCTTCCGCCTAC8300
862RIDAELL8301CGTATTGATGCTGAGTTGTTG8302
863VVSGLLH8303GTTGTCTCCGGGTTGCTACAC8304
864MGGVTSV8305ATGGGGGGGGTTACTTCGGTG8306
865TVADPRA8307ACTGTTGCGGATCCGCGGGCG8308
866TGLQVST8309ACTGGGCTGCAGGTTAGTACT8310
867ANEHNIA8311GCTAATGAGCATAATATTGCG8312
868STLASPR8313TCAACCCTAGCCTCGCCTCGA8314
869IHFSGDN8315ATCCACTTCAGCGGCGACAAC8316
870GLVQIVA8317GGGCTTGTTCAGATTGTTGCG8318
871TAYDTLV8319ACGGCGTATGATACGTTGGTT8320
872AVKEYQS8321GCTGTTAAAGAATACCAATCT8322
873ASSHVTV8323GCTTCGAGTCATGTTACTGTG8324
874STLSTFD8325TCGACTTTGAGTACGTTTGAT8326
875LDLTSDV8327CTTGATCTGACGTCTGATGTG8328
876QYNVEST8329CAGTATAATGTTGAGTCTACG8330
877SVEPLSL8331TCCGTAGAACCTCTATCCCTC8332
878PGHGPVR8333CCCGGGCACGGACCTGTACGC8334
879VRQLDSR8335GTGAGGCAGCTGGATTCGCGG8336
880MMLNQGS8337ATGATGCTTAACCAAGGCAGC8338
881EPSLSSP8339GAGCCGTCTCTGAGTTCTCCG8340
882PGVDTGV8341CCTGGTGTTGATACTGGTGTT8342
883SGDVARH8343TCAGGCGACGTTGCCCGACAC8344
884ADYGTSS8345GCGGACTACGGTACCAGCTCT8346
885VHSQDVS8347GTGCATTCGCAGGATGTGTCT8348
886VIAGLGV8349GTGATCGCGGGACTCGGCGTC8350
887VHVDNSN8351GTGCATGTTGATAATAGTAAT8352
888QSGVF*C8353CAGTCGGGGGTGTTCTGATGC8354
889AQDHGTL8355GCGCAGGATCATGGGACGTTG8356
890SRLEYIG8357AGCCGCCTTGAATACATCGGG8358
891VLLGINT8359GTCCTGCTCGGAATAAACACC8360
892LGIGQGP8361TTGGGTATTGGTCAGGGTCCT8362
893NVTATLG8363AACGTCACAGCAACGCTGGGT8364
894EVLSLAP8365GAGGTGCTGTCTCTTGCTCCG8366
895TNGVLYT8367ACAAACGGCGTCCTTTACACG8368
896RFVGSVP8369AGGTTTGTGGGTAGTGTTCCG8370
897TNGYRED8371ACTAATGGTTATAGGGAGGAT8372
898LESAAMI8373CTGGAGTCGGCTGCTATGATT8374
899VPLPSGK8375GTTCCTCTGCCGAGTGGGAAG8376
900NSKDVQR8377AACTCCAAAGACGTACAAAGA8378
901GVGGTYS8379GGAGTTGGGGGCACATACAGT8380
902LTDKMTS8381TTGACTGATAAGATGACGTCG8382
903SGAAAAT8383AGCGGGGCCGCAGCCGCCACC8384
904MVTTTNT8385ATGGTGACGACCACAAACACC8386
905TSLGLMQ8387ACTAGCCTTGGCTTAATGCAA8388
906LVHLGTS8389TTGGTTCATCTTGGGACTTCT8390
907NGMGDVT8391AATGGGATGGGTGATGTGACG8392
908LNSPLHV8393CTGAATAGTCCGCTGCATGTT8394
909GSRESVR8395GGGAGTAGGGAGAGTGTGCGT8396
910DNSPMDL8397GACAACAGCCCCATGGACCTA8398
911VVSPQPV8399GTGGTTTCGCCTCAACCGGTG8400
912STINTLM8401AGTACTATTAATACTCTGATG8402
913THGDAGG8403ACTCATGGGGATGCTGGTGGG8404
914AVLAGSS8405GCGGTTCTGGCGGGGTCTAGT8406
915YTSGTGT8407TACACCTCGGGCACAGGGACA8408
916GPDTGAM8409GGCCCCGACACAGGCGCGATG8410
917SGMQAEA8411TCGGGTATGCAGGCGGAGGCT8412
918LATHDAR8413CTCGCAACGCACGACGCACGA8414
919YDRIMSS8415TACGACCGCATAATGTCATCT8416
920RHHGTES8417CGTCATCATGGTACTGAGAGT8418
921MAVKSPP8419ATGGCTGTGAAGTCGCCGCCG8420
922EVRDTKT8421GAAGTTCGGGACACAAAAACG8422
923GFVQSRM8423GGGTTTGTTCAGAGTCGGATG8424
924VLAAVDR8425GTCCTTGCTGCCGTCGACCGA8426
925VTTVPPV8427GTGACTACGGTTCCTCCTGTG8428
926HFSSETS8429CACTTCTCTTCCGAAACTTCT8430
927TTVTVSL8431ACGACGGTGACGGTGTCGTTG8432
928AESRLFV8433GCGGAGAGTAGGCTGTTTGTG8434
929LSGGFTA8435TTGAGTGGTGGTTTTACGGCG8436
930NSDLASP8437AATAGTGATTTGGCGTCTCCT8438
931LDHGASA8439TTAGACCACGGAGCGTCGGCG8440
932YGSNDLS8441TATGGGAGTAATGATCTGAGT8442
933VIASNEH8443GTCATAGCCTCAAACGAACAC8444
934LTGSIGL8445TTAACTGGGTCAATTGGACTC8446
935HLSRDHS8447CACCTGTCACGTGACCACTCA8448
936NLRGEHT8449AATTTGCGTGGGGAGCATACG8450
937ILVDALA8451ATTCTGGTTGATGCTCTTGCG8452
938SGYDTSV8453AGTGGGTATGATACGTCGGTT8454
939HKDKWVG8455CACAAAGACAAATGGGTTGGG8456
940MTGNSFV8457ATGACAGGCAACAGCTTCGTA8458
941YTVGSLA8459TACACCGTTGGCTCACTCGCC8460
942SVSKPFL8461AGTGTGAGTAAGCCTTTTTTG8462
943TVMTSEP8463ACAGTTATGACCAGCGAACCT8464
944SMGYVSA8465TCGATGGGTTATGTTTCGGCT8466
945ILVDAYA8467ATTCTGGTTGATGCTTATGCG8468
946QGGTTLR8469CAAGGGGGGACTACTCTACGC8470
947SEGLSRD8471TCGGAGGGTCTTTCGCGTGAT8472
948FTGGTGT8473TTTACTGGTGGTACGGGTACT8474
949RSGSGVA8475CGGTCGGGCTCCGGAGTCGCC8476
950VLASLGP8477GTGCTCGCCAGTCTCGGCCCC8478
951LVTGMSS8479CTTGTCACGGGCATGTCAAGC8480
952ALASTQT8481GCACTAGCATCGACCCAAACT8482
953SLVRGLL8483AGTCTTGTTCGGGGTTTGCTG8484
954VGQVPGR8485GTGGGGCAAGTCCCGGGTAGG8486
955NGPMKAD8487AACGGTCCAATGAAAGCAGAC8488
956GPMASVV8489GGGCCGATGGCGTCTGTGGTT8490
957LVSGLGP8491CTTGTGAGTGGGCTGGGTCCG8492
958AADRSVR8493GCAGCAGACCGCTCCGTACGT8494
959AATSGGP8495GCAGCCACCAGTGGCGGGCCG8496
960RDLTSNV8497CGAGACTTAACTAGCAACGTA8498
961IVMSSHI8499ATCGTCATGAGCTCCCACATC8500
962GPLNQSL8501GGTCCGCTGAATCAGTCTTTG8502
963TDGRTLH8503ACGGATGGTAGGACGCTGCAT8504
964TGLQVSF8505ACTGGGCTGCAGGTTAGTTTT8506
965RVTTHTP8507CGTGTTACTACTCATACGCCG8508
966EVGSIGS8509GAGGTTGGTAGTATTGGTTCT8510
967TDVHSTS8511ACGGATGTGCATTCGACTTCG8512
968TFAISDR8513ACTTTTGCGATTTCTGATCGG8514
969TVLAAAH8515ACGGTGTTGGCTGCGGCTCAT8516
970MNDAGRD8517ATGAATGATGCTGGGCGTGAT8518
971PAEHYQA8519CCGGCTGAGCATTATCAGGCT8520
972DRSTAEW8521GACCGCTCCACAGCAGAATGG8522
973LYGGSSA8523CTCTACGGAGGGTCCTCGGCT8524
974VTQAVYV8525GTTACGCAGGCTGTTTATGTT8526
975GVNHAVA8527GGAGTCAACCACGCCGTCGCC8528
976DSAPAAR8529GATTCGGCTCCGGCGGCTCGG8530
977DPKTGWR8531GATCCGAAGACTGGGTGGCGT8532
978SIVGSVQ8533TCAATCGTAGGCTCAGTCCAA8534
979DSDSGRR8535GATTCTGATAGTGGGCGGCGG8536
980EQYLGSP8537GAGCAGTATCTGGGTTCTCCG8538
981LSLDRPS8539CTAAGTCTAGACCGACCCTCG8540
982MGDIVTL8541ATGGGGGATATTGTTACGCTT8542
983SFRDTVP8543AGTTTTAGGGATACGGTGCCT8544
984RGLSDPV8545CGGGGGCTGTCTGATCCGGTG8546
985TGGLLYS8547ACTGGTGGGCTTCTTTATAGT8548
986VVLSGIS8549GTGGTTTTGTCGGGGATTTCT8550
987GVAGTYL8551GGGGTGGCTGGGACGTATCTG8552
988LNGSHGP8553CTGAATGGGTCGCATGGGCCG8554
989PSGALMT8555CCTTCAGGCGCCTTGATGACG8556
990LSLTDGV8557TTGTCCTTAACCGACGGAGTG8558
991GRDLTPA8559GGGCGTGACCTGACTCCAGCG8560
992AGHSNAV8561GCTGGGCATTCTAATGCGGTT8562
993GVAGTDS8563GGGGTGGCTGGGACGGATTCT8564
994AELGIRY8565GCTGAGCTGGGGATTAGGTAT8566
995PLSNAAL8567CCTCTATCTAACGCAGCACTG8568
996IGLSVST8569ATTGGGCTGTCTGTTTCTACT8570
997RSITIGP8571CGTTCGATTACTATTGGGCCG8572
998GLVRIQD8573GGACTGGTTCGGATCCAAGAC8574
999LSGIMVS8575TTGTCGGGGATTATGGTTTCG8576
1000SWQSDTD8577TCGTGGCAGTCTGATACGGAT8578

Table 9. PAL2 and AAV9 transgene expression and vector genome abundance in one cynomolgus macaque. aTransgene mRNA expression normalized to expression of GAPDH mRNA as detected by qPCR with a standard curve bVector DNA normalized to the number of GAPDH genomic DNA copies as detected by qPCR with a standard curve

TABLE 9
PAL2AAV9PAL2/AV9PAL2 vectorAAV9 vectorPAL2/AAV9
TissuemRNA*mRNA*mRNAgenomes/cellbgenomes/cellbvector DNA
Frontal lobe3.81E−057.82E−064.876.25E−3 4.86E−031.29
Temporal lobe 4.8E−051.10E−054.371.13E−021.09E−021.04
(anterior)
Temporal lobe6.27E−051.17E−055.351.01E−028.07E−031.25
(posterior)
Parietal lobe7.98E−051.35E−055.931.10E−029.11E−03.21
(anterior)
Parietal lobe1.19E−042.28E−055.231.37E−021.13E−021.22
(posterior)
Occipital lobe1.06E−041.93E−055.506.73E−034.83E−031.39
Thalamus5.54E−051.20E−054.612.04E−027.96E−032.56
Midbrain9.67E−051.97E−054.909.28E−035.01E−031.85
Corpus8.27E−053.03E−052.733.67E−031.93E−031.91
callosum
Cerebellum6.69E−053.16E−052.112.18E−031.44E−031.51
Neuroretina1.63E−041.22E−0513.403.19E−038.52E−043.75
RPE4.01E−041.72E−042.341.12E−011.85E−010.61
Brain Stem9.34E−054.00E−052.338.97E−035.76E−031.56
Cervical4.34E−041.44E−043.012.08E−021.42E−021.46
spinal cord
Thoracic9.41E−043.19E−042.942.38E−021.84E−021.29
spinal cord
Lumbar spinal1.71E−034.78E−043.583.15E−021.74E−021.80
cord
Cauda equina3.90E−026.81E−035.746.35E−023.42E−021.86
Cervical DRG3.12E−023.23E−039.645.08E−023.32E−021.53
Thoracic DRG1.55E−021.97E−037.833.57E−021.93E−021.84
Lumbar DRG3.98E−026.07E−036.561.74E−011.37E−011.27
Triceps5.08E−021.72E−022.952.03E−012.49E−010.82
Quadriceps5.37E−031.46E−033.682.70E−024.36E−020.62
Diaphragm2.79E−028.01E−033.481.55E−011.97E−010.79
Heart2.76E−021.15E−022.391.37E−012.04E−010.67
Kidney4.81E−041.71E−042.811.53E−011.73E−010.88
Lung8.12E−043.95E−042.063.44E−014.10E−010.84
Thymus3.38E−031.85E−031.831.44E−024.69E−033.06
Gonad3.20E−032.89E−031.111.83E−022.25E−020.81
Liver8.35E−021.73E−010.48 1.31E+−015.02E+010.26
Spleen1.84E−043.78E−040.496.29E+001.27E+004.97

Applicant assessed the relative performance of mouse- and macaque-derived engineered variants in order to determine if any variants had strong neurotropic properties in both mice and macaques. Applicant performed a benchmarking experiment in C57BL/6J and BALB/cJ mice as well as in cynomolgus macaques comparing four mouse-derived variants and eight macaque-derived variants from this study with AAV9. This panel also included three promising engineered variants developed by the Gradinaru lab using the M-CREATE platform: PHP.C2, which is known to transduce the CNS of both C57BL/6J and BALB/cJ mice,23 and AAV.CAP-B 10 and AAV.CAP-B22, two PHP.eB-derived variants that were initially selected in Cre-transgenic mice but have demonstrated enhanced neurotropic activity in marmosets” (FIG. 17A). Applicant generated rAAVs with each of these 16 capsids packaging a human frataxin (hFN) transgene—the gene involved in the degenerative neurological disorder Freidreich's ataxia—under control of the constitutive CBh promoter. The transgene of rAAVs produced with each capsid contained a unique set of fifty 20-mer barcodes in the 3′UTR region, which allowed us to associate sequenced hFXN transcripts with a specific capsid variant (FIG. 17B). We administered a pool containing equal proportions of each of these 16 capsid variants by intravenous (IV) injection to C57BL/6J and BALB/cJ mice and cynomolgus macaques at a total combined dose of 3E+13 vg/kg.

Applicant found that the efficacy of each variant tested was linked to the animal model in which it was initially identified and no variant exhibited cross-species CNS-tropic behavior. Quantification of hFXN mRNA expression revealed that none of the eight variants selected in macaques were capable of enhanced transduction of the brain or spinal cord of either mouse strain (FIG. 17C-17D). Likewise, none of the seven mouse-derived variants that we tested effectively transduced the macaque CNS (FIG. 4E). Remarkably, the AAV.CAP-B10 and AAV.CAP-B22 variants, which have previously been shown to outperform AAV9 in transducing the marmoset brain following systemic administration,25 did not show increased performance in any area of the macaque CNS (FIGS. 4E and 20A).

Applicant were able to verify the efficacy of mouse-derived variants in the mouse strain in which each was initially discovered. All four mouse-derived variants identified in this study significantly outperformed AAV9 in transducing the brain and spinal cord of both mouse strains by a considerable margin, although of these four variants, only M.Mus.1 and M.Mus.2 were detargeted from the liver (FIG. 17C-17D). The AAV.CAP-B10, AAV.CAP-B22, and PHP.C2 variants performed exceptionally well in the C57BL/6J brain and spinal cord (FIG. 17C). PHP.C2 was also able to successfully transduce the BALB/cJ brain and spinal cord in line with previous findings23 (FIG. 17D). However, as with the PHP.eB variant from which they were derived,27,28 the CNS tropism of AAV.CAP-B10 and AAV.CAP-B22 did not extend to BALB/cJ mice (FIG. 17D).

Applicant found that a number of variants discovered during the macaque selections in this study had increased potency over AAV9 in the macaque CNS. Three PAL family variants, PAL1A-PAL1C, were significantly better at transducing all four lobes of the macaque brain as well as the thalamus, midbrain, and corpus callosum, but not the cerebellum, brain stem, or spinal cord (FIGS. 17E and 20A). These three variants were additionally significantly detargeted from the dorsal root ganglia (DRG) (FIG. 17E). M.Fas.1-3 did not demonstrate significantly increased potency in the cerebrum, but unlike the PAL variants, they effectively transduced the spinal cord (FIG. 17E). All macaque-derived variants except for M.Fas.3 were significantly detargeted from the macaque liver compared to AAV9 as measured by both transgene mRNA expression and vector genome delivery (FIGS. 17E and 20B).

Applicant attempted to further optimize the PAL motif by performing a second-generation selection in cynomolgus macaques with the PAL motif fixed, varying only the second and sixth position of the 7-mer insert as well as the three flanking residues immediately upstream of the insert. Modifications to this upstream flanking region, corresponding to SAQ in wild-type AAV9, have previously resulted in the enhanced transduction of PHP.eB compared to PHP.B.22 From this selection Applicant chose the second-generation PAL variant PAL2, with the sequence EVGPTQGTVR (SEQ ID NO: 332), for further study due to its relatively high performance and its similarity with the top first-generation variant PAL1A. Applicant produced rAAVs with AAV9 and PAL2 each encoding hFXN under control of the CBh promoter and systemically administered 3E+13 vg/kg of each virus, for a total dose of 6E+13 vg/kg, to one female cynomolgus macaque. In order to distinguish between genomes and transcripts from the two different capsids, Applicant tagged the hFXN transgene with an HA or FLAG epitope tag in PAL2 and AAV9 capsids, respectively.

Applicant assessed both vector transgene delivery and expression throughout a variety of tissues and found that PAL2 facilitated between a fourfold and sixfold increase in transgene mRNA expression throughout the cerebrum compared to AAV9, except in the corpus callosum, where we only observed a 2.7-fold improvement (FIG. 18A and Table 8). As seen with the first-generation PAL1 variants, PAL2 transduction lagged in the cerebellum compared to the cerebrum, and in this experiment, we found only a 2.1-fold increase in mRNA expression from PAL2 in the cerebellum (FIG. 18A). Improvements in vector genome delivery were more modest; throughout the cerebrum and cerebellum we observed less than twofold more PAL2 vector genomes compared to AAV9 (FIG. 18A and Table 8). In line with our observations of first-generation PAL1 variants, PAL2 demonstrated one quarter of the vector genome abundance and one half of the mRNA expression in the liver relative to AAV9 (FIG. 18A and Table 8).

To further characterize transgene expression from PAL2, we performed immunostaining for the HA-tagged hFXN transgene. Applicant found that PAL2 transduction was broadly distributed throughout the cerebrum, and cells expressing HA-hFXN were found in diverse regions (FIG. 18B). Though AAV9 transduction in the brain is thought to be mostly limited to astrocytes rather than neurons,14-33 PAL2 demonstrated distinct neurotropic behavior: HA-hFXN expression was frequently observed in NeuN+ neurons in both the cortex and hippocampus (FIG. 18C-18D). Though PAL2 also outperformed AAV9 in transgene delivery and expression in the spinal cord (FIG. 18A and Table 8), transduction in the spinal cord was more limited to non-neuronal cell types (FIG. 18E).

As rAAVs have been successfully employed in the treatment of ocular diseases,34 Applicant also assessed the relative efficiency of PAL2 in the retinal pigment epithelium (RPE) and neuroretina (retina absent the RPE). Applicant found that PAL2 outperformed AAV9 in both transgene delivery and expression in the neuroretina by a factor of 3.8 and 13.4, respectively (FIG. 18A and Table 8). PAL2 vector genome abundance in the RPE was only 0.6-fold that of AAV9, but PAL2 nonetheless facilitated 2.3-fold greater mRNA expression in the RPE. Expression of the HA-hFXN transgene in the neuroretina was largely limited to photoreceptor cells, with expression particularly concentrated in the outer plexiform layer where bipolar and horizontal cells synapse with photoreceptors (FIG. 18F).

Though three first-generation PAL1 variants were significantly detargeted from the DRG, we found that PAL2 had increased DRG tropism compared to AAV9 (FIG. 18A and Table 8). Transduction of the DRG has been associated with neuroinflammation and neurodegeneration that can result in ataxia and other PNS deficits.17,20,35-37 Applicant therefore assessed the spinal cord and DRG for abnormal pathology. As the macaque was administered a pool containing both AAV9 and PAL2, Applicant are unable to distinguish the effects of one capsid from another; however, Applicant was able to assess the combined effect of the two vectors in the context of this experiment. Multiple DRG and spinal cord sections from the cervical, thoracic, and lumbar regions of the spine were analyzed by a neuropathologist who established severity scores ranging from 0 (within normal limits) to 5 as previously described.36,38 The macaque did not show abnormal pathology in any region tested (FIG. 18G).

In this example, Applicant used the previously described DELIVER method30 to identify the novel PAL family of capsids that offer enhanced transduction in the CNS of cynomolgus macaques after a single dose IV infusion (FIGS. 17A-17E and 18A-18H). This is the first example of engineered AAVs evolved de novo in macaques demonstrating increased CNS tropism in macaques following systemic administration. Applicant identified this family of capsids after just two rounds of selection in macaques, illustrating the utility of DELIVER in identifying potent AAV capsid variants in an additional tissue type. In a pooled characterization experiment assessing the performance of multiple engineered rAAVs in macaques, three PAL capsid variants (PAL1A-C) were capable of a moderate but statistically significant two- to threefold increase in transgene expression throughout the cerebrum (FIGS. 17A-17E and 20A-20B). The second-generation variant PAL2 displayed an even greater four to six-fold improvement in transgene expression in most areas of the cerebrum in one macaque. PAL2 was notably 13-fold more potent than AAV9 at transducing the neuroretina of one macaque (FIG. 18A-18H), suggesting the feasibility of using a systemically administered rAAV to treat a disease affecting both the brain and retina, such as Krabbe disease. Additional studies with a greater number of animal subjects will be required to fully assess the performance of this variant.

In addition to demonstrating increased CNS tropism in macaques, the PAL variants displayed a striking decrease in liver tropism both in terms of vector genome delivery and transgene mRNA expression (FIGS. 17E, 20B, and 18A, and Table 8). Identification of vectors with reduced liver tropism is key to harnessing the advantages conferred by systemic administration, as sequestration of viral particles in the liver following IV infusion both decreases the effective dose at the target tissue and can lead to severe liver toxicity.1,2,15,17-20 These results therefore suggest that PAL vectors could achieve therapeutic efficacy following systemic administration at a reduced dose and with a lower risk of liver toxicity.

Though the PAL variants are capable of enhanced transduction of the macaque CNS, we found that engineered variants identified in mice were universally unsuccessful. Variants such as MDV1A that were selected in mice via DELIVER were able to potently transduce the CNS of two mouse strains (FIG. 15A-15H), but none of the four mouse-selected variants identified in this study outperformed AAV9 in transducing any area of the macaque CNS (FIG. 17A-17E). Even more surprisingly, AAV.CAP-B10 and AAV.CAP-B22, two variants that were selected in mice and shown to have enhanced neurotropic properties in marmosets,23,25 also failed to outperform AAV9 in transducing the CNS of cynomolgus macaques, a primate more closely related to humans. The failure of AAV transduction profiles to translate from mice to primates is well documented and has hampered development of CNS-targeted rAAV therapies,26,29 but this finding that the performance of some variants in one primate species may not translate even to another primate species has worrying implications for the field. Though variants that retain their overall transduction behavior across a variety of model organism species are powerful tools from a preclinical standpoint and have been found for the skeletal muscle,30 the complexity of the CNS appears to pose additional challenges. It is therefore can be important that engineered AAVs are selected and evaluated in an appropriate animal model—one with the highest possible degree of similarity to humans—in order to maximize the likelihood of therapeutic efficacy in treating human neurological disorders.

The properties of the PAL variants and other variants identified in this study may be further enhanced in a number of ways. Firstly, additional iterations of directed evolution focusing on the 7-mer insert motif, flanking amino acids, or other areas of the capsid may result in improved or otherwise altered transduction properties as has been observed in the development of PHP.eB, AAV.CAP-B10, and AAV.CAP-B22.22,25 Secondly, though the advantages of systemic administration motivating this study are clear, refinement of intra-CSF delivery routes remains a promising area of research and may result in more robust transgene expression in the CNS33 at the possible expense of a higher risk of neuroinflammation and neurodegeneration.35-37,39 The combination of a PAL variant with an intra-CSF delivery method such as intrathecal or intracisternal injection may prove fruitful and suggest more varied applications for these variants. Finally, the inclusion of tissue-specific microRNA targets on the vector transgene can reduce transgene expression and associated side effects in off-target tissues. Similar strategies utilizing microRNAs have shown promising results in vivo in the context of both liver and DRG detargeting.38,40-43

In summary, this Example identifies of a variety of AAV capsid variants with neurotropic properties in either mice or cynomolgus macaques, including a more extensively characterized family of variants containing a PAL motif that are capable of enhanced transduction of the macaque CNS and reduced sequestration in the liver following a single IV infusion. These results suggest that rAAV-based therapies with PAL variants may achieve therapeutic efficacy at a reduced dose, minimizing both safety concerns and vector manufacturing challenges. Applicant additionally provides a list of the 1000 most highly enriched capsid variants in the CNS of macaques and two mouse strains (Table 8); further investigation and characterization of these variants may identify additional candidates for CNS gene therapy. Though Applicant was unable to identify any variants able to potently transduce both the mouse and macaque CNS, this finding indicates a critical need for appropriate animal models and a move away from the current paradigm of evolving CNS-tropic AAVs in mice. This Example, particularly the characterization of the PAL family of variants in macaques, represents a significant advancement towards safe and effective rAAV therapies for diseases of the CNS in humans.

All animal care, housing, and experimental procedures were carried out in accordance with the Broad Institute Institutional Animal Care and Use Committee (IACUC) and Biomere's IACUC.

Eight week old male and female C57BL/6J (JAX, #000664) and BALB/cJ (JAX, #000651) mice were purchased from the Jackson laboratory. All mouse AAV injections were performed retro-orbitally. Tissue samples were collected from the mice two weeks post-injection after whole body perfusion with either Dulbecco's phosphate-buffered saline (DPBS) (Gibco, #14190144) or DPBS followed by 4% paraformaldehyde (PFA).

Non-human primate studies were performed at Biomere (Worcester, MA, USA) in accordance with their standard operating protocols and procedures approved by their IACUC. Male and female cynomolgus macaques, approximately 2 years of age, with a serum AAV9 neutralizing antibody titer of less than 1:3 were selected for in vivo studies. For all experiments, macaques were injected via an IV bolus injection. Animals were euthanized after 3 weeks and perfused with DPBS, after which CNS, muscle, and organ tissues were harvested. Tissue samples were preserved in RNAlater stabilization solution (Invitrogen, #AM7024) prior to downstream processing.

CMV-EGFP plasmids used to produce EGFP-encoding AAV9 and MDV1A were generated by cloning the cytomegalovirus (CMV) promoter, EGFP coding sequence, and bovine growth hormone polyadenylation signal (bGH pA) into the pZac2.1 construct purchased from the University of Pennsylvania vector core. The AAV capsid library recipient plasmid was generated by assembling the human synapsin 1 (hSyn) promoter, AAV2 rep, AAV9 cap, and SV40 polyadenylation signal into an ITR-containing backbone. The AAV9 cap gene on the library recipient plasmid was modified to contain BsmBI restriction sites immediately after Q486 and Q588 to facilitate insertion of a variable peptide sequence. The pZac2.1-CBh-hFXN-HA-bGH and pZac2.1-CBh-hFXN-FLAG-bGH plasmids were assembled by cloning the hybrid CBh promoter,44 human frataxin coding sequence, HA tag, and bGH pA into the pZac2.1 plasmid backbone between the ITRs. As previously described, for the pooled characterization experiment, 12 bp barcodes were inserted immediately after the HA tag in the pZac2.1-CBh-hFXN-HA-bGH plasmid.30 Each variant in the pooled characterization experiment was associated with 50 unique barcodes that were randomly generated with a minimum Hamming distance of four between any two barcodes.

First round AAV capsid library plasmids were prepared by amplifying a section of the AAV9 cap gene with an NNK degenerate reverse primer to produce fragments encoding every possible random 7-mer peptide insertion after Q588. These fragments were then introduced into the BsmBI-digested capsid library recipient plasmid. This library has a theoretical diversity of 207 (1.28E+9) variants at the amino acid level, and we were able to identify at least 5E+6 unique capsid variants in our first-round capsid libraries based on next-generation sequencing. Second round libraries were generated through a similar method, but instead of NNK degenerate primers, a synthetic oligo pool (Agilent, Santa Clara, CA) was used to produce only selected variants of interest and synonymous DNA codon replicates. Libraries with the fixed PAL motif X1X2X3PX4QGTX5R were generated with a reverse primer containing NNK degenerate codons at the variable positions X1-X5. All cloning was performed using the NEBuilder HiFi DNA assembly master mix (New England Biolabs, Ipswitch, MA).

AAV capsid libraries and rAAVs were produced in HEK293 cells (CRL-1573, ATCC, Mannassas, VA) with the usual triple-plasmid transfection method.45 Briefly, HEK293 cells were seeded into 15 cm dishes at a density of 2E+7 cells per dish and transfected the following day using PEI MAX (Polysciences, Warrington, PA). For individual rAAV production, cells were transfected with 16 μg pALDX-80 (Aldevron, Fargo, ND), 8 μg Rep2/Cap plasmid, and 8 μg of the ITR-containing transgene plasmid per dish. rAAVs were harvested from the cells and media and purified by ultracentrifugation over an iodixanol gradient as previously described.45 A slightly modified protocol was used for the production of AAV capsid libraries. First, only 10 ng of the AAV capsid plasmid library was used per dish in order to prevent cross-packaging of variants and the formation of mosaic capsids, and 8 μg of pUC19 plasmid was included in the transfection to maintain the total amount of transfected plasmid. Second, 8 μg of Rep-AAP plasmid (a generous gift from Benjamin Deverman)21 was used in place of the Rep2/Cap plasmid. Finally, virus was harvested after 60 hours rather than the usual 120 hours in order to limit secondary transduction of virus-producing cells. All AAVs were titered by qPCR.

First- and second-round selections were performed in eight week old C57BL/6J and BALB/cJ mice and in two year old macaques. Six male and six female mice from each strain were used for each selection, and each mouse received a 1E+12 vg injected dose of either the AQ or DG capsid variant library. For the first round of selection in macaques, one male and one female were injected with 1E+13 vg/kg capsid library. For the second round of selection in macaques, two males and one female were injected with 3E+13 vg/kg AQ capsid library. For selection on the fixed PAL motif with modified flanking amino acids in macaques, two males were injected with 3E+13 vg/kg. In all selection experiments, three weeks after injection, animals were euthanized by perfusion with saline and whole brains were harvested. Spinal cords were additionally harvested from macaques. Fresh tissues were cut into 2 mm cubes and snap-frozen in liquid nitrogen before being stored at −80° C. Total RNA was extracted from at least 80% of the total tissue volume with TRIzol (Thermo Fisher, Waltham, MA) and mRNA was enriched from total RNA samples with oligo dT beads (New England Biolabs) and treated with Turbo DNase (Thermo Fisher). Subsequently, cDNA was synthesized with SuperScript IV reverse transcriptase (Thermo Fisher) and a capsid-specific primer (5′-GAAAGTTGCCGTCCGTGTGAGG-3′ (SEQ ID NO: 8590)). Capsid variant sequences were then amplified with Q5 High-Fidelity 2× master mix (New England Biolabs) and primers flanking the 7-mer insert (5′-ACAAGTGGCCACAAACCACCA-3′ (SEQ ID NO: 8591) and 5′-GGTTTTUAACCCAGCCGGTC-3′ (SEQ ID NO: 8592)) that added Illumina adaptors and unique indices (New England Biolabs). Amplicons were pooled at an equimolar ratio and sequenced on an Illumina NextSeq.

In Vivo rAAV Characterization

For comparison of vector genome delivery and transgene mRNA expression between AAV9 and MDV1A in mice, four male and four female 8 week old C57BL/6J mice and four male and four female 8 week old BALB/cJ mice were injected with 1E+12 vg of AAV9- or MDV1A-CMV-EGFP. Tissues were harvested two weeks after injection. For comparison of transgene expression via immunostaining, 8 week old C57BL/6J and BALB/cJ mice were injected with 5E+11 vg of AAV9- or MDV1A-CMV-EGFP. Tissues were again harvested two weeks after injection. For comparison of PAL2 and AAV9, one male two year old macaque was injected with 3E+13 vg/kg each of AAV9-CBh-hFXN-FLAG and PAL2-CBh-hFXN-HA. The macaque was euthanized by saline perfusion and tissues were harvested 3 weeks after injection.

For the pooled rAAV characterization experiment, eight in-house macaque-derived capsids, four in-house mouse-derived capsids, AAV.CAP-B10, AAV.CAP-B22, PHP.C2, and AAV9 were used to produce rAAVs packaging the barcoded CBh-hFXN-HA-bGH transgene. Equal amounts of each of the 16 barcoded rAAV pools were mixed and injected into two male and one female two year old macaques, three male and four female 8 week old C57BL/6J mice, and two male and two female 8 week old BALB/cJ mice. All animals were injected with a combined dose of 3E+13 vg/kg, or 1.875E+12 vg/kg per capsid variant. Animals were euthanized by saline perfusion and tissues were harvested 4 weeks after injection and total RNA was extracted and treated as described above, and macaque liver DNA was additionally isolated with QuickExtract DNA extract solution (Lucigen, Middleton, WI). cDNA was synthesized with a bGH pA-specific primer (5′-TTCACTGCATTCTAGTTGTGGTTTG-3′ (SEQ ID NO: 8583)) and DNA and cDNA were amplified with Q5 High-Fidelity 2X master mix and primers flanking the barcode region (5′-CCATACGATGTTCCAGATTACGC-3′ (SEQ ID NO: 8594) and 5′-CAATGTATCTTATCATGTCTGCTCGA-3′ (SEQ ID NO: 8595)). Amplicons with Illumina adapters and unique indices were pooled at equimolar ratios and sequenced on an Illumina NextSeq.

Next generation sequencing analysis of the results of selection experiments was performed as previously described.30 Briefly, Illumina sequencing reads were demultiplexed with bcl2fastq2-v2.17.1 and the 21 bp variant sequence was extracted from each read. Variants were counted in each sample and normalized to the sequencing depth of the run to assign each variant a reads per million (RPM) score. Variants were ranked according to the ratio of variant RPM in the sample to variant RPM in the matched sequenced virus library sample to account for unequal distribution of variants in the injected virus library. The highest scoring amino acid variants from the first round of selection in each animal model (10,000 from mice and 20,000 from cynomolgus macaques) were chosen for the second round selection. For each such amino acid variant, a sequence encoding the same peptide by synonymous DNA codons was included in the design of the second-round library to control for DNA sequence-specific effects. For variants with multiple synonymous sequences already observed in experimental samples, the highest scoring synonymous variant was included. For other variants, an artificial sequence was generated by randomizing each codon in the original sequence to a synonymous codon where possible. 5% of variants in the second round library encoded stop codons and were artificially added to the library to control for cross-packaging events during virus library production. Following the second round of selection, DNA sequence variants were ranked as described above, and amino acid variants were ranked according to the sum of the ranks of the two corresponding synonymous sequences. Variants identified in the selection with the fixed PAL motif were ranked as in the first round of selection.

For the pooled rAAV characterization experiment, a ratio was calculated for each barcode of the sample RPM to the RPM of that barcode in the matched sequenced virus library. For each capsid variant, the 10 strongest and 5 weakest barcodes across all samples in the sequencing run were identified according to this metric and removed as outliers from downstream analysis. The remaining 35 midrange barcodes for each variant were then used to determine average transgene expression in each sample as described above.

Pairwise dissimilarity scores between the top 1000 CNS-tropic capsid variants (corrected for synonymous DNA codon sequences) were calculated by adding the single-residue substitution score at each of the seven positions according to the BLOSUM62 substitution matrix. A matrix of dissimilarity scores was converted into a distance matrix by computing the distance metric d(s, t) between any two peptide sequences s and t by analogy with the scalar product as follows:

d ( s , t ) = s s + t t - 2 s t
    • where s|t is the dissimilarity score between s and t.46 This distance matrix was then used for k-medoids clustering with the scikit-learn package. The number of clusters k was chosen by maximizing the silhouette score.

Computational modeling of the VR-VIII loop of MDV1A, MDV1B, PAL1A, and PAL-like.1 was performed on the ProMod3-powered SWISS-MODEL server.47,48 AAV9 was used as a template for homology modeling (PDB: 3UX1)32 and all structures were visualized in PyMOL.

For transgene expression quantification, RNA was extracted from mouse and macaque tissues with TRIzol (Thermo Fisher) and treated with Turbo DNase (Thermo Fisher). cDNA was synthesized with SuperScript IV reverse transcriptase (Thermo Fisher) with an oligo-dT primer. For transgene delivery (vector genome quantification) experiments, DNA was extracted from mouse and macaque tissues with QuickExtract DNA extract solution (Lucigen) following pulverization of snap-frozen tissue with a Geno/Grinder 2010 (SPEX SamplePrep, Metuchen, NJ). Transgene mRNA and DNA were measured by qPCR using Taqman assays specific to the transgene (EGFP or HA- or FLAG-tagged hFN) mRNA or DNA or a housekeeping control (GAPDH). All measurements were quantified based on a standard curve generated by amplifying a gblock containing the target sequence of each Taqman assay, and absolute quantities of transgene mRNA and DNA were then normalized to the housekeeping gene.

Whole brains harvested from mice were fixed in 4% PFA for 1 h at room temperature, washed with DPBS, and cryoprotected in 30% sucrose at 4° C. overnight. Tissues harvested from the macaque injected with PAL2- and AAV9-CBh-hFXN were fixed in 4% PFA overnight at 4° C. and washed 3 times with DPBS. Fixed macaque tissues were cryoprotected in 15% sucrose at 4° C. overnight and then 30% sucrose at 4° C. for up to 3 days. Cryoprotected tissues were then embedded in O.C.T. compound (Sakura Finetek USA, Torrance, CA) and snap frozen in liquid nitrogen-chilled isopentane. Frozen tissue blocks were sectioned at a thickness of 12 μm on a CM1860 cryostat (Leica Biosystems, Wetzlar, Germany) and mounted onto Superfrost Plus slides (VWR, Radnor, PA). Whole 5 mm coronal slabs of fixed macaque brain hemispheres were embedded in 4% low melting point agarose (Sigma-Aldrich, St. Louis, MO) and 40 μm free-floating sections were collected in DPBS using a VT1000S vibrating blade microtome (Leica Biosystems).

IHCs were performed with an HRP micropolymer kit (ab236466, Abcam, Cambridge, UK) according to the manufacturer's instructions except where noted below. All primary antibody incubations on cryosections were performed at 4° C. overnight in blocking buffer containing 5% normal goat serum, 2% bovine serum albumin, 2% M.O.M. protein concentrate (Vector Labs, Burlingame, CA), and 0.1% Tween-20. Mouse brain cryosections were stained with a 1:1000 diluted rabbit anti-GFP primary antibody (A11122, Thermo Fisher). Following primary antibody incubation, sections were washed three times with PBS and incubated with HRP conjugate at RT for 30 minutes.

To visualize cells in the macaque brain expressing HA- or FLAG-tagged hFXN transgene, IHC was performed on 40 μm free-floating sections. Sections were blocked at room temperature for 1 hour in blocking buffer containing 5% normal goat serum with 0.2% Triton X-100 and then incubated with 1:7500 rabbit anti-HA antibody (ab9110, Abcam) in the same blocking buffer overnight at 4° C. with agitation. Following primary antibody incubation, sections were washed and incubated with 25% v/v HRP conjugate (ab236466, Abcam) diluted in PBS overnight at 4° C. with agitation.

For all IHC experiments, sections were washed with PBS following incubation with HRP conjugate and the signal was visualized with 3,3′-diaminobenzidine (Abcam) prepared according to the manufacturer's instructions for 3 minutes at RT. The free-floating macaque brain sections were then mounted onto HISTOBOND+ slides (Marienfeld, Lauda-Königshofen, Germany). Mouse sections were mounted with VectaMount AQ (Vector Labs). Macaque sections were dehydrated in a graded ethanol series, cleared with three changes of CitriSolv (Decon Laboratories, King of Prussia, PA), and mounted with VectaMount (Vector Labs). Sections were imaged on an EVOS M7000 all-in-one microscope using a 4× objective lens.

Cryosections of macaque brain, spinal cord, and neuroretina tissue were permeabilized for 10 minutes in 5% normal goat serum with 0.2% Triton X-100 before being blocked at room temperature for 1 hour in blocking buffer containing 5% normal goat serum, 2% bovine serum albumin, 2% M.O.M. protein concentrate (Vector Labs), and 0.1% Tween-20. Primary antibody incubations were performed overnight at 4° C. in blocking buffer; retina sections were labeled with 1:500 rabbit anti-HA (MA5-27915, Thermo Fisher) and 1:1000 mouse anti-rhodopsin (MA1-722, Thermo Fisher) antibodies, brain and spinal cord sections were labeled with 1:250 rabbit anti-HA (MA5-27915, Thermo Fisher) and 1:1000 mouse anti-NeuN (MA5-33103, Thermo Fisher) antibodies. Sections were washed three times with PBS before being incubated at room temperature for 30 minutes in blocking buffer with 1:500 goat anti-rabbit Alexa Fluor 488 (A11034, Thermo Fisher) and 1:500 goat anti-mouse Alexa Fluor 594 (A32742, Thermo Fisher) secondary antibodies. Brain and spinal cord sections were mounted with VECTASHIELD antifade mounting media (Vector Labs). Retina sections were treated with the TrueVIEW autofluorescence quenching kit (Vector Labs) according to the manufacturer's instructions and mounted with VECTASHIELD Vibrance antifade mounting media (Vector Labs). Sections were imaged on an EVOS M7000 all-in-one microscope using a 20X objective lens. Linear contrast adjustments were applied to images.

Macaque spinal cord and DRG sections were stained with hematoxylin and eosin (ab245880, Abcam) according to the usual method. A board-certified neuropathologist who was blinded to the experimental design reviewed anonymized slides and assigned a severity score between 0 (within normal limits) and five as previously described.38 Severity scores were established for the spinal cord and DRG on sections from three segments each from the cervical, thoracic, and lumbar regions.

All statistical analyses were performed in GraphPad Prism v8 (GraphPad Software, San Diego, CA). All data are presented as mean±SD where applicable. Datasets were tested for normality using the Shapiro-Wilk test at a significance level of 0.01. All datasets were tested for outliers using the ROUT method and Q=0.5%. Outliers were identified and removed from AAV9-injected female C57BL6J spinal cord RNA and DNA (one outlier each, FIG. 15A-15F) and C57BL/6J brain and liver RNA (one outlier each where the entire animal was removed from analysis, FIG. 17A-17E). For comparisons between AAV9 and MDV1A (FIG. 15D-15E), differences were tested for significance with Welch's t-test with Holm-Šidák correction for multiple comparisons. For comparisons between AAV9 and multiple other variants (FIGS. 17A-17E and 20A-20B), differences were tested for significance with a one-way ANOVA assuming equal variance and Dunnett's multiple comparison test using AAV9 as a control mean. All statistical tests were performed on raw data without normalization to AAV9, though mRNA expression data are presented normalized to the mean of AAV9 expression for greater interpretability.

  • 1. Deverman, B. E., Ravina, B. M., Bankiewicz, K. S., Paul, S. M., and Sah, D. W. Y. (2018). Gene therapy for neurological disorders: progress and prospects. Nat Rev Drug Discov 17, 641-659.
  • 2. Ojala, D. S., Amara, D. P., and Schaffer, D. V. (2015). Adeno-Associated Virus Vectors and Neurological Gene Therapy. Neuroscientist 21, 84-98.
  • 3. Wu, Z., Asokan, A., and Samulski, R. J. (2006). Adeno-associated Virus Serotypes: Vector Toolkit for Human Gene Therapy. Molecular Therapy 14, 316-327.
  • 4. Naso, M. F., Tomkowicz, B., Perry, W. L., and Strohl, W. R. (2017). Adeno-Associated Virus (AAV) as a Vector for Gene Therapy. BioDrugs 31, 317-334.
  • 5. Hocquemiller, M., Giersch, L., Audrain, M., Parker, S., and Cartier, N. (2016). Adeno-Associated Virus-Based Gene Therapy for CNS Diseases. Hum Gene Ther 27, 478-496.
  • 6. Afione, S. A., Conrad, C. K., Kearns, W. G., Chunduru, S., Adams, R., Reynolds, T. C., Guggino, W. B., Cutting, G. R., Carter, B. J., and Flotte, T. R. (1996). In vivo model of adeno-associated virus vector persistence and rescue. J Virol 70, 3235-3241.
  • 7. Mendell, J. R., Al-Zaidy, S., Shell, R., Arnold, W. D., Rodino-Klapac, L. R., Prior, T. W., Lowes, L., Alfano, L., Berry, K., Church, K., et al. (2017). Single-Dose Gene-Replacement Therapy for Spinal Muscular Atrophy. N Engl J Med 377, 1713-1722.
  • 8. Foust, K. D., Wang, X., McGovern, V. L., Braun, L., Bevan, A. K., Haidet, A. M., Le, T. T., Morales, P. R., Rich, M. M., Burghes, A. H. M., et al. (2010). Rescue of the spinal muscular atrophy phenotype in a mouse model by early postnatal delivery of SMN. Nat Biotechnol 28, 271-274.
  • 9. Leone, P., Shera, D., McPhee, S. W. J., Francis, J. S., Kolodny, E. H., Bilaniuk, L. T., Wang, D.-J., Assadi, M., Goldfarb, O., Goldman, H. W., et al. (2012). Long-Term Follow-Up After Gene Therapy for Canavan Disease. Science Translational Medicine.
  • 10. Hwu, W.-L., Muramatsu, S., Tseng, S.-H., Tzen, K.-Y., Lee, N.-C., Chien, Y.-H., Snyder, R. O., Byrne, B. J., Tai, C.-H., and Wu, R.-M. (2012). Gene Therapy for Aromatic 1-Amino Acid Decarboxylase Deficiency. Science Translational Medicine.
  • 11. Saraiva, J., Nobre, R. J., and Pereira de Almeida, L. (2016). Gene therapy for the CNS using AAVs: The impact of systemic delivery by AAV9. Journal of Controlled Release 241, 94-109.
  • 12. Zincarelli, C., Soltys, S., Rengo, G., and Rabinowitz, J. E. (2008). Analysis of AAV Serotypes 1-9 Mediated Gene Expression and Tropism in Mice After Systemic Injection. Molecular Therapy 16, 1073-1080.
  • 13. Gray, S. J., Woodard, K. T., and Samulski, R. J. (2010). Viral vectors and delivery strategies for CNS gene therapy. Ther Deliv 1, 517-534.
  • 14. Foust, K. D., Nurre, E., Montgomery, C. L., Hernandez, A., Chan, C. M., and Kaspar, B. K. (2009). Intravascular AAV9 preferentially targets neonatal-neurons and adult-astrocytes in CNS. Nat Biotechnol 27, 59-65.
  • 15. Huang, L., Wan, J., Wu, Y., Tian, Y., Yao, Y., Yao, S., Ji, X., Wang, S., Su, Z., and Xu, H. (2021). Challenges in adeno-associated virus-based treatment of central nervous system diseases through systemic injection. Life Sciences 270, 119142.
  • 16. Liu, D., Zhu, M., Zhang, Y., and Diao, Y. (2021). Crossing the blood-brain barrier with AAV vectors. Metab Brain Dis 36, 45-52.
  • 17. Hinderer, C., Katz, N., Buza, E. L., Dyer, C., Goode, T., Bell, P., Richman, L. K., and Wilson, J. M. (2018). Severe Toxicity in Nonhuman Primates and Piglets Following High-Dose Intravenous Administration of an Adeno-Associated Virus Vector Expressing Human SMN. Hum Gene Ther 29, 285-298.
  • 18. Gao, G., Lu, Y., Calcedo, R., Grant, R. L., Bell, P., Wang, L., Figueredo, J., Lock, M., and Wilson, J. M. (2006). Biology of AAV Serotype Vectors in Liver-Directed Gene Transfer to Nonhuman Primates. Molecular Therapy 13, 77-87.
  • 19. Morales, L., Gambhir, Y., Bennett, J., and Stedman, H. H. (2020). Broader Implications of Progressive Liver Dysfunction and Lethal Sepsis in Two Boys following Systemic High-Dose AAV. Molecular Therapy 28, 1753-1755.
  • 20. Palazzi, X., Pardo, I., Sirivelu, M., Newman, L., Kumpf, S., Qian, J., Franks, T., Lopes, S., Liu, J., Monarski, L., et al. (2021). Biodistribution and Tolerability of AAV-PHP.B-CBh-SMN1 in Wistar Han Rats and Cynomolgus Macaques Reveal Different Toxicologic Profiles. Hum Gene Ther.
  • 21. Deverman, B. E., Pravdo, P. L., Simpson, B. P., Kumar, S. R., Chan, K. Y., Banerjee, A., Wu, W.-L., Yang, B., Huber, N., Pasca, S. P., et al. (2016). Cre-dependent selection yields AAV variants for widespread gene transfer to the adult brain. Nat Biotechnol 34, 204-209.
  • 22. Chan, K. Y., Jang, M. J., Yoo, B. B., Greenbaum, A., Ravi, N., Wu, W.-L., Sanchez-Guardado, L., Lois, C., Mazmanian, S. K., Deverman, B. E., et al. (2017). Engineered AAVs for efficient noninvasive gene delivery to the central and peripheral nervous systems. Nat. Neurosci. 20, 1172-1179.
  • 23. Kumar, S. R., Miles, T. F., Chen, X., Brown, D., Dobreva, T., Huang, Q., Ding, X., Luo, Y., Einarsson, P. H., Greenbaum, A., et al. (2020). Multiplexed Cre-dependent selection yields systemic AAVs for targeting distinct brain cell types. Nat Methods 17, 541-550.
  • 24. Nonnenmacher, M., Wang, W., Child, M. A., Ren, X.-Q., Huang, C., Ren, A. Z., Tocci, J., Chen, Q., Bittner, K., Tyson, K., et al. (2021). Rapid evolution of blood-brain-barrier-penetrating AAV capsids by RNA-driven biopanning. Molecular Therapy—Methods & Clinical Development 20, 366-378.
  • 25. Goertsen, D., Flytzanis, N. C., Goeden, N., Chuapoco, M. R., Cummins, A., Chen, Y., Fan, Y., Zhang, Q., Sharma, J., Duan, Y., et al. (2022). AAV capsid variants with brain-wide transgene expression and decreased liver targeting after intravenous delivery in mouse and marmoset. Nat Neurosci 25, 106-115.
  • 26. Hordeaux, J., Wang, Q., Katz, N., Buza, E. L., Bell, P., and Wilson, J. M. (2018). The Neurotropic Properties of AAV-PHP.B Are Limited to C57BIJ6J Mice. Molecular Therapy 26, 664-668.
  • 27. Huang, Q., Chan, K. Y., Tobey, I. G., Chan, Y. A., Poterba, T., Boutros, C. L., Balazs, A. B., Daneman, R., Bloom, J. M., Seed, C., et al. (2019). Delivering genes across the blood-brain barrier: LY6A, a novel cellular receptor for AAV-PHP.B capsids. PLOS ONE 14, e0225206.
  • 28. Mathiesen, S. N., Lock, J. L., Schoderboeck, L., Abraham, W. C., and Hughes, S. M. (2020). CNS Transduction Benefits of AAV-PHP.eB over AAV9 Are Dependent on Administration Route and Mouse Strain. Molecular Therapy—Methods & Clinical Development 19, 447-458.
  • 29. Matsuzaki, Y., Konno, A., Mochizuki, R., Shinohara, Y., Nitta, K., Okada, Y., and Hirai, H. (2018). Intravenous administration of the adeno-associated virus-PHP.B capsid fails to upregulate transduction efficiency in the marmoset brain. Neuroscience Letters 665, 182-188.
  • 30. Tabebordbar, M., Lagerborg, K. A., Stanton, A., King, E. M., Ye, S., Tellez, L., Krunnfusz, A., Tavakoli, S., Widrick, J. J., Messemer, K. A., et al. (2021). Directed evolution of a family of AAV capsid variants enabling potent muscle-directed gene delivery across species. Cell 184, 4919-4938.e22.
  • 31. B6mer, K., Kienle, E., Huang, L.-Y., Weinmann, J., Sacher, A., Bayer, P., Stillein, C., Fakhiri, J., Zimmermann, L., Westhaus, A., et al. (2020). Pre-arrayed Pan-AAV Peptide Display Libraries for Rapid Single-Round Screening. Molecular Therapy 28, 1016-1032.
  • 32. DiMattia, M. A., Nam, H.-J., Vliet, K. V., Mitchell, M., Bennett, A., Gurda, B. L., McKenna, R., Olson, N. H., Sinkovits, R. S., Potter, M., et al. (2012). Structural Insight into the Unique Properties of Adeno-Associated Virus Serotype 9. Journal of Virology.
  • 33. Samaranch, L., Salegio, E. A., San Sebastian, W., Kells, A. P., Foust, K. D., Bringas, J. R., Lamarre, C., Forsayeth, J., Kaspar, B. K., and Bankiewicz, K. S. (2012). Adeno-Associated Virus Serotype 9 Transduction in the Central Nervous System of Nonhuman Primates. Human Gene Therapy 23, 382-389.
  • 34. Aguirre, G. D. (2017). Concepts and Strategies in Retinal Gene Therapy. Investigative Ophthalmology & Visual Science 58, 5399-5411.
  • 35. Hordeaux, J., Hinderer, C., Goode, T., Buza, E. L., Bell, P., Calcedo, R., Richman, L. K., and Wilson, J. M. (2018). Toxicology Study of Intra-Cistema Magna Adeno-Associated Virus 9 Expressing Iduronate-2-Sulfatase in Rhesus Macaques. Mol Ther Methods Clin Dev 10, 68-78.
  • 36. Hordeaux, J., Buza, E. L., Dyer, C., Goode, T., Mitchell, T. W., Richman, L., Denton, N., Hinderer, C., Katz, N., Schmid, R., et al. (2020). Adeno-Associated Virus-Induced Dorsal Root Ganglion Pathology. Human Gene Therapy 31, 808-818.
  • 37. Perez, B. A., Shutterly, A., Chan, Y. K., Byrne, B. J., and Corti, M. (2020). Management of Neuroinflammatory Responses to AAV-Mediated Gene Therapies for Neurodegenerative Diseases. Brain Sciences 10, 119.
  • 38. Hordeaux, J., Buza, E. L., Jeffrey, B., Song, C., Jahan, T., Yuan, Y., Zhu, Y., Bell, P., Li, M., Chichester, J. A., et al. (2020). MicroRNA-mediated inhibition of transgene expression reduces dorsal root ganglion toxicity by AAV vectors in primates. Science Translational Medicine 12, eaba9188.
  • 39. Novartis announces AVXS-101 intrathecal study update Novartis. https://www.novartis.com/news/media-releases/novartis-announces-avxs-101-intrathecal-study-update.
  • 40. Geisler, A., Jungmann, A., Kurreck, J., Poller, W., Katus, H. A., Vetter, R., Fechner, H., and Müller, O. J. (2011). microRNA122-regulated transgene expression increases specificity of cardiac gene transfer upon intravenous delivery of AAV9 vectors. Gene Ther 18, 199-209.
  • 41. Qiao, C., Yuan, Z., Li, J., He, B., Zheng, H., Mayer, C., Li, J., and Xiao, X. (2011). Liver-specific microRNA-122 target sequences incorporated in AAV vectors efficiently inhibits transgene expression in the liver. Gene Ther 18, 403-410.
  • 42. Xie, J., Xie, Q., Zhang, H., Ameres, S. L., Hung, J.-H., Su, Q., He, R., Mu, X., Seher Ahmed, S., Park, S., et al. (2011). MicroRNA-regulated, Systemically Delivered rAAV9: A Step Closer to CNS-restricted Transgene Expression. Molecular Therapy 19, 526-535.
  • 43. Geisler, A., and Fechner, H. (2016). MicroRNA-regulated viral vectors for gene therapy. World J Exp Med 6, 37-54.
  • 44. Gray, S. J., Foti, S. B., Schwartz, J. W., Bachaboina, L., Taylor-Blake, B., Coleman, J., Ehlers, M. D., Zylka, M. J., McCown, T. J., and Samulski, R. J. (2011). Optimizing Promoters for Recombinant Adeno-Associated Virus-Mediated Gene Expression in the Peripheral and Central Nervous System Using Self-Complementary Vectors. Hum Gene Ther 22, 1143-1153.
  • 45. Challis, R. C., Ravindra Kumar, S., Chan, K. Y., Challis, C., Beadle, K., Jang, M. J., Kim, H. M., Rajendran, P. S., Tompkins, J. D., Shivkumar, K., et al. (2019). Systemic AAV vectors for widespread and targeted gene delivery in rodents. Nat Protoc 14, 379-414.
  • 46. Fischer, I. (2002). Similarity-preserving Metrics for Amino-acid Sequences. In.
  • 47. Studer, G., Tauriello, G., Bienert, S., Biasini, M., Johner, N., and Schwede, T. (2021). ProMod3—A versatile homology modelling toolbox. PLOS Computational Biology 17, e1008667.
  • 48. Waterhouse, A., Bertoni, M., Bienert, S., Studer, G., Tauriello, G., Gumienny, R.,
  • Heer, F. T., de Beer, T. A. P., Rempfer, C., Bordoli, L., et al. (2018). SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Research 46, W296-W303.

Various modifications and variations of the described methods, pharmaceutical compositions, and kits of the invention will be apparent to those skilled in the art without departing from the scope and spirit of the invention. Although the invention has been described in connection with specific embodiments, it will be understood that it is capable of further modifications and that the invention as claimed should not be unduly limited to such specific embodiments. Indeed, various modifications of the described modes for carrying out the invention that are obvious to those skilled in the art are intended to be within the scope of the invention. This application is intended to cover any variations, uses, or adaptations of the invention following, in general, the principles of the invention and including such departures from the present disclosure come within known customary practice within the art to which the invention pertains and may be applied to the essential features herein before set forth.

ENGINEERED CENTRAL NERVOUS SYSTEM COMPOSITIONS (2025)
Top Articles
Latest Posts
Recommended Articles
Article information

Author: Gov. Deandrea McKenzie

Last Updated:

Views: 6116

Rating: 4.6 / 5 (46 voted)

Reviews: 85% of readers found this page helpful

Author information

Name: Gov. Deandrea McKenzie

Birthday: 2001-01-17

Address: Suite 769 2454 Marsha Coves, Debbieton, MS 95002

Phone: +813077629322

Job: Real-Estate Executive

Hobby: Archery, Metal detecting, Kitesurfing, Genealogy, Kitesurfing, Calligraphy, Roller skating

Introduction: My name is Gov. Deandrea McKenzie, I am a spotless, clean, glamorous, sparkling, adventurous, nice, brainy person who loves writing and wants to share my knowledge and understanding with you.