![]() Polypeptide material with flexible pore properties
专利摘要:
The invention is the flexible pore polypeptide material resulting from the two- or three-dimensional assembly of fusion proteins from at least two protein domains, wherein at least one domain is a double helix and at least one is the protein oligomerization domain. The invention relates to the polypeptide material which is e.g. in chemical catalysis and in the separation of molecules based on their properties. 公开号:AT511130A2 申请号:T94962009 申请日:2009-10-12 公开日:2012-09-15 发明作者: 申请人:Kemijski Inst; IPC主号:
专利说明:
1 (38588) HEL Polypeptide material with flexible pore properties Subject of the invention The subject of this invention is the bionanomaterial, a novel material resulting from the two- or three-dimensional assembly of fusion proteins from at least two protein domains, wherein at least one domain represents a double-walled segment and at least one P rotein oligomerization domain. The subject of the invention is also the fusion protein from which polypeptide material and DNA encoding the protein are prepared. The subject of the invention is also the use of polypeptide material for the separation of molecules according to their properties and for chemical or enzymatic catalysis. State of the art The membrane-based filtration systems are used for the preparation of drinking water, but increasingly also in the food industry, e.g. for the elimination of impurities from the water, in the pharmaceutical industry for the removal of infectious microorganisms and unwanted components, for the concentration of selected components as well as for the membrane chemical reactors and bioreactors. For the preparation of ultrafiltration systems for the separation of macromolecules organic polymers are generally used, the pore size is defined by physical methods for the creation of pores of arbitrary dimensions, for example, transverse bovine ···························· · Ff "2 fertilization of fibers or damping of solvents. The membranes, which consist of dense protein layers, were prepared from the bacterial S-layer (S-layer). Peng et al. (Peng et al., 2009, Nature Nanotechnology, 4, 353-357) describes the ultrafiltration membranes prepared from the protein ferritin. While the filtration success changes depending on the pH, which determines the charge of the analyte and the membrane, the pore size remains constant and is determined by the density of the protein molecules. A pore surrounded by three 12 nm diameter ferritin molecules with a diameter of 2.2 nm, which limits the usefulness of this system. The preparation of the membrane based on the stacking of protein molecules limits the range of possible pore dimensions as well as their geometry and chemical properties. U.S. U.S. Patent No. 6,756,039 B1 reports regular structures obtained by self-assembly of the fusion proteins consisting of rigidly linked monomers having the ability to self-assemble into oligomers. The patent presents the production of the discrete cage and the one-dimensional fiber and proposes a three-dimensional network of fusion proteins in which each fusion protein consists of two monomers which are combined to dimer and trimer. The choice of the corresponding monomers is relatively complex because the monomers are linked by the helix with orientation from the C-terminus of the first to the N-terminus of the second monomer. Such a rigid compound defines the orientation of the monomers in the fusion protein, and thus defines and limits also the structures that such a fusion protein can form as well as its use. WO / 2004/033487 and U.S. Pat. Patent Application 2008/0097080 A1 presents the protein network and structures consisting of fusion proteins, called protomers. These consist of monomers that combine to form oligomers with rational symmetry axes of the N-rank. The lack of potential use of the network described in patent WO / 2004/033487, as a carrier for protein crystallization and electro-microscopy, is the inability to change pore properties with only small component manipulations. Neither 6,756,039 B1 nor WO / 2004/033487 or U.S. Pat. Patent Application 2008/0097080 A1 describe the method for the preparation of the proteinic filtration membranes. The invention presented here describes the self-assembling proteins as building blocks, the changes of which influence the pore properties, whereby the materials with different technological properties can be prepared. One of the domains that make up the building blocks is explicitly defined as a double helix. It can be chosen according to the length, because this directly affects the pore size, or depending on the side groups of the amino acid residues on the surface exposed positions, if they do not prevent the formation of the double helix. These amino acid residues, mostly at positions b, c and f, determine the chemical properties of the pore and give the material new technological properties. Summary of the invention The invention relates to the polypeptide material having flexible pore properties resulting from the two- or three-dimensional assembly of fusion proteins from at least two protein domains, wherein at least one domain forms a double helix and at least one protein oligomerization domain having at least the oligomerization stage 3. The Oligomerisationsstufe is according to the invention between 3 and 12, usually between 3 and 6. The invention relates to the polypeptide material in which the pore shape and size are determined by the mode of connection of the protein domains of said fusion protein and the pore size also by the length of the double coil forming segment. The invention relates to the polypeptide material in which the length of the double helix-forming segment comprises between 2 to 100 heptads (14 to 700 amino acid residues). The invention relates to the polypeptide material whose technological properties are defined inter alia by the introduction of positively charged, negatively charged, hydrophobic, hydrophilic, cysteine, histidine and other amino acid residues which allow specific interactions with the surface exposed segments, while the segments required for the formation of the coiled coil are obtained. The amino acid residues that determine the pore technological properties are most often introduced to positions b, c, and f of the coiled-coil forming segment. The invention also relates to the polypeptide material in which the chemical properties of the pores are also defined by the properties of the protein domains of said fusion protein. These properties include, but are not limited to, the net charge, surface-exposed hydrophobic amino acid residues, and the size of the protein domains. The invention relates to the polypeptide material with flexible pore properties, which results from the two- or three-dimensional assembly of fusion proteins from at least two protein domains, wherein at least one domain forms a double helix and at least one protein oligomerization domain and the domains with each other in any order however, may be linked by the flexible linker of 1 to 20 amino acid residues, preferably 1 to 6, but the fusion protein may also contain signal sequence for directing protein rejection outside the cell and polypeptide tagging. The invention relates to the flexible pore polypeptide material in which the double helix-forming segment is based on the sequence SEQ ID NO: 2, SEQ ID NO: 4 or SEQ ID NO: 6 or the intended peptides with functionally similar properties. The invention relates to the polypeptide material with flexible pore properties, in which the protein oligomerization domain is preferably selected from the sequences SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 10. 84 and the sequences having greater than 50% homology with these sequences which have the ability to form oligomers of the same type. The invention relates to the polypeptide material with flexible pore properties, which is characterized by the composition of the sequences SEQ ID Nos. 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82 selected fusion proteins. The invention relates to the polypeptide material with flexible pore properties, prepared with the mixing of two fusion proteins according to claims 1 to 10, wherein connect the segments of the two fusion proteins, which form the double helix, into a parallel double helix. The oligomerization domain of one fusion protein is located at the N-terminus and the other at the C-terminus, but the oligomerization domains have the same level of oligomerization. The invention relates to the polypeptide material with flexible pore properties, for the preparation of which the coiled-coil-forming segments are selected from natural or the planned parallel coiled coils or from the following pairs: SEQ ID NOs: 16, 18, 20, 22, 24, 26 , 28, 30, 32. The invention relates to the polypeptide material having flexible pore properties, for the preparation of which the protein oligomerization domains are chosen from among the tetrameric proteins, preferably among the sequences SEQ ID No. 10 and SEQ ID No. 12 or among the trimeric proteins, preferably below SEQ ID NO: 8 and SEQ ID NO: 84. The invention relates to the DNA having the information for the polypeptide material having flexible pore characteristics, which is operably linked to regulatory elements, the promoter and the terminator, which regulate the expression of the fusion proteins in the cell of the host. The invention relates to the polypeptide material having flexible pore properties for the separation and concentration of molecules, molecule complexes, viruses or nanoparticles according to their properties. The invention relates to the polypeptide material having flexible pore properties for chemical catalysis. Description of the pictures Figure 1: A) The scheme of the antiparallel double helix dimer with the arrangement of the amino acid residues in the peptidic heptad motif. The amino acid residues at positions b, c and f are exposed at the surface and can thus be modified, whereby the pore properties can also be changed. B) The Scheme of the Polypeptide Material Consisting of the Assembly of Fusion Proteins from a Tetramerization Domain and a Domain Forming an Anti-parallel Double Coil Dimer. C) The scheme of the material from point B), which shows that the change in the length of the coiled-coil forming segment and the introduction of the positive charge to the exposed sites in the coiled-coil dimer affect the physico-chemical and technological pore properties. Figure 2: A) Isolation of the fusion protein Dimtetra-A1. The expression of the fusion protein dimetra-A1 in the cell lysate (line 1) and the insoluble fraction {inclusion body) (line 2) was tested. Pure Dimtetra-A1 protein was obtained by flushing of inclusion bodies, dissolution in 6M GdnHCl, and Milli-Q water dialysis (line 3). On line 4 is the polypeptide standard. B) The long-wave UV CD spectrum of the slurry of polypeptide material from the protein Dirn- • * * * * * * ·· * • * * * * * * * t • • B | 4 * · · V ί * • tt · f · · · 9 · * 4 »» * · · «*» ·· · * 7 tetra-A1 has a secondary alpha helix structure, which is the corresponding twist of the Dimtetra-Al confirmed in the material. Figure 3: The filtration of M13 bacteriofagens (A) and the dye dextran blue (B) through the membrane prepared from the fusion protein Dimtetra-Al. After filtration of the bacteriophage they were not detected in the filtrate (n. N.). The dye concentration was determined by the absorbance measurement at 625 nm. Detailed description of the invention Before further description, it should be understood that the invention is not limited solely to the descriptions presented, but that the modifications of certain descriptions may be within the scope of the claims. Unless otherwise specified, all technical and scientific terms used herein have the same meaning as commonly known to those of ordinary skill in the art. The purpose of the terminology used in describing the invention is to illustrate a particular segment of the invention and not the limitation of the invention. All publications mentioned in the description are cited as references. The description of the invention and the claims are written in the single number, but they also include the majority, but this is not particularly emphasized in the description for the sake of simplicity. polypeptide material The basis of this invention is the discovery that by combining double helix-forming domain fusion proteins and the protein oligomerization domain, two- or three-dimensional polypeptide materials having flexible pore properties can be obtained. The usability of such materials is high, from the separation of molecules to chemical catalysis. • • • • • »t» · »ft · * · · · · ·« «9 * < · '* r | «T · · · 0 9 9 · Ψ * 4 9 # ♦» »» · * 8 The term 'polypeptide material with flexible pore properties' refers to the material having pores of particular shape, size and physicochemical properties. The invention relates to the polypeptide material prepared from fusion proteins consisting of at least two protein domains, wherein at least one of the domains forms the coiled coil and at least one domain is the proteinic oligomerization domain. The domains may occur singly or unified at the level of the proteins with the chemical reaction, but they may also be genetically encoded as a fusion protein. The domains can be derived from natural proteins or, according to known information about the properties of the coiled-coil proteins (US Pat. No. 7,045,537 B1), are designed to develop the desired properties. Double helix-forming segments At least one domain of said fusion protein that binds to polypeptide material is a double helix-forming segment. The term 'double helix-forming segment' refers to the motif of heptad repeats of the amino acid sequence wherein the amino acid residues of a heptad are labeled abcdef, residues a and d are predominantly hydrophobic and e and g are predominantly charged. Upon interaction of two or more such polypeptide chains (double helix-forming segments), these assume the helical shape, which wrap around each other and thus form the 'double helix'. In the single helix, the heptad motif is repeated approximately every two turns. When the polypeptide chains form a dimer in the form of a coiled coil, the amino acid residues at positions a and d at each heptad repeat represent a hydrophobic core of the coiled coil and the residues at positions e and g interact through electrostatic interaction (Figure 1A ). In such complexes, amino acid residues at positions b, c, and f no longer contribute to the intermolecular interactions in the resulting dimer. Thus, at positions b, c and f, amino acid residues with different properties may be present, e.g. charged, hydrophobic, cysteines, and others that facilitate chemical modification, metal chelation, specific interactions, etc., and over which the inventors have found that they can be used to modify chemical pore properties. The amino acid sequences for double helix can be planned or chosen from natural double helix-forming peptides or double helix-forming protein domains. Such domains are found in numerous such proteins, e.g. Transcription factors, oncoproteins, tropomyosin, viral proteins, etc. The basis of this invention is the discovery that the physico-chemical pore properties (which are limited by coiled coils coupled to oligomerization domains) can be defined and adjusted with the change in length of the coiled-coil forming segment, and by alteration and modification of the amino acid residues on the Positions b, c and f. These residues may be positively or negatively charged, hydrophobic or hydrophilic, as well as chelators for metal clays, those that enable chemical modifications (Asn, Gin, Cys) or specific interactions; the only limitation is that these changes can not prevent the formation of the double helix. The essence of this invention is the ability to define and modify the pore size with the help of the length of the coiled-coil forming segment. The term 'pore size' refers to the dimensions of the pore surrounded by the fusion proteins according to the invention. The length of the coiled coil forming segment may comprise from 2 to more than 150 heptads (14 to 1050 amino acid residues), most often 2 to 100 heptads (14 to 700 amino acid residues). The double helix according to the invention (which results from the merging of said segments) can consist of identical (homologous double helix) or different segments (heterologous helix). Identical coiled-coil segments are proteions with the same primary structure. Different double helix-forming segments are proteins of different primary structure that can combine to form coiled coils, with at least two different segments always joining in a double helix. If all 10 If the polypeptide chains (double helix-forming segments) in the coiled coil are in the same direction, the chain orientation is parallel, but if they run in opposite directions, it is antiparallel. The principles that govern the properties of the double helix are well known to those skilled in the art. Table 1 gives a few natural double helix-forming segments as well as some sequences designed by the inventors. Table 1: Double spiral forming segments Designation SEQ ID NO: (sequence number for) DNA, protein sequence Oligomerization step Double helix type Bcr 5, 6 2 Antiparallel homodimer APH 3,4 2 Antiparallel homodimer APH-1 1,2 2 Antiparallel homodimer GCN 19, 20 2 Parallel homodimer P1 15 , 16 2 parallel heterodimer with P2 P2 17, 18 2 parallel heterodimer with P1 P3 21,22 2 parallel heterodimer with P4 P4 23,24 2 parallel heterodimer with P3 P5 25,26 2 parallel heterodimer with P6 P6 27,28 2 parallel heterodimer with P5 P7 29, 30 2 parallel heterodimer with P8 P8 31,32 2 parallel heterodimer with P7 Protein oligomerization The basis of this invention is also that at least one fusion protein domain is the oligomerization domain. The term 'oligomerization domain' usually, but not exclusively, refers to the natural proteins or their parts which tend to bind to the same or different proteins or their parts. Thus, they form homologous or heterologous dimers or 11 Oligomers with a higher degree of oligomerization. The homodimer {homologous dimer) forms two identical protein domains and the heterodimer (heterologous dimer) forms two distinct protein domains. The term 'identical protein domains' means that the proteins have the same primary structure and 'different protein domains' are proteins of different primary structure. The structures with a higher degree of oligomerization consist of at least three protein domains, which may have the same {homologous oligomers) or different (heterologous oligomers) primary structure. The structural information about the chosen natural protein oligomerization domains is well known to those of skill in the art. The number of amino acid residues containing the domains according to the invention is usually 5 to 400, more often 15 to 200. The term 'oligomerization stage' refers to the number of protein domains that make up the oligomer and may be 3 to 12 , usually 3 to 6. Table 2 lists some examples of protein oligomerization domains. The basis of this invention is that also the properties of oligomerization domains, for example the isoelectric point and domain size, determine the pore properties and that the pore properties are also influenced by the oligomerization stage of the oligomer. Table 2: Protein Oligomerization Domains Designation DNA / protein sequence SEQ ID NO. Oligomerization step CutA1 83, 84 3 p53 Tetramerization domain 9, 10 4 Shaker tetramerization domain {Tshak) 11, 12 4 Verotoxin pentamerization domain (Vero5) 13, 14 5 left In contrast to the fusion proteins in patent U.S. Pat. Patent No. No. 6,756,039, in which the protein domains are interconnected with the rigid linker (such as alfa-helix), the linker linking the protein domains in the fusion protein of this invention is flexible. 12 • * The term 'linker' refers to a shorter amino acid sequence whose sole purpose is to separate the individual domains or protein segments. The role of the linker peptide in the fusion protein could also be to interrupt the regular secondary structures, impart flexibility, and post-translate modifications and better processing. The length of the linker is not limited, it is determined by desired technological material properties and most often it does not comprise more than 30 amino acid residues, but preferably the length is 1 to 20 amino acid residues, most often 1 to 6. The choice of amino acid residues however, small or hydrophilic amino acids or amino acids that introduce specific conformational or functional properties, such as serine, glycine, threonine, proline, valine, alanine, cysteine, and others, are most common. Fusion proteins and self-assembling material consisting of them The fusion proteins in the context of this invention consist of the described protein domains which are linked to the linker described above. The protein domains can occur in different order in the fusion protein, which, however, can influence the technological material properties. The number of protein domains (the segments that can form the coiled coil and the protein oligomerization domains) in the fusion protein can be up to 8, most often 2 to 4. The basis of this invention is that the selected protein domains of said fusion protein, under appropriate conditions, tend to combine to form two- or three-dimensional polypeptide material. The term 'two-dimensional material' refers to the planar structure of fusion proteins (the size of the composite material may be larger in two dimensions than in the third). The term 'three-dimensional material' refers to the material in which the fusion proteins are composed in three dimensions. In order for the two- or three-dimensional protein assembly to be possible at all, at least one protein domain (double helix-forming segment or the oligomerization domain) has to form oligomers with oligomerization stage 3 or more. The most important properties of the polypeptide material are: a) the pore shape and size are determined by the self-assembly of said domains of the fusion protein according to the invention b) the pore size is linked by the length of the segment that forms the coiled coil and in the fusion protein with the oligomerization domain , determined c) the pore properties are determined by physicochemical properties of the amino acid residues at the positions b, c in f in the double helix-forming segment d) the pore properties are also determined by the properties of said oligomerization domains in the fusion protein By self-assembling a fusion protein whose properties can be varied by choosing the protein domains determined by the design of the fusion protein according to the invention, different versions of the polypeptide material with flexible pore properties can be prepared. However, when fusion proteins whose domains are heterogeneously linked (such as homologous coiled coils) are used, more than one type of fusion protein must be used to prepare the polypeptide material. The number of fusion proteins used in this case is 1 to 10, usually 1 to 3. Pore Preparation from One or Two Different Fusion Proteins According to the invention, the polypeptide material may consist of a single or two or more different types of fusion protein, each fusion protein consisting of a double helix-forming segment and the oligomerization domain. The inventors discovered that in the case of preparation of the material of only one fusion protein type, the double helix-forming segment must be an antiparallel homodimer, but in the case of preparation of the material of several fusion protein types, the double helix-forming segment may have an antiparallel hete. 14 may be a rodimer or a parallel heterodimer. When said coiled-coil is a parallel heterodimer, the oligomerization domain must be in a fusion protein at the N-terminus and in the second fusion protein at the C-terminus of the protein. If one wishes to achieve the proper fusion of the fusion proteins to protein membranes, the oligomerization domains must have the same oligomerization stage, for example the trimer (the trimerization domain examples are CutA1 SEQ ID NO: 84 and Foldon, SEQ ID NO: 8) or the tetramer (the examples of the tetramerization domain are p53 SEQ ID NO: 10 and the tetramerization domain of the Shaker channel, SEQ ID NO: 12). If the double helix is more stable than the oligomerization domain, the oligomerization domains in both fusion proteins can be the same at the C- or N-terminus. Signal peptide, peptide labels The various amino acid sequences at the two termini or between the fusion protein segments that are not needed to form the polypeptide material may be added to the fusion protein for ease of purification and protein detection, as well as to introduce additional functional properties for the polypeptide material. The terms 'signal sequence' or 'signal peptide' refer to the amino acid sequence that directs the protein to the particular site in the cell. Signal sequences differ depending on the organism in which the fusion protein is expressed. Which signal sequence acts in which organism as well as the amino acid sequences of said signal sequences are well known to those skilled in this field. The term 'peptide labels' refers to the amino acid sequence added to the protein for easier purification / isolation / protein detection. The position of the signal sequence and peptide tags is arbitrary if it does not interfere with protein expression and retains the function for which the amino acid sequence was chosen, which is known to those of skill in the art. 15 t ff • « Fusion proteins with nucleic acid coding and host organisms The invention also relates to the fusion protein of the polypeptide material and the DNA. The term 'DNA / nucleic acid' refers to polynucleotide molecules such as DNA and RNA, including cDNA, genomic DNA, synthetic DNA, hymeric DNA and RNA. The nucleic acids can be single-stranded or double-stranded. They may contain nucleic analogues or derivatives. The host organism synthesizes the DNA-encoded fusion protein according to the invention by means of the heterologous DNA with the information for the fusion protein. The term " homologous protein " refers to proteins with preserved functional properties and the resulting amino acid sequences, which are usually obtained at least up to 50%, with the minimum of maintenance being 20%. The similarity (conservation state) of the sequences is determined by the method of providing the amino acid sequences known to those of skill in the art. The heterologous nucleic acid is generally involved in the expression vector. The expression vector generally contains the operably linked control elements operably linked to the DNA of the invention carrying the information of the fusion protein. Of course, the controls are chosen in a manner that corresponds to the utterance. The promoter may be constitutive or inducible depending on the desired utterance pattern. The promoter may be native or of foreign origin (not represented in the cells used) and may also be natural or synthetic. The promoter is chosen to function in the target cells of the host organism. Also included are initiation signals for efficient translation of the fusion protein, including ATG and the associated sequences. 16 • c * * • * The corresponding vectors include, but are not limited to, plasmids, virus vectors and others. The expression vector can be prepared for expression in prokaryotic and eukaryotic cells. For example; prokaryotic cells are bacteria, especially Escherichia coli. According to the invention, the use of prokaryotic cells serves to prepare the appropriate amount of nucleic acid and for protein production. The invention also includes the nucleic acid-containing host cells and organisms that are encoded for the fusion protein according to the invention. The term " host organism " refers to an organism into which the DNA with the protein code is introduced, with the intention of expressing itself. The corresponding host cells are well known and may be prokaryotic and eukaryotic. The vector transport into the host organisms proceeds by conventional methods, the methods being transformation and transfection, which include chemical transport, electroporation, microinjection, DNA lipofection, cell sonication, particle bombardment, transport the virus DNA and others. Transient expression refers to the delivery of vector DNA that is not included in the cell genome. Stable delivery is achieved by including the DNA in the host organism according to the invention. The DNA transfer, especially for the preparation of the host organism with integrated stable DNA, can be controlled by markers. The marker-encoding DNA refers to drug resistance, e.g. Antibiotics, and may be integrated in the vector according to the invention or in a separate vector. The host organism may be prokaryotic and eukaryotic. The eukaryotic cells for the expression of the fusion proteins must be designed so that the cell lines are compatible with the methods of propagation of the expression vector and expression of the fusion protein. Suitable, but not limiting, cell lines include fungi, yeasts, and herbal and mammalian Cells, such as Fibroblasts from mice, rats, monkeys and humans. For DNA expression, any bacterial host can be used according to the invention. However, according to the invention, the use of bacteria and yeasts is desirable for protein expression. The protein may be expressed in the bacteria E. coli or B. subtilis or in yeasts S. cerevisiae or P. pastoris. For protein expression, the most desirable bacterium is E. coli. The invention relates to protein expression in bacteria and yeasts. The invention relates to bacteria and yeasts expressing the protein, the most desirable being the bacterium E. coli and the yeast P. pastoris. Preparation and use of polypeptide material for the separation of the molecules according to their properties The property that the proteins according to the invention must have is the joining to polypeptide material under appropriate conditions. When the polypeptide material is formed from a fusion protein, appropriate conditions are sought by observing the solubility, the macromolecular structure, determining the secondary structure of the soluble portion, and especially the precipitates in buffers of various pH's (mostly pH 3 to pH 9) ), the ionic starch in the presence of various organic solvents (such as DMSO, acetonitrile, trifluoroethanol, hexafluoroisopropanol). When preparing the filter system for the separation of the molecules, it is important that the material combines with as few connection defects as possible. This can be accomplished by stepwise rewindering away from the conditions favoring the monomeric and soluble structures to the conditions that enable the formation of intermolecular interaction via the coiled-coil and proteinic oligomerization domains. Important factors in self-assembly are also the concentration of the fusion protein and the temperature. The polypeptide material suitable for filtration forms when the Protein concentration is between 0.1 _g / mL and 20 mg / mL, usually between 0.05 and 10 mg / mL. The self-assembly of the polypeptide material can be triggered by slowly cooling the fusion protein from the temperature above the melting point to the temperatures below the temperature threshold, usually to room temperature. The polypeptide material designed to result from the assembly of multiple fusion proteins will assemble upon fusion of the fusion proteins in the appropriate molar ratio and conditions that allow for assembly. Formation of filtration units When the polypeptide material is fully formed, it may be further crosslinked with compound reagents, such as glutaraldehyde. This polypeptide material can be used to separate molecules ranging in size from about 1 nm to about 100 nm, depending on pore size and properties. The potential applications include, for example, the filtration of pharmaceutical preparations, the removal of impurities from medical preparations, the removal of pathogens such as viruses, prions, etc., the purification of wastewater, the separation of nanoparticles by size, etc. An additional important property of the polypeptide material with pores with flexible properties is also that in addition to the pore size and other pore properties, eg For example, the charge, the specific reagents immobilized to pores, etc., allow the separation of molecules of similar size, but they differ in other characteristics such as polarity, charge, shape, specific surface properties, etc. Use of the polypeptide material with flexible pore properties for catalysis The import of certain amino acid residues, e.g. Histidine and cysteine, at positions b, c and f of the coiled coil, are made possible by the binding of the metals to the pores, which thereby act as catalysts. Enzymes which serve polypeptide material can also be bound to the polypeptide material having wider pores 19 thus as a tool for enzymatic catalysis. By introducing the active sites (e.g., the catalytic triads) of appropriate geometry into the pores of said material, artificial enzymes can also be formed. In the following, construction examples are given, the purpose of which is to illustrate the invention. The description of individual creation examples is not intended to limit the invention, but is presented as a demonstration of the operation of the invention. applications Example 1. Preparation of DNA Constructs The DNA sequences for the above-mentioned protein domains were prepared from amino acid sequences of the selected domains using Designer {DNA2.0. Inc.). This program allows the user to create the DNA fragments as well as the expression optimization for the chosen host (e.g., E. coli) by codon optimization. The genes were ordered from Mr. Gene GmbH (Gewerbepark B32, D-93059 Regensburg), they were excised with restriction endonucleases and cloned into the appropriate vector with all the necessary regulatory sequences known to those of skill in the art. Selected vectors include the commercially available vectors pET, pRSET A, and pSB1A2 (pSB1A2 http://partsregistry.Org/Part:pSB1A2), which possess all the required properties, for example, resistance to antibiotics, the site of doubling, the area of several places for cloning. For the preparation of DNA constructs, the methods of molecular biology were used (the cutting of DNA with restriction endonucleases, the propagation of DNA by chain reaction using polymerase PCR, PCR ligation, detection of DNA concentration, agarose gel electrophoresis, purification of the DNA fragments from the agarose gels, the ligation of the DNA fragments into the vector, the transformation of the chemically competent cells E. coli DH5 ..., the isolation of the plasmid DNA with commercial putty and selection). All procedures were performed at 20 sterile conditions. The DNA segments were characterized by restriction analysis and sequencing. The methods of molecular cloning are described in detail in the Handbook of Molecular Biology (Sambrook J., Fritsch E.F., Maniatis T. 1989. Molecular cloning: A laboratory manual, 2nd Ed., New York, Cold Spring Harbor Laboratory Press: 1659 p.). Example 2. The Production of Fusion Proteins Comprising the Antiparallel Homodimer Coil and the Tetramerization Domain To illustrate the feasibility of producing the fusion protein consisting of the antiparallel homodimer coplanet and the tetramerization domain, we prepared several DNA constructs (Table 3) ). Table 3: Fusion proteins consisting of the antiparallel homodimer double helix and the tetramerization domain p53 or the tetramerization domain of the Shaker channel. No. Designation Composition of Construct SEQ ID NO: 1 Dimetra-Al APH1 p53T 34 2 Dimetra-AIGSGS APH1 GSGS p53T 36 3 Dimetra-A APH p53T 38 4 Dimetra-B Bcr p53T 40 5 Tetradim-Al p53T APH1 42 6 Tetradim-A p53T APH 44 7 Tetradim-B p53T Bcr 46 8 A1-Tshak APH1 TShak 48 9 A-Tshak APH TShak 50 10 B-Ts hak Bcr TShak 52 11 Tshak-A1 TShak APH 1 54 12 Ts hak-A TShak APH 56 13 Tshak -B TShak Bcr 58 In addition to altering the orientation of protein domains in the fusion protein, the length of the linker can also be altered, from 2 to 20 amino acid residues, and the His6 peptide label can be located at the N-terminus or C-terminus of the protein. The plasmids encoding the open reading frames for fusion proteins of Table 3 were transformed by the inventors into chemically competent cells E. coli BL21 (DE3) pLysS. The selected bacterial colonies grown on LB plates with the addition of the antibiotic (ampicillin, kanamycin) were added to 100mL of broth LB supplemented with the antibiotic and incubated overnight at 37 ° C. The resulting culture was diluted 20-50 times the next day to reach the OD600 of the diluted culture between 0.1 and 0.2. When the bacterial culture grew to between OD600 and 0.6-0.8, the induction of protein expression by the inducer IPTG followed (0.4 mM to 1 mM). In some examples, the incubation temperature of the bacteria was reduced 0.5 hours before induction (25 aC-30 ° C). Two to five hours after induction, centrifugation of the broth was followed. The cells were then resuspended in the buffer for lysis (Tris pH 8.0, 0.1% deoxycholate with addition of protease inhibitors) and frozen at -80. The thawed cell slurry was lysed by sonication and spin. Using the SDS-PAGE techniques and, if needed, Western blot analysis using anti-His-tag antibodies as primary antibodies, the expression of the constructs in the precipitate or supernatant was tested. All constructs with p53T were generally contained in insoluble fractions (inclusion bodies) where the desired protein > Represented 80% of the total proteins. The inclusion bodies were rinsed several times with lysis buffer, twice with 2M urea and 10 mM Trisome pH 8.0 and once with Mili-Q water. Most of the time, such treatment resulted in greater than 95% purified protein. If the protein still contained impurities despite the treatment, the inclusion bodies were dissolved in 6 M GdnHCl pH 8.0 and applied to the Ni2 + NTA column (Qui-agen, GE). Purification under denaturation conditions was performed according to the manufacturer's instructions. After elution with 250 mM imidazole pH 5.8 22 • • • *, the fractions containing proteins were pooled and dialyzed twice with 10 mM HEPES pH 7.5 or another appropriate buffer. When the fusion protein was contained in the supernatant, it was applied by the inventors to Ni2 + -NTA column and purified under native conditions. After elution with 250 mM imidazole pH 5.8 or 500 mM imidazole pH 8.0, the fractions containing the desired protein were pooled and dialyzed twice with 10 mM HEPES pH 7.5 or another appropriate buffer. Example 3. The Production of Fusion Proteins Consisting of the Antiparallel Homodimer Double Coil and the Trimerization Domain To illustrate the feasibility of producing the fusion protein consisting of the antiparallel homodimer coplanet and the trimerization domain, we prepared several DNA constructs (Table 4) ). Table 4 .: Fusion proteins consisting of the antiparallel homodimer double helix and the trimerization domain No. Designation Composition of Construct SEQ ID: No.1 Foldon-Al Foldon APH1 60 2 Foldon-A Foldon APH 62 3 Foldon-B Foldon Bcr 64 4 Al-Foldon APH1 Foldon 66 5 A-Foldon APH Foldon 68 6 B -oldon Bcr Foldon 70 7 A1 -CutA1 APH1 CutA1 72 8 A-CutA1 APH CutAl 74 9 B-CutA1 BCR CutAl 76 10 CutA1 -A1 CutAl APH1 78 11 CutA1 -A CutAl APH 80 12 CutA1 -B CutAl Bcr 82 In addition to altering the orientation of protein domains in the fusion protein, the length of the linker can also be altered, from 2 to 20 amino acid residues, and the His6 peptide tag can be located at the N-terminus or C-terminus of the protein. The production and the isolation of the proteins were carried out as stated in Example 2. Example 4: The preparation of the self-assembling polypeptide material with rewinder of the protein based on dilution, dialysis or temperature annealing The initial experiments to determine the conditions in which the fusion proteins are helical in shape and self-assembling were compared with the protein dilution in the denaturant - 6 M GdnHCl - 1: 100 in buffers with different pH values (citrate buffer pH 2 and pH 3, acetate buffer pH 4, pH 5, phosphate buffer pH 5, pH 6, pH 7, Hepes buffer pH 7.5, Tris buffer pH 8, carbonate buffer pH 9 and pH 10), with different ionic strengths (100 mM , 300 mM, 1 Μ, 2M salt) and with up to 20% organic solvents, such as acetonitrile, DMSO, methanol, or in up to 50% in the case of trifluoroethanol. One could recognize the macroscopic protein compounds in the form of fine precipitate or the more desirable polypeptide aggregate - gel-like structures. To determine the solubility of the fusion protein, the proteinic absorption spectra were recorded. By means of CD spectroscopy, the secondary structures and the thermal stability of the soluble part were determined. When the conditions for self-assembly were determined, dialysis of larger amounts, e.g. of 1 mg of denatured protein to the chosen buffer, the slow removal of the denaturant causing self-assembly of the fusion protein. When the fusion protein contained APH-1, temperature was also an important factor because the homodimer APH-1 is more unstable at higher temperatures. This type of fusion protein was relatively soluble at room temperature. Its CD spectra and melting points corresponded to the measurements in the tetramerization domain p53T. In these examples, slow accumulation was achieved by slow temperature reduction from δΟ'Ό to 4 ° C in ten hours, at a protein concentration of 0.5 mg / ml. Example 5: Recapping of Fusion Proteins and Membrane Preparation The membranes of fusion proteins were prepared by various coil self-annealing methods with the initial dissolution of the fusion proteins in: a) 6M GdnHCl, b) 9M LiBr, c) hexafluoroisopropanol. The sample of isolated protein SEQ ID: 34 was dissolved in 6 M LiBr at a concentration of 0.1 mg / ml. At this concentration, the secondary structure of the protein was destroyed and it was soluble to more than 10 mg / ml. The assembly process of the proteins began with the dissolution of the protein in 9 M LiBr, followed by dilution into the buffer, preferably with native protein state, double helix formation and tetramerization of the p53 domain (buffer 20 mM HEPES at pH 7.5.) The protein slurry was slowly applied to the PVDF membrane filter having a pore size of 0.22 μm (millipore) and a diameter of 13 mm and acted as a support for the linked protein membrane Support filters were followed by rinsing out the solution with 10 ml of 20 mM HEPES buffer pH 7.5 The mechanical resistance of the protein membranes was improved by covalently cross-linking the self-assembled protein material with 10% glutaraldehyde To the application of glutaraldehyde, the membrane was treated with 10 ml of 1 M Tris with a pH-W purged of 8.0, this acted as a block reagent to remove unreacted glutaraldehyde, and with 10 ml of deionized water. The protein membrane was left in the water to prevent it from drying out before being used as a filter. Alternatively, the protein membrane was prepared by incubating the carrier filter with the polypeptide solution at 80 ° C and slowly cooling to 4 ° C as set forth in Example 4. The sample dissolved in hexafluoroisopropanol was added with water up to 5%, the hexafluoroisopropanol was evaporated, leaving a protein mineral in a small amount of water. Example 6: The use of the linked polypeptide material for the separation of large molecules and particles (bacteriophage) M13 bacteriophage and dextran blue were used to test the filtration capabilities of the resulting polypeptide material (membranes). The slurry of bacteriophage M13 was filtered through the filter of polypeptide material on the pad. The phage titer of the initial slurry was determined in a manner well known to those of skill in the art. The initial titer was 3x 1010 pfu / ml, but no phage was detected in the filtrate, indicating that the titer reduction was at least 6 orders of magnitude (less than 104 pfu / ml) (Figure 3A). Dextran blue (0.5 mg / ml) was also filtered and filtration success was determined by absorbance measurement at 625 nm (Figure 3B). 26 SEQUENCE NIST LIST <160 > 84 < 1 70 > Patent version 3.4 < 210 > 1 < 211 > 93 < 2 12 > DNA < 2 13 > synthetic sequence < 2 2C > < 221 > CDS < 222 > (1) - (93} <223> 1 <400> 1 atg aaa cag ctg gaa aaa gaa ctg Met Lys Gin Leu Glu Lys Glu Leu 1 5 caa ctg caa tgg aag get caa get Gin Leu Gin Trp Lys Ala Gin Ala 20 <210> 2 <211> 31 <212> PRT <2 13> synthetic sequence <400> 2 Met Lys; Gin Leu Glu Lys Glu Leu 1 5 Gin Leu i Gin Trp Lys Ala Gin Ala 2C <210> 3 <211> 135 <212> DNA <213> synthetic sequence <22C> <221> CDS <222> (1) - - (135) < 223> APH <400> 3 atg aaa [cag ctg gaa aaa gag ctg Met Lys i Gin Leu Glu Lys Glu Leu 1 5 gca att Cfcla 353 Cc3Cj ctg gca cag Ala Ile: Glu Lys Gin Leu Ala Gin 20 aaa aaa aaa clg gcc cag ctg aaa Lys Lys Lys Leu Ala Gin Leu Lys 35 4C <210> 4 <2 11> 45 <2 1 2> PRT <213> svnthet: see sequence <4 0 0> 4 ca gin gca Ai a 1 0 atc Ile gaa Glu aaa Lys cag Gin ctg gca Leu Ala 1 5 ege Arg 25 aag Lys aaa Lys aag Lys ctg Leu gca Ala 30 cag Gin 4 8 93 Gin Ala 10lg Glu Lys Gin Leu Ala 15 Arg 25 Lys Lys Lys Leu Ala 30 Gin aa Lys cag Gin 10 rta Leu gaa Glu aaa Lys gaa Glu ctg Leu 15 caa Gin ctg Leu 25 ca gin tgg Trp aaa Lys gca Ala cag Gin 3 0 gca Ala cgL Arg aaa Lys aaa Lys ett Leu cag Gin gcc Ala 45 48 96 135 27 • · · · * * φ «· 9 # ··· * · * · · I • · · * * ♦ «·«! I * > * * * 4 »· * Met 1 Lys Gin Leu Glu 5 Lys Glu Leu Lys Gin 10 Leu Glu Lys Glu Leu 15 Gin Ala Ile Glu Lys 20 Gin Leu Ala Gin Leu 25 Gin T rp Lys Ala Gin 30 Ala Arg T, ys Lys Lys Leu Ala Gin T eu Lys Lys Lys Leu Gin Ala 35 40 45 < 210 > 5 ≪ 211 > 111 < 212 > DNA < 213 > synthetic protein < 220 > ≪ 221 > CDS < 222 > (1)., (111) ≪ 223 > BCR < 400 > 5 atg Met 1 gat Aspatt Tie gaa Glu cag Gin 5 gaa Glu ctg T.eu gaa Glu ege Arg gca Al a 1 0 aaa T.ys gca Ala agc Serat Ile egt Arg 15 egt Arg 48 ctg Leu gaa Glu cag Gin gaa Glu 20 gtt Val aat Asn caa Gin gaa Glu egt Arg 25 agc Ser egt Arg atg Met gca Ala did Tyr 30 ctg Leu caa Gin 96 acc Thr ccq Leu ctg Leu 35 gca Ala aaa Lys 111 <21 0>. < 21 1 > < 21 2 > < 21 3 > 6 37 PRT synthetic. Protein < 4 0 0 > 6 Met 1 Asp Ile Glu Gin 5 Glu Leu Glu Arg Ala 10 Lys Ala Ser Ile Arg 15 Arg Leu Glu Gin Glu 20 Va 1 Asn Gin Glu Arg 2 5 Ser Arg Met. Ala Tyr 30 Leu Gin Thr Leu Leu Ala l.ys 35 < 210 > 7 < 211 > 84 ≪ 212 > DNA < 213 > synthetic protein < 220 > ≪ 221 > CDS < 22 2 > (1). (84) < 22 3 > Folcon < 4C0 > 7 atg ggt tat att. cct gaa gca cca egt gat ggc caa gca tue gtt egt 48 Met Gly Tyr Ile Pro Glu Ala Pro Arg Asp Gly Gin Ala Tyr Val Arg 15 10 15 aaa gac ggt gaa egg gtc ctg ctg tcc act ttc ctg Lys Asp Gly Glu Trp Val Leu Leu Ser Thr Phe Leu 20 25 84 28 28 • «• ¢. ≪ 210 > 8 < 211 > 28 < 2 1 2 > PRT < 213 > synthetic protein < 400 > 8th Met Gly Tyr Ile Pro Glu Ala Pro Arg Asp Gly Gin Ala Tyr Val Arg 1 5 10 '15 lys Asp Gly Glu Trp Val Leu Leu Ser Thr Phe Leu 20 25 < 210 > 9 < 211 > 96 ≪ 212 > DNA < 213 > synthetic protein < 220 > ≪ 221 > CDS < 222 > (1) .. (96) ≪ 223 > p53T < 40 0 > 9 atg gaa tac ttt acc c lg cac atc cg L ggc cg L gag cgL t LC gag agl 48 Ket 1 Glu Tyr Phe Thr 5 Leu His Ile Arg Gly 10 Arg Glu Arg Phe Glu 15 MeL ttc cgc gaa ctg aat gaa qcc ctg gaa ctg aas gat gct ca gca ggt 96 Phe Arg Glu Leu 20 Asn Glu Ala Leu Glu 25 Leu Lys Asp Ala C-In 30 Ala Gly < 210 > 10 < 211 > 32 ≪ 212 > PRT < 213 > synthetic protein < 4 0 0 > 10 Met Glu Tyr Phe Your Leu His Ile Arg Gly Arg Glu Arg Phe Glu Met 1 5 10 15 Phe Arg Glu Leu Asn Glu Ala Leu Giu Leu Lys Asp Ala Gin Ala Gly 20 25 3C < 210 > 11 < 211 > 285 ≪ 212 > DNA < 213 > synthetic protein < 2.2 0 > < 221 > CDS < 222 > (1) .. (285) < 223 > TetShak < 40 0 > 11 atg gaa egt gtt gtt aLl aat gtg agc ggt ctg egt ttt gaa acc cag 48 Met 1 Glu Arg Val Vc 1 5 Ile Asn Val Ser Gly 10 Leu Arg Phe Giu Thr 15 Gin ctg aaa acc ctg S ri tl cag ttc ccg gat acc ctg ctg ggt aat ccg cag 96 Leu Lys her Leu 20 Asn Gin Phe Pro Asp 25 Thr Leu Leu Gly Asn 30 Pro Gin aas egt aat egt. t-t tt cg ccg ctg cgc aac gaa act ttt ttt gat 144 Lys Arg Asn Arg Tyr Tvr Asp Pro Leu Arg Asn Glu Tyr Phe Phe Asp 29 35 40 4 5 cgc Arg aat Asn 50 egz Arg ccg Pro agc Ser ttt Phe gat Asp 55 gcc Ala att Ile ctg Leu did Tyr ttt Phe 60 did Tyr cag Gin agc Ser ggt Gly 192 ggt Gly 65 egt Arg ctg Leu egt Arg egt Arg ccg Pro 70 gtt Val aat Asn gtt Val ccg Pro ctg Leu 75 gat Asp gtg Val ttt Phe agc Ser gaa Glu 80 240 gag Glu ate 1 le aaa Lys Ltt Phe tat Tyr 85 gaa Glu ctg Leu gqc Gly gaa Glu aac Asn 90 gcc Ala ttt Phe gaa Glu cgc Arg tat Tyr 9 5 285 < 21 0 >; < 2 11 > < 2 1 2 > < 2 1 3 > 1 2 95 PRT synthetic; Protei i Ί < 400 > 12 Mer. 1 Gl.u Arg Val Val 5 1 le Asn Val Ser Gly 10 Leu Arg Phe Glu Thr 15 Gin Leu Lys Thr Leu 20 Asn Gin Phe Pro Asp 25 Leu Leu Gly Asn 30 Pro Gin Ly s Arg Asn 35 Arg Tyr Tyr Asp Pro 40 Leu Arg Asn Gl u Tyr 45 Phe Phe Asp Arg Asn 50 Arg Pro Ser Phe Asp 55 Ala Ile Leu Tyr Phe 60 Tyr Gl Ser Gly Glv 65 Arg Le Arg Arg Pro 70 val Asn Val Pro Leu 75 Asp Val Phe Ser Glu 80 Glu Ile Lys Phe Tyr 85 Glu Leu Gly Glu Asn 90 Ala Phe Glu Arg Tyr 9 5 < 210 > < 211 > ≪ 212 > ≪ 213 > 13 21C UNA synthetic i ProLein < 220 > < 221 > ≪ 222 > ≪ 223 > CDS (1) .. Verof (21C) < 40C > atg aca Met Thr 1 13 cg Pro gat Asp tgt Cys 5 gtt Val acc Thr gcg G'-y aaa lys gtg Val 10 gaa Glu Lat Tyr acc Thr aaa Lys did Tyr 15 aac Asn 48 g Asp gac Asp gat Asp t ac Thr 20 tz t Phe acc Thrgtg Val 33ö Lys gtg Val 25 ggt Gly gat ASp d itä. Lys gaa Glu ctg Leu 30 ttt Phe acc Thr 9 6 aat 7 sn egt Arg Lgg Trp 3 5 3 d. L Asn c zq Leu Cae Gin agc Ser ctg Leu 40 ctg Leu ctg Leu agc Ser gca Ala cag Gin 45 at Ile aeg Thr g Gly 144 atg Met acc Thr 5 0 gtt Val Thr ar I t add Lys dC C Thr 55 aat Asn gca Ala tgc Cys c, at His aat Asn 60 ggt Gly ggt Gly ggc Gly ttt Phe 192 agc gaa gtt dt trt egt 210 30 • * • m Ser Glu Val Ile Phe Arg 65 70 < 210 > 14 < 211 > 70 ≪ 212 > PRT < 213 > synthetic protein < 400 > 14 Met 1 Thr Pro Asp Cys 5 Val Thr Gly Lys Val 10 Glu Tyr Thr Lys Tyr 15 Asn Asp Asp Asp Thr 20 Phe Thr Val Lys Val 25 Gly Asp Tys Glu Leu 30 Phe Thr Asn Arg Trp 35 Asn _, eu Gin Ser Leu 40 Leu Leu Ser Ala Gin 45 Ile Thr Gl y Met Thr 50 Val Thr Ile Lys Thr 55 Asn Ala Cys His Asn 60 Gly Gly Gly Phe Ser Glu Val Ile Phe Arg 65 70 < 210 > 15 < 211 > 99 ≪ 212 > DNA < 213 > synthetic origin < 220 > < 2 21 > > CDS < 222 > ¢ 1) .. (99) < 223 > PI < 400 > 15 agc Ser 1 cca Pro gaa C-lu even. Asp gaa Glu 5 at.t Ile cag Gin g ca A '. a ctg Leu gaa Gl u 10 gaa Glu gaa Glu aat Asn gct Ala caa Gin 15 ctg Leu 48 gaa Glu cag Gin gaa Glu aac Asn 20 gcg Ala gcg Ala ctq Leu gaa Glu gaa Glu 25 gaa Glu atc Ile gca Al a cag Eq Le ctg Leu 30 gaa Glu Lac Tyr 96 ggc 99 Gly < 210 > 16 < 211 > 33 ≪ 212 > PRT < 213 > synthetic origin < 4 00 > 16 Ser Pro Glu Asp Glu Ile Gin Ala Leu Glu Glu Glu Asn Ala Gin Leu 1 5 10 15 Glu Gin Glu Asn Ala Ala Leu Glu Glu Glu Ile Ala Gin Leu Glu Tyr 20 25 30 Gly < 210 > 17 < 211 > 99 ≪ 212 > DNA 48 31 < 213 > synthetic origin < 22C > ≪ 221 > CDS < 222 > (1) . , (39) < 223 > P2 < 40C > 17 t. c L cca gaa gac aaa atc gca cag ctg aaa gaa aag aac gcc ctg Ser 1 Pro Gl u Asp Lys 5 Ile Ala Gin Leu LVS 10 Glu Lys Asn Ala Ala 15 Leu aaa gaa aaq aaa caa cag ctg aa q gag aaa atc caa gca ctg aaa did Lys Glu Lys Asn 20 C-ln Gin Leu Lys Glu 25 Lys Gin a: a Leu 30 Lys Tyr 96 99 ggc Gly < 210 > 18 < 211 > 33 ≪ 212 > PRT < 213 > Synthetic origin < 4 00 > 18 Ser Pro Glu Asp Lysis Ala Gin Leu Lys Glu lys Asn Ala Ala Leu 15 10 15 Lys Glu Lys Asn Gin Gin Leu Lys Glu Lys Ile Gin A_a Leu Lys Tyr 20 25 30 Gl y < 210 > 19 < 211 > 99 ≪ 212 > DNA < 213 > synthetic origin < 22C > ≪ 221 > CDS < 222 > (1) .. (99) ≪ 223 > GCN < 40C > 19 egt atg aaa cag ctg gaa gat. aaa atc gag gag ctg ctg tcc d c g atc Arg 1 Met Lys Gin Leu 5 Glu Asp Lys G1 u 10 Glu Leu Leu Ser Lys 15: le * L.dC cac ctg gaa aac gaa az t. get cgc ctg aa aag ctg att qgt gaa Tyr H: .s Leu Glu 20 Asn Glu 71 e Ala Arg 25 Leu Lys Lys Leu Ile 3C Gly Gl u cgc Arg 8 96 < 210 > 20 < 211 > 33 ≪ 212 > PRT < 213 > synthetic origin < 4C0 > 20 Leu Leu Ser Lys Tie Arg Met Lys Gin Leu Glu Asp Lys tle Glu Gnl 99 32 • «# · 1 5 10 15 Tyr His Leu Glu Asn Glu Ile Al «! Arg Leu Lys I.ys Leu Ile Gly Glu 20 25 30 Arg < 210 > 21 < 211 > 99 ≪ 2l2 > DNA < 213 > synthetic origin < 220 > ≪ 221 > CDS < 2 2 2 > (1) .. (99) < 2 2 3 > P3 < 400 > 21 tcc ccg Ser Pro 1 gaa Glu gaL Asp gag Glu 5 atc Ile cag Gin caa G'n ccg Leu gaa Glu 10 gaa Glu gaa Glu atc Ile gcc Ala cag Gin 15 ctg Leu 48 gaa cag Glu Gin aaa lys aac Asn 2C gca Ala g Ala ctg Leu aaa T. gag Glu 25 aaa Lys aac Asn cag Gin geg Ala ctg Leu 30 aaa Lys tac Tyr 96 ggL Gl y 99 < 210 > < 211 > < 21 2 > ≪ 213 > 22 33 PRT has the same origin as < 400 > 22 Ser Pro 1 Glu Asp Glu 5 ile Gln Gin _eu Glu 10 Glu Glu Ile Ala Gin 15 Leu Glu Gln Lys Asn A 1 a Ala Leu Lys Gl u Lys Asn Gl ala Leu Lys Tyr 20 25 30 Gly < 210 > 23 < 211 > 99 ≪ 212 > DNA < 213 > synthetic origin < 22 0 > ≪ 22l > CDS < 22 2 > (1) .. (99) < 223 > P4 < 400 > 23 agc Ser 1 ccg Pr o Qari Glu gar Asp 3 3 ci I VS 5 " ütt Id Id Id Id Id Id Id Id G G G G Le Le Le ys ys ys ys ys ys ys ys ys ys IC IC IC IC IC IC in in in in in in in in in in in in in in in in in in in in 48 48 48 48 48 48 48 48 48 48 48 48 48 48 48 c c c c c c c Glu gag Glu 2 5 gaa Glu aac Asn gcc A! a gca A1 a ctg Leu 3C gaa Glu ca t Tyr 96 ggt. 99 33 Gly < 210 > 24 < 211 > 33 ≪ 212 > PRT < 213 > synthetic origin < 400 > 24 Ser Pro Glu Asp Lys 2 le Ala Gin Leu 1 5 Lys Gin Glu Asn Gin Gin Leu Glu Glu 20 25 Gly < 210 > 25 < 211 > 99 < 212 > DNA < 213 > syntl isisic origin < 220 > < 22 1 > CDS < 222 > (1) . , (99) < 223 > P5 < 400 > 25 tct cct ga gac gaa aac gca gct c Lg Ser Pro Gl u Asp Glu Asn Ala Ala Leu 1 5 aaa caa aag aac gcg gca ctg aaa gaa Lys GJ n Lys Asn Ala A 1 a Leu Lys Glu 20 2 5 ggc Gl y < 210 > 26 < 211 > 3 3 < 212 > PRT < 2 13 > synthetic origin. ≪ 400 > 26 Ser Pro Glu Asp Glu Asn Ala Ala Leu 1 5 Lys Gin L.ys Asn Ala Ala Leu Lys Glu 20 25 Gly gaa gag aaa att gca caa ctg 48 Glu Glu Lys Ile Ala Gin Leu 10 15 gaa ata caa gca ctg gaa tat 96 Glu Ile Gin Ala Leu Glu Tyr 3 0 99 Lys Gin Lys Ile Gin Ala Leu IC 15 Glu Asn Ala Ala Leu Glu Tyr 30 Glu Glu Lys Ile Ala Gin Leu 10 15 Glu Tie Gin Ala Leu Glu Tyr 30 < 21 0 > 27 < 211 > 99 < 21 2 > DNA < 21 3 > synthetic origin < 220 > < 2 21 > > CDS < 222 > (1) - - (99) < 2 2 3 > P6 48 34 < 4 0 0 > agc ccg Ser Pro 1 27 gaa Glu gat Asp aa 3. Lys 5 aac Asn gcc Ala gct Ala ctg Leu aaa Lys 10 gag Glu gaa Glu atc Ile cag Gin gcg Ala 15 ctq Leu gaa Glu gaa Glu gaa Glu aac Asn 2 0 cag Gin gcr Ala ctg Leu gaa Glu gag Glu 25 aaa Lys atc Ile gca Ala cag Gin ctg Leu 30 aaa Lys Lat Tyr 96 99 ggt Gly < 210 > 28 < 211 > 33 ≪ 212 > PRT < 213 > synthetic origin < 40C > 28 Ser Pro Glu Asp Lys Asn Ala Ala Leu Lys Glu Glu Ile Gin Ala Leu 15 10 15 Glu Glu Glu Asn Gin Ala Leu Glu Glu Lys Ile Ala Gin Leu Lys Tyr 20 25 30 Gly < 210 > 29 < 211 > 99 < 21 2 > DNA < 21 3 > synthetic origin < 22Q > < 2 21 > > CDS < 2 2 2 > (1) .. (99) < 2 2 3 > P7 < 400 > 29 Lee ccg gag gat. gag ar.c cag gcg ctg gaa gaa aag aac gcc cag ctq Ser 1 Pro Glu Asp Glu 5 Ile Gin Ala Leu Glu 10 Glu Lys Asn Ala Gin 15 Leu aag cag gaa atr. gcg gca ctg gaa gag aag aac cag gcc ctg aag Lac Lys Gin Glu I le 20 Ala Ala Leu Glu Glu 25 Lys Asn Gin Ala Leu 30 Lys Tyr 48 96 99 ggt Gly < 210 > 30 < 211 > 33 ≪ 212 > PRT < 213 > synthetic origin < 4 00 > 30 Ser Pro Glu Asp Glu Ile Gin Ala Leu Glu Glu Lys Asn Ala Gin Leu 1 5 10 15 Lys Gin Glu Ile Ala Ala Leu Glu Glu Lys Asn Gin Ala Leu Lvs Tyr 20 25 30 Gly 35 < 210 > 31 < 211 > 39 ≪ 212 > DNA < 213 > synthetic origin < 220 > ≪ 221 > CD3 < 222 > (1) .. (99) < 223 > P8 < 4C 0 > 31 tcc ccg gaa gac aaa atc gen cag ctg aaa gaa gaa aac cag cag ctg 48 Ser 1 Pro Glu Asp Lys 5 Ile Ala Gin Leu Lys 10 Glu Glu Asn Gin Gin 15 Leu gaa caa aag att cag gcc ctg aag gag gaa aac gca get ctg gaa Lac 96 Glu Gin; .ys Ile 20 Gln Ala Leu T.ys Glu 25 Glu Asn Ala Ala Leu 30 Gl u Tyr ggc 99 Gly < 210 > 32 < 211 > 33 ≪ 212 > PRT < 213 > synthetic origin < 4C0 > 32 Ser 1 Pr o Glu Asp Lys 5 Tie Ala Gin Leu Lys 10 Glu Glu Asn Gin Gin 15 Leu Glu Gl n Lys Tie 20 Gin Ala Leu Lys Glu 25 Glu Asn Ala Ala Leu 30 Glu Tyr Gly < 210 > 33 < 211 > 192 ≪ 212 > DNA < 213 > synthetic origin < 22C > ≪ 221 > CDS < 222 > (1). (192) < 223 > Dimtetra Al < 40C > 33 atg Met 1 aaa Lys cag Gin ctg Leu gaa Glu 5 aaa Lys gaa Gl u ctg Leu caa Gin gca Ai a 10 atc T le gaa Glu aaa Lys cag Gl i ctg I eu 15 gca Ala 48 caa Gl ctg Leu caa Eq n tgg l'rp 20 * aag Lys get Ala caa Gin get Ala ege Arg 25 aag Lys aaa Lys aag Lys ctg Leu gca Ala 3 0 cag Gin tcc Ser 96 ggc Gly gaa G.Lu tac Tyr 35 r.tn Phe acc Thr eng Le cac H.is Q LC 71 e 4C egt Arg ggc Glyggt Arg gag Gl u egt Arg 4 5 ttc Phe gag Glu atq Met 144 ttc Phe g e Arg gaa Glu ctg Leu aat Asn gaa Gl u gcc Ala ctg Leu gaa G ·. u ctg I.eu aaa -ys gat Asp get Al a caa Gin gca Ala ggt Gly 192 50 55 60 36 < 2 1 0 > 34 < 2ll > 64 ≪ 212 > PRT < 213 > synthetic origin < 4 0 0 > 34 Met 1 Ly s Gin Leu Glu 5 T ys Glu Leu Gin Ala 10 Ile Glu Lys Gin Leu 15 Ala Gin Leu Gln Trp 20 Lys Ala Gin Ala Arg 25 Lys Lys Lys Leu Ala 30 Gin Ser Gly Glu Tyr 35 Phe Thr Leu His I le 40 Arg Gly Arg Glu Arg 45 Phe Glu Met Phe Arg 50 Glu Leu Asn G1 u Ala 5 5 Leu Glu Leu Lys ASp 60 Ala Gin Ala Gly < 210 > 35 < 211 > 198 ≪ 212 > DNA < 213 > synthetic origin < 22C > ≪ 2 2 1 > CDS < 27 2 > (1) .. (198) ≪ 223 > Dimtetra-Al GSGS < 400 > 35 atg Met 1 3 g cl Lys cag Gin ctg Leu gaa Glu 5 aaa Lys gaa Glu ctg Leu caa Gin gca Al a 10 atc Tie gaa Glu aaa Lys cag Gin ctg Leu 15 gca Ala 48 caa Gin ctg Leu caa Gin tgg Trp 20 Aag Lys get Ala caa Gin get Ala cgc Arg 25 aag Lys aaa Lys aag Lys ctg Leu gca Ala 30 cag Gin tcc Ser 96 ggg Gly tcc Ser ggc Gly 35 gaa Glu tac Tyr LLL Phe acc Thr ctg Leu 40 cac Hls atc Ile egt Arg ggg Glyggt Arg 45 gag Glu cqt Arg ttc Phe 144 gag Glu atg Met 50 t. tc Phe cgc Arg gaa Glu ctg Leu aat Asn 55 gaa Glu gcc Ala ctg Leu gaa Glu ctg Leu 50 aaa Lys gat Asp get Ala caa Gl n 1 92 gca Al a 65 ggt Gly 198 < 210 > ≪ 211 > ≪ 212 > < 2 1 3 > 36 66 PRT synthetics; see origin < 4 C 0 > 3 6 Met 1 Lys Gln Leu Glu 5 Lys Glu Leu Gin Ala 1 Tie Glu Lys Gin Leu 15 Al a Gin Leu Gln Trp 20 Lys Ala Gin A 1 a Arg 2 5 Lys Lys Lys Leu Ala 30 Gin Ser Gl y Ser Gly Glu Tyr Phe Thr Leu H: s Tie Arg Gl V Arg Glu Arg Phe 35 40 45 37 37 Glu Glu Met Phe Arg Glu 50 Leu Asn Glu Ala Leu 55 Leu Lys Asp Ala Gin 60 Ala G1 y 6 5 < 210 > 37 < 211 > 234 ≪ 212 > DNA < 213 > synthetic origin < 220 > ≪ 221 > CDS < 222 > (1) .. (234) ≪ 223 > DimLeLra-A < 400 > 37 atg Met 1 S cD <3 Lys cag Gin ctg Leu gaa Glu 5 aaa Lys gag Glu ct.g Leu aaa Lys cag Gin 10 tza Leu gaa Glu aaa Lys gaa Glu ctg Leu 15 caa Gl n 48 gca Ala att Ile gaa Glu ä3ä Lys 20 cag Gin ctg Leu gca Ala cag Gln ctg Leu 2 5 ca gin tgg Trp aaa Lys gca Ala cag Gin 30 gca Ala egt Arg 96 äää Lys öäd Lys aaa Lys 35 ctg Leu gcc Ala cag Gin ctg Leu aaa Lys 40 aaa Lys aaa Lys ett Leu cag Gin gcc Ala 45 tcc Ser ggc Gly gaa Glu 1 44 tac Tyr ttt Phe 50 acc Your ctg Leu cac His at.c Ile Arg Arg 55 ggg Gly gt Arg gag Glu egt Arg ttc Phe 60 gag Glu atg Met tue Phe ege Arg 192 gaa Glu 6 5 ctg Leu aat Asn gaa Glu gcc Ala ctg Leu 70 gaa Glu ctg Leu aaa Lys ga t Asp get Ala 75 caa Gl n gca Al a ggt Gly 234 < 210 > ≪ 211 > ≪ 212 > < 2 13 > > 38 78 PRT syn the Li see origin < 4 00 > 38 Met 1 Lys Gin Leu Glu 5 Lys Glu Leu JYS Gin 1C Leu Glu Lys Glu Leu 15 Gin Al a Ile Glu Lys 20 Glu Leu Ala Glu Leu 2 5 Gln Trp Lys A1 a Gl 30 Ala Arg Lys lys Lys 35 Leu Ala Gin Leu Lys 40 -ys Lys Leu Gin Ala 45 Ser Gly Glu Tyr Phe 5C Thr Leu Hi s Tie Arg 55 Gly Arg Glu Arg Phe 60 Glu MeL Phe Arg Gl u 65 Leu Asn Glu Ala Leu 7C Glu Leu Lys Asp Ala 75 gin Ala Gly < 210 > 39 ≪ 211 > 21C ≪ 212 > DNA < 213 > synthetic origin < 220 > < 2 21 > > CDS < 222 > (1) .. (210) 38 • * · · · • * * * 38 • * · · · • * * * < 223 > Dimetra-B < 400 > 39 atg Met 1 gat Asp att Ile gaa Glu cag Gin 5 gaa Glu ctg Leu gaa Glu ege Arg gca Ala io cl cl c3 Lys gca Ala agc Serat 1 le egt Arg 15 egt Arg 48 ctg Leu gaa Glu cag Gin gaa Glu 20 gtta aqaaa ga gaa glu egt Arg 25 agc Ser egt Arg atg Met gca Al a tat Tyr 3 O ctg heu caa Gin 96 acc Thru ctg Leu ctg Leu 35 gca Ala aaa Lys tcc Ser ggc Gly gaa Glu 40 tac Tyr ttt Phe Acc Thr ctg Leu cac His 45 atc Ile egt Arg ggc Gly 14 4 egt Arg gag Glu 50 egt Arg ttc Phe gag Glu atg Met ttc Phe 55 ege Arg gaa Glu ctg Leu aat Asn gaa Glu 60 gcc Aia ctg T.eu gaa Glu ctg Leu 192 aaa Lys 65 gat Asp get Ala caa Gin gca Ala ggt Gly 70 210 < 210 > 40 < 211 > 70 ≪ 212 > PRT < 213 > synthetic origin < 4 00 > 4 0 Met 1 Asp Ile Glu Gin 5 Glu Leu Glu Arg Ala 10 Lys Ala Ser Ile Arg 1 5 Arg Leu Glu Gin Glu 20 Val Asn Gl n Gl u Arg 25 Ser Arg Met Ala Tyr 30 Leu Gin Thr Leu Leu 3 5 Ala Lys Ser Gly Gl u 40 Tyr Phe Thr Leu His 45 Ile Arg Gly Arg Glu 5 0 Arg Phe Gl u Met Phe 55 Arg Glu Leu Asn Glu 6 C Ala Leu Glu Leu Lvs Asp Ala Gin Ala Gly 60 7C < 210 > 41 < 211 > 195 ≪ 212 > DNA < 213 > synthetic origin < 220 > ≪ 221 > CDS < 222 > (1). (195) < 223 > Tetradim Al < 4 0 0 > 41 atg gaa tac ttt acc ctg cac atc cg L ggc cg t gag egt ttc gag atg Met 1 Gl u Tyr Phe Thr 5 Leu His Ile Arg Gly 10 Arg Gl u Arg Phe Glu 15 Met ttc ege gaa ctg aat gaa gcc ctg cl ctg aaa gat get caa gca gqt Phe Arg Glu Leu 20 Asn Glu Ala Leu Glu 25 Leu Lys Asp Ala Gin 3 0 Ala Glv tcc ggc atg cag ctg gaa aaa gaa ctg caa gca atc gaa aaa cag Ser Gly Met 3 5 Lys Gin Leu Glu Lys 40 Glu Leu Gin Ala Ile 45 Glu 1 ys Gin 48 96 144 192 39 192 39 ctg gca Leu Ala 50 Ca gin cLg Leu caa Gin tgg Trp aag Lys 55 get Ala caa Gl u get Ala cgc Arg aag Lvs 60 3 c) 3 Lysag Lys ctg Leu gca Ala cag Gin 65 < 210 > ≪ 211 > ≪ 212 > ≪ 213 > 42 65 FRT synthe- tic origin < 400 > 42 Met Glu 1 Tyr Phe Thr 5 Leu His Ile Arg Gly 10 Arg Glu Arg Phe Glu 15 Met Phe Arg Glu Leu 20 Asn Glu Ala Leu Glu 25 Leu Lys Asp Ala Gin 30 Ala Gly Ser Gly Met 35 Lys Gin Leu Glu Lys 40 Glu Leu Gin Ala Ile 45 Glu Lys Gin Leu Ala 50 Gin Leu Gin Trp Lys 55 Ala Gin Ala Arg Lys 60 Lys Lys Leu Ala Gin 6 5 < 21 0 > < 21 1 > < 21 2 > < 21 3 > 43,237 DNA syntheris origin < 220 > ≪ 221 > ≪ 222 > ≪ 223 > CDS (1). , (237 tetra-) A < 4 0 0 > atg gaa Met Glu 1 43 tac Tyr ILL Phe acc Thr 5 c.tg Leu cac. His acc Ile egt Arg ggc Gly 10 egr Arg g = g Glu egt Arg ttc Phe gag Gl u 1 5 atg Met ttc cgc Phe Arg gaa Glu ctg Leu 20 aat Asn gaa Glu qcc Ala erg Leu gaa Glu 25 ctg Leu ciclcä. Lys gat Asp get Ala caa Gin 30 gca Ala ggt Gly Lee ggc Ser Gly Met 35 aaa Lys cag Gin ctg Leu gaa Glu aaa Lys 40 gag Glu erg Leu aaa Lys cag Gin tta Leu 45 gaa Glu aaa Lys gaa Glu ctg caa Leu Gin 5C gca Ala att I _e gaa Glu äää Lys cag Gin 55 erg Le gca Ala cag C-ln ctq Leu caa Gin 60 tgg Trp äää Lys gca Ala cag Gin qca egt Ala Arg clcLH Ly s aaa Lys aaa Lys ctg Leu gcc Ala cag Gln ctg Leu aaa Lys aaa Lys aaa T.ys ett Leu cag Gin gcc Al a 65 70 75 195 48 96 144 192 23 7 < 21G > 44 < 211 > 79 ≪ 212 > PRT < 213 > synthetic origin 40 < 400 > 44 Met Glu Tyr 1 Phe Thr 5 Leu His Ile Arg Gly 10 Arg Glu Arg Phe Glu 15 Met Phe Arg Glu Leu 20 Asn Glu Ala Leu Glu 25 Leu Lys Asp Ala Gin 30 Ala Gly Ser Gly Met 35 Lys Gin Leu Glu Lys 40 Glu Leu Lys Gin Leu 4 5 Glu Lys Glu Leu Gin Ala 5 0 Ile Glu Lys Gin 55 Leu Al a Glu Leu Gin 60 Trp Lys Ala Gin Ala Arg Lys 65 Lys Lys Leu 70 Ala Gin Leu Lys Lys 75 Lys Leu Gin Ala < 210 > 45 < 211 > 210 < 212 > DNA < 213 > synthetic origin < 220 > ≪ 221 > CDS < 222 > (1) .. (210) <223> tetradiir B <4 0 0> 4 5 atg gaa tac Met Glu Tyr 1 ttt Phe acc Thr 5 ctg Leu cac His atc Ile egt Arg ggc Gly 10 egt Arg gag Glu egt Arg ttc Phe gag Glu 15 atg Met ttc cqc gaa Phe Arg Glu erg Leu 20 aat Asn gaa Glu gcc Ala ctg Leu gaa Glu 25 ctg Leu aaa Lys gat Asp gct Ala caa Gl n 30 gca Ala ggt Gly icc ggc gat Ser Gly Asp 35 att Tie gaa Glu cag Gin gaa Glu ctg Leu 40 gaa Glu ege Arg gca A_a aaa Lys gca Ala 45 agc Serat Ile egt Arg egt ctg gaa Arg Leu Glu 50 cag Gin gaa Glu gtr Val aat Asn 55 Cae Gin gaa Glu cqt Arg agc Ser egt Arg 60 atg Met gca Ala did Tyr ctg Leu caa acc ctg Gin Thr Leu 6 5 ctg Leu gca Ala aaa Lys 70 <210> 46 <211> 70 <212> PRT < 213> synthetic see1 <4 0 0> 46 Met Glu Tyr 1 Phe Thr 5 Leu His 1 le Arg Gly 10 Arg Glu Arg Phe Glu 15 Met Phe Arg Glu Leu 20 Asn G u Ala Leu Glu 25 Leu Lys Asp Ala Gin 30 Ala Gly Ser Gly Asp 3 5 Ile Glu G " n Glu Leu 40 Glu Arg Ala Lys Ala 45 Ser Ile Arg Arg Leu Glu 5C Gin Glu Val Asn 55 Gin Glu Arg Ser Arg 60 Met Ala Tyr Leu 48 96 144 192 210 41 • • • • # * • * Gin Thr Leu Leu Ala Lys 65 70 < 210 > 47 < 211 > 384 ≪ 212 > DNA < 213 > synthetic origin < 220 > ≪ 221 > CDS < 222 > (1) .. (384) < 223 > Al-Tshak gca atc gaa aaa cag ctg gca 48 A! a 10 Ile Glu lys Glu Leu 15 Ala aag aaaag ctg gca cag tcc 96 Lys Lys Lys Leu Ala 30 Gin Ser agc ggt ctg egt. ttt gaa acc 144 Ser Gly Leu Arg 45 Phe Glu Thr gat acc ctg cgt ggt aat ccg 1 92 Asp Thr Leu 60 Leu Gly Asn Pro ctg cgc aac gaa Lat ttt ttt 240 Leu Arg 75 Asn Glu Tyr Phe Phe 80 att ctg tat ttt did cag agc 288 Ile 90 Leu Tyr Phe Tyr Gln 95 Ser gtt ccg ctg gat gtg ttt agc 336 Val Pro Leu Asp Val 110 Phe Ser gaa aac gcc ttt gaa cgc did 384 Glu Asn Ala Phe 125 Glu Arg Tyr Ala 10 TI e Gl u Lys Gin Leu 15 Al a Lys Lys Lys Leu Ala 3 0 Gin Ser Ser Gly Leu Arg 4 5 Phe Glu Thr Asp Thr Leu 60 Leu Gly Asn Pro < 400 > 47 atg Met 1 aa Lys cag Gin ctg Leu gaa Glu 5 aaa Lys gaa Glu ctg Leu caa Gin caa Gin ctg Leu caa Gin tgg Trp 20 aag Lys get Ala caa Gin get Ala cgc Arg 2 5 ggg Glv atg xet gaa Glu 35 egt Arg gtt Val gtt Valat I le aat Asn 40 gtg Val cag Gin ctg Leu 50 aaa Lys acc Thr ctg Leu eal Asn cag Gin 5 5 ttc Phe ccg Pro cag Gin 65 aaa Lys egt Arg aat Asn egt Arg tat Tyr 7C Lat Tyr gat Asp ccg Pro gat Asp cgc Arg aat Asn egt Arg ccg Pro 85 agc Ser ttt Phe qa L Asp gcc Ala ggt C-ly ggt. Gly lgt Arg ctg Leu 1 00 egt Arg cg L Arg ccg Pro get Valate Asn 105 gaa Glu gag Glu ctC Ile 115 aaa Lys ttc Phe tat Tyr gaa Glu ctg T.eu 1 20 ggc Gl y < 210 > ≪ 211 > ≪ 212 > ≪ 213 > 48 128 PRT synthetic origin t < 400 > 48 Met 1 Lys Gin Leu Glu 5 Lys Glu Leu Gin Gin Leu Gin Trp 20 Lys Ala Gin Ala Arg 25 Gly Met Glu 3 5 Arg Val Val 1 le Asn 40 Va 1 Gin Leu 50 Lys Thr Leu Asn Gin 55 Phe P ro Gin 6 5 Lys Arg Asn Arg Tyr 70 Tyr Asp Pro Leu Arg 75 Asn Glu Tyr Phe Phe 80 42 Asp Arg Asn Arg Pro 85 Ser Phe Asp Gly Gly Arg Leu 100 Arg Arg Pro Val Glu Glu Ile 115 Lys Phe Tyr Glu hay 1 20 Ala Ile 9C Leu Tyr Phe Tyr Gln 95 Ser Asn 105 Val Pro Leu Asp Val 110 Phe Ser Gly Glu Asn Ala Phe 125 Glu Arg Tyr < 210 > 49 < 211 > 426 ≪ 212 > DNA < 213 > synthetic origin < 220 > ≪ 221 > COS < 222 > (1 ) . , (426) < 223 > A-Tshak < 40 0 > 49 atg aaa cag ctg gaa 3.0.3. gag ctg Met 1 Lys Gin Leu Glu 5 Lys Glu Leu gca att gaa aaa cag ctg gca cag Ala I le Glu Lys 20 Gin Leu Ala Gla aaaaaaaa ctg gcc cag ctg 3 da Lys Lys Lvs 35 Leu Ala Gin Leu Lys 4 C gaa egt gtt atl aat gtg agc Gl u Ara so val Val Asn Val 55 Ser sas acc ctg aat cag ctc ccg gat Lys 65 Thr Leu Asn Gin Phe 70 Pro Asp egt aat egt tat tat gat ccg erg Arg Asn Arg Tyr Tyr 85 Asp Pro Leu 31 egt ccg agc ttt gar gcc att Asn Arg Pro Ser 100 Phe Asp Al a 7 le egt ctg egt egt ccg gtt aat gtt Arg Leu Arg 115 Arg Pro Val Asn val 120 atc aaa t nt- L at gaa ctg ggc gaa 11 e I.ys 13 0 Phe Tyr Glu I.eu Gly 1 35 Glu < 21 0 > 50 < 2 1 1 > 142 ≪ 212 > PRT < 213 > synthetic origin < 4 0 0 > 50 Met. Lys Gin hu Glu Lys Glu Leu 1 5 48 96 144 192 240 288 336 .384 aaa cag tta gaa aaa gaa ctg caa Lys Gin Leu Glu Lys Glu Leu Gin 10 15 ctg caa tgg aaa gca cag gca egt. Leu Gin Trp Lys Al a Gin Al a Arg 25 3 0 333 aaa ett cag gcc tcc ggc atg Lys Lys Leu Gin Ala Ser Gly Met 45 ggt. ccg cgl ttt gaa acc cag ctg G] y Leu Arg Phe Glu Thr Gin Leu 60 acc erg ctg gqt aat ccg cag aaa Thr Leu Leu Gly Asn Pro Gin Lys 75 80 ege aac gaa did ttt ttt gat ege Arg Asn Glu Tyr Phe Phe Asp Arg 90 9 5 c Lg did t.tt did cag agc ggt ggl Leu Tyr Phe Tyr Gin Ser Gl y Gly 105 1 10 ccg ctg gat gtg ttt agc gaa gag Pro Leu Asp Val Phe Sei Glu Glu 1 25 aac gcc ttt gaa ege did Asn Ala Phe Glu Arg Tyr 140 Lyn Gin Leu Glu Lys Glu ee Gin 1 0 15 426 A _L a 1 le Glu Lys 20 Gin Leu Ala Gin Leu 25 Lys Lys Lys 35 Leu Ala Gin Leu Lys 40 Lys Glu Arg 50 Val Val Ile Asn Val 55 Ser Gly Lys 65 Thr Leu Asn Gin Phe 70 Pro Asp Thr Arg Asn Arg Tyr Tyr 85 Asp Pro Leu Arg Asn Arg Pro Ser 100 Phe Asp Ala Ile Leu 105 Arg Leu Arg 115 Arg Pro Val Asn Val 120 Pro ILe Lvs 130 Phe Tyr Glu Leu Gly 1 35 Glu Asn < 210 > < 211 > ≪ 212 > ≪ 213 > 51 402 DNA synthetic origin < 220 > < 2 21 > > ≪ 222 > ≪ 223 > CDS U). , (402) B-'i'shak < 400 > atg gat Met Asp 1 51 act Ile gaa Glu cag C-ln 5 gaa Glu ctg Leu gaa Glu ege Arg erg Leu gaa Glu cag Gin gaa Glu 20 gtt Val aat Asn caa Gin gaa Glu egt Arg 25 acc Thr ctg Leu ctg Leu 3 5 gca Ala aaa T.ys tcc Ser ggc Gly atg Met 40 gaa Glu ggt. Gly ctg Leu 50 egt Arg ttt Phe gaa Glu acc Thr cag Gin 55 ctg ^ eu aaa Lys acc Thr 6.5 ctg Leu ctg Leu ggt Gly aat Asn ccg Pro 70 cag Gin aaa Lys egt Arg ege Arg aac Asn gaa Glu tat Tyr ttt Phe 85 ttt Phe gat Asp ege Arg aat Asn ctg Leu did Tyr ttt Phe tat Tyr ICO cag Gin agc Sergt Gly glygt Arg e Arg 105 ccg Pro ctg Leu gat Asp 1 15 gtg Val ttt Phe agc Ser gaa Glu gag Glu 120 atc Ile * * «« • «• # '' • '*' • • • * * * • • • '' '' '' '' * '' * '' * '' 'tt • • • • • • • * * * ♦ • m Gin Trp Lys Ala Gin 3C Ala Arg Lys Lei Gin Ala 45 Ser Gly Met Leu Arg Phe 60 Glu Your Gin Leu Leu Leu 75 Gly Asn Pro Gin Lys 80 Asn 90 Glu Tyr Phe Phe Asp 95 Arg Tyr Phe Tyr Gin Ser 110 Gly Gly Leu Asp Val Phe 125 Ser Glu Glu Ala Phe Glu 140 Arg Tyr gca cl cl S gca agc ass egt 48 Ala 10 Lys Ala Ser Ile Arg 15 Arg aggtgtg gca did ctg caa 96 Ser Arg Met A la Tyr 30 Leu Gin egt gtt gtt att. aat gtg agc 144 Arg Val Val Τ ', e 4 5 Asn Val Ser acc ctg aat cag ttc ccg gat 192 Thr Leu Asn 60 Gin Phe Pro Asp aat egt tat gat ccg ctg 24C Asn Arg Tyr Tyr Asp Pro Leu 8C egt ccg agc ttt gat gcc at L Arg 90 Pro Ser Phe Asp A_a 9 5 Ile ctg egt Ca L ccg 91 L atat gtt Leu Arg Arg Pro Val 110 Asn Val 33a ttt tat gaa ctg ggc gaa Lys Phe Tyr Giu 125 Leu Gly Glu 384 402 44 aac gcc ttt gaa cgc did Asn Ala Phe Glu Arg Tyr 130 < 210 > 52 < 211 > 134 ≪ 212 > PRT < 213 > synthetic origin < 4C0 > 52 Met Asp 1 Ile Glu Gin 5 Glu Leu Glu Arg Ala 1 0 Lys Ala Ser Ile Arg 15 Arg Leu Glu Gin Glu 20 Val Asn Gin Glu Arg 25 Ser Arg Met Ala Tyr 30 Leu Gin Thr Leu Leu 35 Ala Lys Ser Gly Met 40 Glu Arg Val Val Ile 45 Asn Val Ser G1y Leu 50 Arg Phe Glu Thr Gin 55 Leu Lys Thr Leu Asn 60 Gln Phe Pro Asp Thr Leu 65 Leu Gly Asn Pro 70 Gin Lys Arg Asn Arg 75 Tyr Tyr Asp Pro Leu 80 Arg Asn Glu Tyr- Phe 85 Phe Asp Arg Asn Arg 90 Pro Ser- Phe Asp A_L ci 95 Ile Leu Tyr Phe Tyr 100 C-ln Ser Gly Gly Arg 105 Leu Arg Arg Pro Val 110 Asn Val Pro Leu Asp 115 Val Phe Ser Glu Glu 120 Ile Lys Phe Tyr Glu 125 Leu Gly Glu Asn Ala 130 Phe Glu Arg Tyr < 210 > < 21 1 > ≪ 212 > < 2 13 > > 53 384 DMA svntheci: see origin < 220 > < 221 > ≪ 222 > ≪ 223 > CDS (1) · (384) Tshak-Al < 4 0C > atg gaa Met Glu 1 53 egt Arg gcL Val gtt Val 5 att Ile aat Asn gtg Val agc Sergt Gly 1 0 ctg Leu egt Arg ttt Phe gaa Glu acc Thr 15 cag Gin ctg aaa Leu Lys acc Thr ctg Leu 20 aat Asn cag Gin Ttc Phe ccg Pro gat Asp 25 acc Thrggt Leu ctg Leu ggt Gly aat Asn 3c ccg Pro cag Gin aaa egt Lys Arg aa t Asn 35 egt Arg Lat Tyr tat Tyr gat Asp ccg Pro 40 ctg Leu ege Arg aac Asn g cj a Glu did Tyr 45 ttt Phe ttt Phe gat Asp cg c aal Arg Asn 50 egt Arg ccg Pro agc. Ser ttt Phe gat Asp 5 5 gcc Ala att Ile ctg Leu tat Tyr ttt Phe 60 Igl Tyr cag Gin agc Sergt Gly glygt egt Gly Arg ctg Leu egt Arg egt Arg ccg Pro gt t Val aac Asn gtt Val ccg Pro ctg jeu gat Asp gtg Val ttt Phe agc Ser gaa Gl u 48 96 144 192 240 45 f ········ 65 70 75 80 gag atc aaa ttt did Qaa ctg ggc gaa ci 5-C gCC ttt gaa cgc tat tcc 288 Glu I le Lvs Phe Tyr 85 Glu Leu Gly Glu Asn 90 A_a Phe Glu Arg Tyr 95 Ser ggc atg aaa cag ctg gaa 333 gaa ctg caa gca atc gaa aaa cag ctg 336 Gly Met 3ys Gin 100 Leu Glu LVS Glu Leu 105 Gin A ': a Tie Glu Lys 110 Gin Leu gca caa ctg caa tgg aag gct caa gct cgc aag aaag ctg gca cag .384 Ala Gin Leu 115 Gin Trp Lys Ala Gin 120 Ala Arq Lys Lys Lys 125 Leu Al a Gl n < 210 > 54 ≪ 211 > 128 < 212 > PKT < 213 > synthetic origin < 400 > 54 Met 1 Glu Arg Val Val 5 Ile Asn Val Ser Gly 10 Leu Arg Phe Glu Thr 15 Gin Leu Lys Thr Leu 20 Asn Gin Phe Pro Asp 25 Thr Leu Leu Gly Asn 30 Pro Gin Lys Arg Asn 35 Arg Tyr Tyr Asp Pro 40 Leu Arg Asn Glu Tyr 45 Phe Phe Asp Arq Asn 5C Arg Pro Ser Phe Asp 55 Air) Ile Leu Tyr Phe 60 Tyr Gin Ser Gly Gly 6 5 Arg Leu Arg Arg Pro 70 Val Asn Val Pro Leu 7 5 Asp val Phe Ser G1 u 80 Glu Tie Lys Phe Tyr 85 Glu Leu Gly Glu Asn 90 Ala Phe Glu Arg Tyr 95 Ser Gly Met Lys Gin ICO Leu Glu Lys Glu Leu 105 Gin Ala I Glu Lys 110 Gin Leu Ala Gin Leu 115 Gin Trp Lys Ala Gin 1 2 0 Ala Arg Lys Lys Lys 1 25 Leu Ala Gin < 210 > d 5 < 211 > 426 ≪ 212 > DNA < 213 > synthetic origin < 220 > ≪ 221 > COS < 222 > (1) .. (426) ≪ 223 > Tshak-A < 4 0 0 > 5 5 atg Met 1 gaa Glu egt Arg gtt Val g LL Val 5 aat Ile aat Asn gtg Val agc Ser ggt Gly 1 0 ctg Leu egt Arg ut.t Phe gaa Gl u acc Th 15 cag Gln ctg Leu 5 L ys Acc Thr ctg Leu 20 aat Asn cag Gin ttc Phe ccg Pro gat Asp 25 acc Thru ctg Leu ctg Leu ggt Gly aat Asn 30 ccg Pro cag Gin aaa Lys cgL Arg aat Asn 3 5 egt Arg tat Tyr tat Tyr gat Asp cc Pro 40 ctg Leu cgc Arg aac Asn gaa Glu did Tyr 45 ttt Phe ttt Phe gat Asp 48 96 144 46 cgc aat Arg Asn 50 egt Arg ccg Pro agc Ser t.rt Phe gat. Asp 5 5 gcc A1 a ci Γ. L Ile Ctg Leu did Tyr ttr Phe 60 Lat Tyr cag Gin agc Sergt Gly 192 ggt egt Gly Arg 65 ctg Leu egt Arg egr Arg ccg Pro 70 gtt Val aat Asn gtr Val ccg Pro ctg Leu 75 gat Asp gtg Val ttt Phe agc Sera Glu 8C 240 gag atc Glu Ile aaa Lys r. 11. Phe t.at Tyr 85 gaa Glu ctg Leu ggc Gly gaa G'.u aac Asn 90 gcc Ala ttr Phe gaa Glu ege Arg tat Tyr 95 tcc Ser 288 ggc atg Gly Mer aaa Lys cag Gl n 100 ctg Leu gaa Glu aaa Lys gag Glu ctg Leu 105 aaa Lys cag Gin tta Leu gaa Glu clSol Lvs lio gaa Glu ctg Leu 336 caa gca Gin Ala atl Ile 1 1 5 gaa Glu aaa Lys C aC | Gin ctg Leu gca Ala 120 cag Gin erg Leu caa Glngg Trp aaa Lys 125 gca Ala cag Gin gca Ala 384 egr aaa Arg Lys 130 aaa Lys aaa Lys ctg Leu gcc Ala cag Gin 135 ctg Leu aaa Lys aaa Lys aaa Lys ett Leu 140 cag Gin gcc Ala 426 < 210 > ≪ 211 > ≪ 212 > ≪ 213 > 56 112 PRT syntheti: apparent origin < 4C0 > 56 Met Glu 1 Arg Val Va 1 5 ile Asn V ai Ser Gly 10 Leu Arg Phe Gl u Thr 15 Gin Leu [ys Thr Leu 20 Asn Gin Phe Pro Asp 25 Thr Leu Lei] Gly Asn 30 Pro Gin Lys Arg Asn 35 Arg Tyr Tyr Asp Pro 40 Leu Arg Asn Glu Tyr 45 Phe Phe Asp Arg Asn 50 Arg Pro Ser Phe Asp 55 A ^ Ile Leu Tyr Phe 60 Tyr Gin Ser Gly Gly. Arg 65 Leu Arg Arg Pro 10 Val Asn Va 1 Pro Leu 75 Asp Val Phe Ser Glu 8C Glu Ile T ys Phe Tyr 85 Glu Leu Gl y Glu Asn 90 A: a Phe Glu Arg Tyr 95 Ser Gly Mer Lys Gin 100 Leu Glu lys G'.u Leu 1 05 Lys Gl n Ieu Glu Lys 110 Glu Leu Gin Ala Ile 115 Glu Lys Gin Leu Aid 120 Gin Leu Gin Trp Lys 1 25 Ala C-ln Ala Arg Lys 1 3 C Lys Lys Leu Al a Gin 135 JGU Lys Lvs Lys Leu i / o Gin Ala < 210 > ≪ 211 > ≪ 212 > ≪ 213 > 51 39 9 DNA synl.hei i see origin <: 220 > ≪ 221 > ≪ 222 > CD3 (1). , , (399) 47 • »•« • · * < 2 2 3 > Tshak -B < 400 > atg gaa Met Gl li 1 57 egt Arg gtt Val gtt Val 5 at.t Ile aat Asn gtg Val agc Ser ctg Leu aaa Lys acc Thru gt Leu 20 aat Asn caq Gin ttc Phe ccg Pro gat Asp 25 aaa Lys egt Arg aat Asn 35 egt Arg did Tyr tat Tyr gat Asp ccg Pro 40 ctg Leu cgc Arg aat Asn 50 egt Arg ccg Pro agc Ser ttt Phe gat Asp 55 gcc Ala att a le ggt Gly 65 egt Arg ctg Leu egt Arg egt Arg ccg Pro 70 gtt Val aat Asn good Val gag Glu atc Ile aaa Lys ttt Phe tat Tyr 85 gaa Glu ctg Leu ggc Gly gaa Glu ggc Gly gaL Asp act Ile gaa Glu ICO cag Gin gaa Glu ctg Leu gaa Glu cgc Arg 1 05 ctg Leu gaa Glu cag Gin 115 gaa Glu gtt Val aat Asn caa Gin gaa Glu 120 egt Arg acc Thr ctg Leu 130 ctg Leu gca Ala aaa Lys < 210 > ≪ 211 > < 21 2 > < 21 3 > 5 8 133 PRT synth lethal origin < 40C > 58 Met 1 Glu Arg Val Val 5 Ile Asn Val Ser Leu Lys Thr Leu 20 Asn Gin Phe Pro Asp 25 Lys Arg Asn 35 Arg Tyr Tyr Asp Pro 40 Leu Arg Asn 50 Arg Pro Ser Phe Asp 55 Ala Ile Glv 65 * Arg Leu Arg Arg Pro 70 Val Asn Val Glu Ile Lys Phe Tyr 85 Glu Leu Gly Glu Gly Asp Ile Glu 100 Gin Glu Leu Glu Arg 1C5 ggt ctg egt ttt gaa acc cag 48 Gly io Arg Phe Glu Thr 15 Gin acc ctg ctg ggt aat ccg cag 96 Thr Leu Leu Gl y Asn 30 Pro Gin cgc aac gaa did ttt ttt gat 144 Arg Asn Glu Tyr 45 Phe Phe Asp ctg did ttt did cag agc. lg Tyr Phe 60 Tyr Gin Ser Gly ccg ctg gat gtg ttt agc gaa 240 Pro Leu 75 Asp Val Phe Ser Glu 80 aac gcc ttt gaa cgc tat tcc 288 k £ > ο ω D Ala Phe Glu Arg Tyr 95 Sercaça aca gca agc att egt egt 336 Ala Lys Ala Ser Ile 110 Arg Arg ag e atg gca act ctg caa 384 Se Arg Arg Ala 125 Tyr Leu Gin 399 Gly 1 0 Leu Arg Phe Glu Thr 15 Gin Thr Leu Leu Gl y Asn 30 Pro Gin Arg Asn Glu Tyr 4 5 Phe Phe Asp Leu Tyr Phe 60 Tyr Gin Ser Gly Pro Leu 75 Asp Val Phe Ser-Glu 80 Asn 90 Ala Phe Glu Arg Tyr 95 Ser Ala Lys Ala Ser Ile 110 Arg Arg 48 • * • η * * t * * * 41 I Leu Glu Gin 115 Glu Val Asn Gin Glu Arg Ser Arg Met 120 Ala Tyr Leu Gin 125 Thr Leu 130 Leu Ala Lys < 210 > 59 < 211 > 183 < 212 > DNA < 213 > synthetic origin < 2 2 Ο > ≪ 221 > CDS < 222 > (1), (183) < 223 > Foldon-Al < 40C > 59 atg Met 1 g Gly d Tyr att Ile cct Pro 5 gaa Glu gc.a Ala cca pro Arg Arg Asp 10 ggc Gly caa Gin gca Ala ~ O.C Tyr gtt Val 15 egt. Arg 48 aaa Lys gac Asp ggt Gly gaa Glu 20 Lgg Trp gtc Val ctg Leu ctg Leu tcc Ser 25 act Thrttc Phe ctg Leu tcc Ser ggc Gly 30 atg Met aaa Lys 96 cag Gin ctg Leu gaa Glu 35 aaa Lys gaa Glu ctg Leu caa Gin gca Ala 40 ac: c Tie gaa Glu oil 3 3 Lvs cag Gin ctg Leu 45 gca Ala caa Gin ctg Leu 144 caa Gin tgg Trp 50 aag Lys get Ala caa Gin get Ala ege Arg 5 5 aag Lys aaa Lys aag Lys ctg Leu gca Ala 60 cag Gin 183 ≪ 210 > 60 < 211 > 61 < 212 > PRT < 213 > synthetic origin < 400 > 60 Mel 1 Gly Tyr Ile Pro 5 Glu Ala Pro Arg Asp 10 Gl y Gin Ala Tyr Val 15 Arg Lys Asp Gly Glu 20 Trp Val Leu Leu Ser 25 Thr Phe Leu Ser Gly 30 Met Lys Gin Leu Gl u 3 5 Lys Glu Leu Gin Ala 40 1 Glu Lys Gin Leu 45 Ala Gin Leu Gin Trp 50 Lys Al a Gl Ala Arg 55 Lys Lys Lys Leu Ala 60 Gin < 210 > 61 < 211 > 225 ≪ 212 > DNA < 213 > synthetic origin < 220 > ≪ 221 > CDS < 222 > (1) .. (225) ≪ 223 > Foldon A < 40 0 > 61 atg ggt tat cct gaa gca cca egt gat ggc caa gca tac gtt egt Met G] v Tyr ile Pro Glu Ala Pro Arg Asp Glv Gin Ala Tyr Val Arg 1 5 10 15 48 49 aaa gac Lys Asp ggt Gly gaa Glu 20 Lgg Trp gtc Val ctg Leu ctg Leu tcc Ser 25 dct Thr tcc Phe ctg Leu tcc Ser ggc Gly 30 atg Met aaa Lys 95 cag ctg Gin Leu gaa Glu 35 aaa Lys gag Glu ctg Leu aaa Lys cag Gl n 40 ita Leu gaa GLu aaa Lys gaa Glu ctg Leu 45 Ccicl Gin gca Ala att Ile 144 gaa aaa Glu Lys 50 cag Gin ctg Leu gca Ala cag Gln ctg Leu 55 caa Gin tgg Irp aaa Lys gca Ala cag Gin 60 gca Ala cg t Arg aaa Lys aaa Lys 192 aaa ctg Lys Leu 65 gcc Ala cag Gin ctg Leu aaa Lys 70 aaa Lys aaa Lys ett Leu cag Gin gcc Ala 75 225 < 21 0 > < 21 1 > < 21 7 > < 21 3 > 62 75 PKT synthetic origin < 4 0 0 > 62 MeL Gly 1 Tyr T le Pro 5 Glu Ala Pro Arg Asp 1.0 Gly Gin Ala Tyr Val 15 Arg Lys Asp Gly Glu 20 Trp Val Leu Leu Ser 25 Thr Phe Leu Ser Gly 30 Yet Lys Gin Leu Glu 35 Lys Glu Leu Lys Gin 40 Leu Glu Lys Glu Leu 45 Gin Ala Ile Glu Lys 50 Gin Leu A1 a Gin Leu 5 5 Gln Trp Lys Ala Gin 60 Ala Arg Lys Lys Lvs Leu 65 Ala Gin Leu Lvs 7 0 Lys Lys Leu Gin Ala 75 < 210 >; ≪ 211 > ≪ 212 > ≪ 213 > 6 3 198 DNA synchet1: apparent origin. ≪ 220 > < 221 > ≪ 222 > ≪ 2 2 3 > CDS (1) - (198) Foldon-B < 4 00 > atg ggt. Me t Gly 1 63 did Tyr aut Ile ccz Pro 5 gaa Glu gca Ala cca pro Arg Arg Asp 1 0 qgc Gly caa C-ln gca Ala tac Tyr gtt Val 15 egt Arg 4 8 aaa gac Lys Asp ggt. Gly gaa Glu 20 tgg Trp gtc Va 1 ctg Leu ctg Leu Lee Ser 2 5 dCt Thr 1.1. o Phe c Lg Leu tcc Ser ggc Gly 30 gat. Asp att Tie 96 gaa cag Glu Gin gaa Glu 3 5 ctg Leu gaa Glu ege Arg gca A _cl aaa Lys 40 gca A · lg ag Ser atr. 1 lit arg arg Arg 45 ctg Leu gaa Glu cag Gin 144 gaa gtt Glu Va: 5C aat As n Cää Gin gaa Glu egt Arg agc Ser 55 egt. A rg aLg Met gca Ala n t Tyr ctg Leu 60 g: n acc Th r ctg Leu ctg Leu 192 gca aaa A'a I.ys 198 50 65 < 210 > 64 ≪ 211 > 66 < 212 > PRT < 213 > synthetic origin < 40C > 64 Met 1 G.I. y Tyr Ile Pro 5 Glu Ala Pro Arg Asp 10 Gly G.1 n Ala Tyr Val 15 Arg > 1 Asp Gly Glu 20 Trp Val Leu Leu Ser 25 Thr Phe Leu Ser Gly 30 Asp Ile Glu Gin Gl u 35 T.eu Glu Arg Ala Lys 40 Ala Ser Ile Arg Arg 45 Leu Glu Gl Glu Val 50 Asn Gl.n Glu Arg Ser 55 Arg Met Ala Tyr Leu 60 Gin Thr Leu Leu Ala Lys 65 < 210 > 65 ≪ 211 > ISO < 212 > DNA < 213 > synthetic origin < 220 > ≪ 221 > CDS < 22 > (1) .. (ISO) < 23 > Al -t'oldon < 400 > 65 arg Met 1 aaa T.ys cag Gin ctg Leu gaa Glu 5 aaa Lys gaa Glu ctg Leu caa Gin gca Ala 10 acc Ile gaa Glu aaa T.ys cag Gin ctg Leu 15 gca Ala 48 caa Gin ctg Leu caa Gl n Lgg T rp 20 aag iys gct Ala caa Gin gct Ala cgc Arg 25 aag Lys aaa lys aag Lys ctg Leu gca Ala 30 cag Gin tcc Ser 96 ggg Gly ggt G] y tat. Tyr 3 5 stt ile cc L Pro gaa Glu gca Ala cca Pro 4C egt. Arg ga l. Asp ggc Gly caa Gin gca Ala 45 tac Tyr gtt. Val Arg Arg 144 aaa Lys gac Asp 50 ggt Gly gaa Glu tgg T rp gtc Val ctg Leu 55 ctg Leu LCC Ser act. Thrttc Phe ctg Leu 60 1.80 ≪ 210 > 66 < 211 > 60 < 212 > PRT < 213 > synthetic origin < 4 00 > 66 Met 1 Lys Gin T.eia Glu 5 Lys Glu Leu Gin Ala 10 Ile Gl u Lys Gin T.eu 15 Ala Gin Leu Gin Trp 20 T VS Al a Gin A _ a Arg 25 Lys Lys I s Leu Ala 30 Gin Ser Gly Gly Tyr Ile Pro Gl u Ala Pro Arg Asp Gly Gin Ala Tyr Val Arg 51 Lys Asp Gly Glu Trp Val Leu Leu Ser Her Phe Leu 50 55 50 <21. 0 > 67 ≪ 211 > 222 < 212 > DNA < 213 > synther.isehe origin < 220 > ≪ 221 > CDS < 222 > (1) .. (222) < 223 > A-Foldon < 40 Q > 67 atg aaa cag ctg gaa aaa gag ctg aaa cag tta gaa aaa gaa ctg caa 48 Met Lys Gin Leu Glu Lys Glu Leu Lys Gin Leu Glu Lys Glu Leu Gin 1 5 10 15 gca att gaa aaa cag ctg gca caq ctg caa tgg aaa gca cag gca egt 96 Ala Ile Glu Lys Gin Leu Ala Gin Leu Gin Trp Lys Ala Gin Ala Arg 20 25 3C aaaaaaaa ctg gcc cag ctg aaa aaaaaa cc ccc gcc tcc ggc ggt 144 Lys Lys Lys Leu Ala Gin Leu Lys Lys Lys Leu G.1 n Al a Ser Gly Gly 35 40 45 tat att cct ga ca cca egt gat ggc caa gca tac gtt egt aaa gac 192 Tyr Ile Pro Glu Ala Pro Arg Asp Gly Gin Ala Tyr Val Arg Lys Asp 50 55 60 ggt gaa tgg g lc ctg ctg tcc act ttc ctg 222 Gly Glu Trp Val Leu Leu Ser Thr Phe Leu 65 70 < 210 > 68 < 211 > 74 ≪ 212 > PRT < 213 > synthetic origin < 40C > 68 Met 1 Lys Gin ^ eu Gl u 5 Lys Glu Leu Lys Gin 10 Leu Glu Lys Glu Leu 15 Gin A '. A Ile Glu Lys 20 Gin Leu Ala Gin Leu 25 Gin Trp Lys Ala Gin 3 0 Ala Arg Lys Lys Lys 35 Leu Ala Gin Leu Lys 40 Lys Lys Leu Gin Ala 45 Ser Gly Gly Tyr I] e 50 Pro Glu Ala Pro Arg 5 5 Asp Gly Gin Ala Tyr 6 0 Val Arg Lys Asp Gly 65 Glu Trp Va 1 Leu Leu 70 Ser Thr Phe Leu < 210 > 69 < 211 > 198 ≪ 212 > DNA < 213 > synthetic origin < 220 > ≪ 221 > CDS < 222 > (1) .. (198) < 223 > B-Foldon 69 < 400 > 52 atg Met 1 gat Asp old Ile gaa Glu cag Gin 5 gaa Glu ctg Leu gaa Glu ege Arg gca Ala 10 aaa Lys gca Ala agc Serat Ile egt Arg 15 egt Arg 48 ctg Leu ga a Glu cag Gin gaa Glu 20 gtt Val aat Asn caa Gin gaa Glu egt Arg 25 agc Ser egt Arg atg Met gca Ala did Tyr 30 cLg Leu caa Gin 96 acc Thru ctg Leu ctg Leu 35 gca Ala Lys tcc Ser ggc Gly ggt Gly 40 tat Tyr att I le cct Pro gaa Glu gca Ala 45 cca Pro Arg Arg Asp 144 ggc Gly ca Gin 50 gca Ala tac Tyr gtt Val egt Arg aaa Lys 55 gac Asp gt Gly gaa Glu g tgg Trp gtc Val 60 ctg Leu ctg Leu tcc Ser act Thr 192 tac Phe 65 ctg Leu 198 < 210 > ≪ 211 > < 21 2 > ≪ 213 > 70 66 PRT synthetic origin < 40 0 > 70 Met 1 Asp Ile Glu Gin 5 Glu Leu Glu Arg Ala 1 0 Lys Ala Ser Ile Arg 15 Arg Leu Glu Gl Glu 20 Val Asn Gl n Glu Arg 25 Ser Arg Me L Ala Tyr 30 Leu Gin Thr Leu T, eu 35 A1 a Lys Ser Gly Gly 40 Tyr Ile Pro Glu Ala 45 Pro Arg Asp Gly Gin 50 Ala Tyr Val Arg Lys 55 Asp Gly Glu Trp Val 60 Leu Leu Ser Thr Phe 65 Leu < 210 > < 2 11 > < 2 12 > < 213 > 71 432 LINA synthet.i: see origin < 22C > < 221 > ≪ 222 > < 223 > CD5 (1) ·· (432) Al-Cut Al < 4 0 0 > atg aaa Met I.ys 1 71 cag Gin ctg Leu gaa Glu 5 aaa T.ys gaa Glu ctg Leu caa Gin gca Ala 10 atc Ile gaa Glu aaa Lys cag Gin ctg Leu 15 gca Ala 48 caa Gin ctg Leu caa Gln egg Trp 20 aag Lys get Al 3 caa Gin get Ala cqc Arg 25 aag Lys aaa Lys aag Lys ctg Leu gca A1 a 30 cag Gln tcc Ser 96 ggc Gly ctt Leu gar. Asp 35 gaa Glu aaa Lys agt. Ser teg Ser aat Asn 40 acc Thrg Al ate Ser gtc Val gtg Val 45 gtg Val cta Leu tgt Cys 144 acg Thr gca Ala cca Pro gat Asp gaa Glu ga ALa aca Thr gcc Ala cag Gin gat Asp tta Leu gcc Ala gcc Ala aaa Lys gtg Val ctg Leu 1 9 2. 53 53 50 5o 60 gcg Ala 55 gaa Glu aaa Lys ctg Leu gcg Ala gcc Ala 70 tgc Cys gcg Ala acc Thr tt Leu atc i. le 75 ccc Pro gqc Gly gcc Ala acc Thr cct Ser 80 240 ctc Leu did Tyr tac Tyr tgg Trp gaa Glu 8 5 g Gly aag Lys erg Leu gag G 1 u ca Gl n 90 gaa Glu tac Tyr gaa Glu gtg Val cag Gin 95 atg Ket 288 att I le tta Leu aa Lys act Thr 100 dcc Thr gta Val tct Ser cac His cag Gin 105 cag Gin gca Ala ctg Leu ccg Leu gaa Glu 110 ege Cys ctg Leu 336 aag J.ys tct Ser cat His 115 cat Hi s cca Pro did Tyr a a Gin acc Thr 120 ccg Pro gaa Gl u ert Leu ctg Leu gtt Val 125 tca Leu ccc Pro gLL Va 1 384 aca Thr cac His 130 gga Gly gac Asp aca Thrgat Asp tac Tvr 135 ctc Leu cca Ser tgg Trp ctc Leu aac Asn 140 gca Ala tct Ser tta Leu ege Arg 432 < 210 > < 211 > ≪ 212 > ≪ 213 > 72 144 FRT synthetic Kerkunf L < 400 > 72 Mec 1 Lys Gin Leu Glu 5 Lys Gl u Leu Gin Ala 10 1.1 e Glu Lys Gin Leu 15 Ala Gin Leu Gin Trp 20 Lys Ala Gin Ala Arg 25 Lys Lys Lys Leu Ala 30 Gln Ser Gly Leu Asp 35 Glu Lys Ser Ser Asn 40 Thr Ala Ser Val Val 45 Val Leu Cys Thr Ala 50 Pro Asp Glu Ala Thr 55 Ala Gin Asp Leu Ala 60 Ala Lys Val Leu Ala 65 Glu Lys Leu Ala Ala 70 Cys A ± a Thr Leu 1 le 7 5 Pro Gly Ala Thr Ser 80 Leu Tyr Tvr Trp Glu 85 Gly Lys Leu Gl u G_n 90 Glu Tyr Glu Val Gin 95 Met I le Leu T.ys Thr IOC Thr Val Ser His Gin 105 Gin Ala Leu Leu Glu 110 Cys Leu Lys Ser His 1 15 His Pro Tyr Gin Thr 120 Pro Glu Leu Leu Val 125 Leu Pro Val Thr His 130 Gly Asp Thr Asp Tyr 135 Leu Ser Trp Leu Asn 140 Ala Ser T.eu Arg < 210 > 73 < 2Π > 474 < 2 1 2 > DKA < 213 > synthetic origin < 220 > ≪ 221 > CDS < 2 2 2 > (1) .. (474) < 223 > A CutAl 73 < 4 OC > 54 atg Mer. 1 aa Lys cag Gin ctg Leu gaa Glu 5 a aa Lys gag Glu ctg Leu aaa Lys cag Gin 10 tra Leu gaa Glu aaa Lys gaa Glu ctg Leu 15 caa Gin 48 gca Ala att Tie gaa Glu aaa Lys 20 cag Gin ctg Leu gca Ala cag Gin ctg Leu 25 caa Gin tgg Trp aaa Lys gca Ala cag Gin 30 gca Ala egt Arg 96 aaa Lys aaa Lys a aa Lys 35 ctg Leu gcc Ala cag Gin erg Leu aaa Lys 40 aaa Lys aaa Lys CtL Leu cag Gly n gcc Al a 45 Lee Ser ggg Gly lett Leu 144 gat Asp gaa Glu 5C aaa Lys agt Ser tcg Ser aat Asn acc Thr 55 gcg Ala rct Ser gtc Val gtg Val gtg Val 60 cta Leu tqt Cys aeg Thr gca Ala 192 cca Pro 65 gat Asp gaa Glu gcg Ala aca Thr gcc Ala 70 cag Gin even Asp t La Leu gcc Ala gcc Ala 7 5 aaa Lys gtg Val cLg Leu gcg Ala gaa Glu 80 240 aaa Lys ctg Leu gcg Ala gcc Ala tgc Cys 85 gcg Ala acc Thrttg Leu atc Ile ccc Pro 90 ggc Gly get Ala acc Thrctct Serctc Leu 95 did Tyr 238 IcC Tyr tag Trp gaa Glu ggt Gly 100 aag Lys ctg Leu gag Gl u caa Gin gaa Glu 105 rac Tyr gaa Glu gtg Val cag G ^ n atg Met 110 att Ile t ta Leu 336 aaa Lys act Thr acc Thr 115 gta Val tct Ser cac His cag Gin cag Gin 120 gca Ala ctg Leu ctg Leu gaa Glu tgc Cys 125 ctg Leu aag Lys tct Ser 384 cat His cat His 130 cca Pro rar Tyr caa Gin Acc Thr ccg Pro 135 gaa Glu ett Leu ctg Leu gt.e Val rLa T.eu 1 40 cc L Pro gtt Val aca Thr cac His 432 gga Gly 145 gac Asp aca Thrgat Asp Lac Tyr ctc Leu 1 5 0 tca Ser tgg Trp ctc Leu aac Asn gca Ala 155 r.ct Ser tta Leu ege Arg 474 < 2 10 > ≪ 2 11 > ≪ 212 > '< 213 > 74 158 PRT synthetic origin < 4 0 0 > 74 Met 1. Lys Gin Leu Glu 5 Lys Glu Leu Lys Gln 1 0 Leu Glu Lys Glu Leu 1 5 Gln Ala Glu Lys 20 Gin Leu Ala Gin Leu 2 5 Gin Trp Lys Ala Gin 30 Ala Arg lys Lys Lys 35 Leu Ala Gin Leu Lys 40 Lys Lys Leu Gin Ala 4 5 Ser Gly Leu Asp Glu 50 Lys Ser Ser Asn Thr 55 Ala Ser Val Val Val 60 Leu C ys Thr Ala Pro 65 Asp Glu Ala Thr Ala 70 Gin Asp Leu Ala Ala 7 5 Lys Val Leu Ala Glu 80 Lys Leu Ala Ala Cys 85 Ala Thr Leu Ile Pro 90 Gly Ala Thr Ser Leu 9 5 Tyr-Tyr Trp Glu Gly 100 Lys Leu Glu Gin Glu 105 Tyr Glu Val Gin Met 1 1 0 Ile Leu 55 Lys Thr Thr 115 Val Ser His Gin Gin 120 Ala His His 130 Pro Tyr Gin Thr Pro 135 Glu Leu Gly 1 4 5 Asp Thr Asp Tyr Leu 150 Ser Trp Leu < 210 > < 211 > ≪ 212 > ≪ 213 > 75 450 DNA synthetic: origin < 220 > ≪ 221 > ≪ 222 > ≪ 223 > CDS (1) .. (450) B-CutAl < 400 > atg gat Met Asp 1 75 att Ile gaa Glu cag Gin 5 gaa Glu ctg Leu gaa Glu ege Arg ctg Leu gaa Glu cag Gin gaa Glu 2C gtt Va_at Asn caa Gin gaa Glu egt Arg 25 acc Thr ctg Leu ctg Leu 3 5 gca Ala aa Lys tcc Ser ggc Gly ett Leu 40 gat Asp t'.CL Ser gtc Val 50 gtg Val gtg Val ctal tg 1 Cys aeg Thr 5 5 gca Al a cca Pro tta Leu 65 gcc Ala gcc Ala aaa Lys gtg Val ctg Leu 70 geg Ala gaa Glu aaa Lys ctg Ile ccc Pro ggc Gly get Ala acc Thr 85 tet Ser ctc Leu tat Tyr tac Tyr gaa Glu tac Tyr gaa Glu gtg Val 100 cag Gin atg Met 3. L * "Ile L " α Leu aaa Lys 105 gca Ala ctg Leu ctg Leu 115 gaa Glu tgc Cys ctg Leu aag Lys tet Ser 1 20 cat His cct Leu ctg Leu 130 gLt Val tta Leu cct Pro gLL Val äCä Thr 135 cac Hi s gga Gly c Lc Leu 145 aac Asn gca Ala tc: Ser tta Leu ege Arg 150 < 21C > ≪ 211 > ≪ 212 > ≪ 213 > 76 150 PRT Synthetic Origin L < 4 0 C > 76 Leu Leu Glu Cys 125 Leu T.ys Ser Leu Val Leu 140 Pro Val Thr His Asn Ala 155 Ser Leu Arg gca ad gca agc att egt egt 48 Ala 10 Lys Ala Ser Ile Arg 15 Arg ag egt atg gca act ctg caa 96 Ser Arg Met Al a Tyr 3 0 Leu Gin gaa aaa agt teg aat acc g 144 Glu Lys Ser Ser 45 Asn Thr Ala gal gaa g aca gcc cag gal 192 Asp G1 u Ala 60 Thr Ala Gin Asp ctg gcc tgc geg acc ttg 240 Leu Ala 75 Ala Cys A1 a Thr Leu 80 tgg gaa ggt aag ctg gag caa 288 Trp 90 Glu Gly Lys Leu Glu 95 Gl.n act acc gLa tet cac cag cag 336 Thr Thr Val Ser His 110 Gin Gin cat cca tat caa acc ccg gaa 384 His Pro Tyr Gin 125 Thr Pro Glu gac c-ca gal Lac ctc tca tgg 432 Asp Thr Asp 140 Tyr Leu Ser Trp 450 56 Met 1 Asp I Glu Gin 5 Glu Leu Glu Arg Ala 10 ijys Ala Ser le Arg 15 Arg Leu Glu Gin Glu 20 Val Asn Gin Glu Arg 25 Ser Arg Met Ala Tyr 30 Lei Gin Leu Leu 35 Ala Lys Ser G ' -Y Leu 40 Asp Glu Lys Ser Ser 45 Asn Thr Ala Ser Val 50 Val Val Leu Cys Thr 5 5 Ala Pro Asp Glu Ala 60 Thr Ala Gin Asp Leu 65 Ala Ala Lys Val Leu 70 Ala Glu Lys Leu Ala 75 Ala Cys Ala Thr Leu 80 Ile Pro C-ly Ala Thr 85 Ser Leu Tyr Tyr T rp 90 Glu Gl y Lys Leu Glu 95 Gin Glu Tyr Glu Val 100 Gin Met Ile Leu Lys 105 Thr Thr Val Ser Kis 110 Gin Gin Ala Leu Leu 115 Glu Gly Asp Thr Asp 140 Tyr Leu Ser Trp Leu 145 Asn Ala Ser Leu Arg 150 < 210 > < 211 > < 21 2 > ≪ 213 > 77,435 DNA synthetic origin < 220 > < 2 21 > > ≪ 222 > ≪ 223 > CDS (1). (435) Cut Al-Al < 400 > atg ctt Met Leu 1 77 gat Asp gaa Glu aaa Lys 5 agt Ser teg Ser ate Asn acc Thrgcg A 1 a 10 Lct Ser gtc Val gtg Val gtg Val cta Leu 15 tgt Cys 48 acg Thr gca Ala cca Pro gat Asp 20 gaa Glu gcg Al a aca Thr gcc Ala cag Gin 25 gat Asp tta Leu gcc Ala gcc Ala aaa Lys 30 gtg Val ctg Leu 96 gcg Ala gaa Glu aaa Ly 5 35 ctg Leu gcg Ala gcc Ala tgc Cys gcg Ala 40 acc Thr tt leu atc Ile ccc Pro qgc Glv 45 qct Ala acc Thr tot Ser 144 c Lc Leu did Tyr 50 tac Tyr tgg Trp gaa Glu ggt Glv aag Lys 55 ctg Leu gag Glu caa Gin gaa Glu tac Tvr 6 0 < 2 < < Glu gtg Val cag Gin alg Met 192 att Ile 65 tta leu aaa iy3 act Thr acc Th ί gt.a Val 70 tcc Ser cac Hi s cag Gin cag Gin gca Ala 75 ctg Leu ctg Leu gaa Glu tgc Cys ctg Leu 80 240 aag lvs zcz ser car kis cat hi α ca pro 85 did Tyr caa Gin acc Thr ccg Pro gaa Glu 90 ctt Leu ctg Leu gtt Val tta Leu cct Pro 95 gtt Val 288 aca ca c gga gac aca gat tac ctc tca tgg ctc aac gca ict tw C. ege 336 57 Thr His Gly Asp 100 Thr Asp Tyr T.eu Ser 105 Trp Leu Asn Ala Ser 110 Leu Arg tcc ggc atg aaa cag ctg gaa aaa gaa ctg caa gca atc gaa aaa cag 384 Ser Gly Met 115 Lys Gin Leu Glu Lys 120 Glu Leu Gin Ala Ile 125 Glu Lys Gin ctg gca caa ctg caa tgg aag gct caa gct cgc aag aaaag ctg gca 432 Leu Ala 130 Gin Leu Gin Trp Lys 135 Ala Gin Ala Arg Lys 140 T, yn Lys Leu Ala 435 cag Gin 145 < 210 > 78 < 211 > 145 ≪ 212 > PRT < 213 > synthetic origin < 40 0 > 78 Met 1 Leu Asp Glu Lys 5 Ser Ser Asn Thr Ala 10 Ser Val Val Val, i 15 Cys Thr Ala Pro Asp 20 Glu Ala Thr Ala Gin 25 Asp Leu Ala Ala Lvs 30 Val Leu Al a Gl u Lys 35 Leu Al a Ala Cys Ala 40 Thr Leu Ile Pro Gly 45 Ala Thr Ser Leu Tyr 50 Tyr Trp Glu Gly Lys 5 5 Leu Gl u Gin Glu Tyr 60 Glu Val Gin Met Ile 65 Leu Lys Thr Thr Val 70 Ser His Gin Gin Ala 75 Leu Leu Glu Cys Leu 80 Lys Ser His Pro 85 Tyr Gin Thr Pro Glu 90 Leu Leu Val Leu Pro 95 Val Thr HiS Gly Asp 100 Thr Asp Tyr Leu Ser 105 Trp Leu Asn Ala Ser 110 Leu Arg Ser Gly Met 115 Lys Gin Leu G_u Lys 120 Glu Leu Gl Ala Ile 125 Glu Lys Gin Leu Ala 130 Gin Leu Gin Trp Lys 135 Ala Gin ALa Arg Lvs 14C Lys Lys Leu Ala Gin 1 4 5 < 210 > 79 < 211 > 477 ≪ 212 > DNA < 213 > synthetic origin < 2 2 C > < 2 21 > > CDS < 2 2 2 > (1) .. (477) ≪ 223 > CutAl-A < 4 00 > 79 atg crt gat gaa aaa agt regaat acc gcg tct gtc gtg gtg cta egt Met Leu Asp Glu Lys Ser Ser Asn Thr Ala Ser Val Val Val Leu Cys 15 10 15 48 58 acg Your gca Ala cca Pro gat Asp 20 gaa Glu gcg Ala aca Thr gcc Ala cag Gin 25 gat Asp tta Leu gcc A_a gcc Ala Lys 30 gtg Val ctg Leu 96 gcg Ala gaa Glu aaa Lys 35 ctg Leu gcg Ala gcc Ala Lgc Cys gcg Ala 40 acc Thr ttg Leu atc I le ccc Pro ggg Gly 45 get Ala acc Thr tet Ser 144 ctc Leu did Tyr 50 tac Tyr tgg Trp gaa Glu ggt Gly aag Lys 55 ctg Leu gag G lu caa Gin gaa G ~ u tac Tyr 60 gaa Glu gtg Val cag Gin atg Met 192 att Ile 65 tta Leu aaa Lys act Thr acc Thr gta Val 70 tet Ser cac His cag Gin cag Gin qca Ala 75 ctg Leu ctg Leu gaa Glu tgc Cys ctg Leu 80 240 aag Ly s Lc L Ser cat H is cat His cca Pro 85 did Tyr caa Gin acc Thr ccg Pro gaa Glu 90 cLt Leu ctg Leu gtt Va ^ tta Leu cct Pro 9 5 gtt Val 288 aca Thr cac His gga Gly gac Asp 1 0 0 aca Thrgat Asp Lac Tyr ctc Leu tca Ser 105 tgg Trp ctc Leu aac Asn gca Ala rct Ser 110 tta Leu ege Arg 336 tcc Ser ggc C-ly arg Met 115 aaa Lys cag Gin ctg Leu gaa Glu aaa Lys 120 gag Glu ctg Leu aaa Lys cag Gln tta Leu 125 gaa Glu aaa Lys ga a Glu 384 ctg Leu caa Gin 130 gca Ala att Ile gaa Glu aaa Lys cag Gin 135 ctg Leu gca Ala csg Gin ctg Leu caa Gin 140 tgg Trp aaa Lys gca Ala cag Gin 432 gca Al a 145 egt Arg aaa Lys aaa Lys aaa Lys ctg T.eu 150 gcc Ala cag Gln ctg Leu aaa Lys aaa Lys 155 aaa Lys ett Leu cag Gin gcc A ^ a 477 < 210 > < 211 > ≪ 212 > ≪ 213 > 80 159 PRT synLheti: see Herk: unf '< 400 > 80 Met. 1 Leu Asp Glu Lys 5 Ser Ser Asn Thr Ala 10 Ser Val Va 1 Val Leu 15 Cys Th r Ala Pro Asp 20 Glu Ala Thr Ala Gin 25 Asp Leu Ala ALa Lys 30 Val Leu Ala Glu Lys 35 Leu Ala Ala Cys Ala 40 Thr Leu Ile Pro Gly 45 Ala Thr Ser Leu Tyr 50 Tyr Trp Glu Gl.y Lys 5 3 Leu Glu Gin Glu Tyr 60 Glu Val Gln Met Ile 6 5 Leu Lys Thr Thr Val 70 Ser His Gin Gin Al a 7 5 Leu Leu Gl u Cys Leu 80 Lys Ser His Pro 85 Tyr Gin Thr Pro Glu 90 Leu Leu Val Leu Pro 9 5 Va 1 Your Hin Gly Asp 100 Thr Asp Tyr I, eu Ser 105 Trp Leu Asn Ala Ser 110 Leu Arg Ser Glv Met 115 Lys Gln Leu Glu Lys 1 20 Glu Leu Lys Gin Leu 125 Gl u Lys Glu 59 Leu Gin Ala 130 He Glu Lys Gin 135 Leu Ala Ala 145 Arg Lys T, y s luVS Leu 150 Ala Gin Leu < 210 > 81 < 211 > 450 < 212 > DNA < 213 > synthetic origin i L < 220 > ≪ 221 > CDS < 222 > (1) .. < 223 > Cut Al (450) -B < 40C > 81 atg ett gat Met Leu Asp 1 gaa Glu aaa Lys 5 agt Ser teg Ser a ^ L Asn acc Thr aeg Th r gca cca Ala Pro gat Asp 20 gaa Glu gcg Ala aca Thr gcc Ala cag Gin 25 gcg Ala gaa aaa Glu Lys 35 c Lg Leu gcg Ala gcc Ala tgc Cys gcg Ala 40 acc Thr ctc Leu tat tac Tyr Tyr 50 tgg Trp gaa Glu ggt Gl y aag Lys 55 ctg Leu gag Glu att Ile 65 tta aaa Leu Lys act Thr acc Thr gta Val 70 tct Ser cac His cag Gin aag Lys tct cat Ser His cat His cca Pro 85 did Tyr caa Gin acc Thr ccg Prc aca Thr cac gga His Gly gac Asp 1 00 clC c Thr ga t Asp tac Tyr ctc Leu Lea Ser 1 05 tcc Ser ggc gat Gly Asp 115 ar. L Ile gaa Glu cag Gin gaa Glu ctg Leu 120 gaa Glu egt Arg ctg gaa Leu Glu 130 cag Gin gaa Glu gtt Val aat Asn 135 C 3 ci Gin gaa G_u caa C-ln 145 acc ctg Thr Leu ctg Leu gca Ala aaa Lys 1 50 < 210 > 82 < 211 > 150 < 212 > PRT < 213 > synthet1 see origin < 4 C 0 > 82 Met Leu Asp Glu Lys Ser Ser Asn Your 1 5 Gin Gen Gin Trp Lys Ala Gin 14C Lys Lys Lys Lei Gin Ala 1 5 5 gcg tct gtc gtg gtg cta tgt 48 Ala Ser Val Val Val Leu Oys 10 15 gat tta gcc gcc aaa gtg ctg 96 Asp Leu Ala Ala Lys Val Leu 30 ttc ccc ggc gct acc tct 144 Leu Ile Pro Gly Ala Thr Ser 45 caa gaa tac gaa gtg cag atg 192 Gin Glu Tyr Glu Val Gin Met 60 cag gca ctg ctg gaa tgc ctg 240 Gin Ala Leu Leu Glu Cys Leu 75 80 gaa clt ctg gtt tta cct gtt 288 Glu Leu Leu Val Leu Pro Val 90 95 tgg ctc aac gca tct tta cgc 336 Trp Leu Asn Ala Ser Leu Arg 110 cgc gca aaa gca agc att egt 384 Arg Ala Lys Ala Ser Ile Arg 125 egt agc egt aLg gca did ctg 432 Arg Ser Arg Met Ala Tyr Leu 140 Ala Ser Val Val Val Leu Cys 10 1 5 450 60 Your Ala Pro AEp 20 Glu Ala Thr Ala Gin 25 Asp Leu Ala Ala Lys 30 Val Leu Ala Glu Lys 3 5 i eu Ala Al a Cys Ara 40 Thr Leu Ile Pro Gly 45 Ala Thr Ser Leu Tyr 50 Tyr Trp Glu Gly Lys 55 Leu Glu Gin Glu Tyr 60 Glu Val Gin Met Ile 65 Leu Lys Thr Thr Val 70 Ser His Gin Gl n A1. a 75 Leu Leu Glu Cys Leu 80 Lys Ser His Pro 85 Tyr Gin Thr Pro Glu 90 Leu Leu Val Leu Pro 95 Val Thr His Gly Asp 100 Thr Asp Tyr Leu Ser 105 Trp Leu Asn Ala Ser 110 Leu Arg Ser Gly Asp 115 Ile Glu Gin Glu Leu 120 Glu Arg Ala I.ys Al a 125 Ser Ile Arg Arg Leu 13 C Glu Gin Glu Val Asn 135 Gin Glu Arg Ser Arg 140 Met Ala Tyr Leu Gin 145 Thr Leu Leu Ala Lys 150 < 210 > < 211 > < 21 2 > < 21 3 > 83,336 DNA synthet.i: apparent origin < 220 > ≪ 221 > ≪ 222 > ≪ 223 > CDS (1). CuLAl (336) < 4 0 0 > atg ctt Met Leu 1 83 gat Asp gaa Glu aaa Lvs 5 agt Ser teq Ser aat Asn acc Thrgcg Ala 10 cct Ser gtc Val gtg Val gtg Val cta Leu 15 tgt Cys 48 acg Thr gca Al a cca Pro gat Asp 20 gaa Glu gcg Ala aca Thr gcc Ala cag Gin 25 gat Asp tta Leu gcc Ala gcc Ala aaa Lys 30 gtg Val ctg Leu 96 gcg Ala gaa Glu aaa Lvs 35 ctg Leu gcg Ala gcc Ala ege Cys gcg Ala 40 acc Thr tt Leu atc Tie ccc Pro ggc Gly 45 gez Ala acc Thr Lei Ser 144 ctc Leu Zaz Tyr 50 tac Tyr tgg Trp gaa Glu ggt Gly aag Lys 55 ctg Leu gag Glu caa Gin gaa Glu tac Tyr 60 gaa Glu gtg Val cag Gin atg Met 192 att Ile 65 ; ta Leu aaa Lys act Thr acc Thr gca Val 70 tet Ser cac Hi s cag Gin cag Gl n gca A1 a 7 5 ctg Teu ctg Leu gaa Glu tgc Cys ctg Leu 80 240 aag Lys zcz Ser cat His cat His cca Pro 85 Did Tvr caa Gln acc Thr ccg Pro CJ ca fi Glu 90 ctt Leu ctg Leu gtt Val tta Leu cd Pro 95 gtL Val 288 aca Thr cac His gga Gly gac Asp 100 aca Thr gat, Asp tac Tyr ctc Leu tca Ser 105 tgg T rp ctc Leu α ciC Asn gca Ala tet Ser 110 tta Leu ege Arg 336 < 210 > 84 61 ≪ 211 > 112 < 212 > PRT < 213 > synthetic origin < 400 > 84 Met 1 Leu Asp Glu T.ys 5 Ser Ser Asn Thr Ata Pro Asp 20 Giu 7ila Thr Ala Ala Glu Lys 35 Leu Ala Ala Cys Ala 40 Leu Tyr 50 Tyr Trp Glu Gly Lys 55 Leu Ile 65 Leu Lys Thr Thr Val 70 Ser His Lys Ser His Pro 85 Tyr Gin Thr Thr H is Gly Asp 100 Thr Asp Tyr Leu Thr Ala 10 Ser Val Val Val Leu 15 Cys Gin 25 Asp Leu Ala Ala Lys 30 Val Leu Thr Leu Ile Pro Gly 45 Ala Thr Ser Glu Gl Glu Tyr 60 Glu Val Gin Met Gin Gin Ala 75 Leu Leu Giu Cys Leu 80 Pro Glu 90 Leu Leu Val Leu Pro 95 Val Ser 105 Trp Leu Asn Ala Ser 110 Leu Arg
权利要求:
Claims (16) [1] Patent Attorneys Dipl.-Ing. Helmut Hübscher Dipl.-Ing. Karl Winfried Hellmich Spittelwiese 7, A 4020 Linz (38588) HEL Patent Claims: 1. Polypeptide material with flexible pore properties, which results from the two- or three-dimensional assembly of fusion proteins from at least two protein domains, wherein at least one domain forms a double helix and at least one protein Oligomerization domain having at least the oligomerization stage 3. [2] 2. The polypeptide material according to claim 1 having the property that the oligomerization stage of the oligomerization domain is between 3 and 12, usually between 3 and 6. [3] 3. The polypeptide material according to any one of claims 1 to 2, wherein the pore shape and size are determined by the type of compound of the protein domains of the fusion protein according to the invention and the pore size by the length of the double coil forming segment. [4] 4. The polypeptide material according to any one of claims 1 to 3, wherein the length of the double helix-forming segment comprises between 2 and more than 100 heptads (14 to 700 amino acid residues). [5] 5. polypeptide material according to any one of claims 1 to 4, whose technological properties, inter alia by the introduction of positively charged, negatively charged, hydrophobic, hydrophilic, cysteine, histidine and other amino acid residues, which allow specific interactions with the exposed on the surface segments, are defined while the ability to form the double helix is obtained, and the amino acid residues that govern the technological pore eigenactions are most often introduced to positions b, c, and f of the helix-forming segment. [6] The polypeptide material according to any one of claims 1 to 5, wherein the chemical pore characteristics are also determined by the properties of the protein domains of said fusion protein, including, but not limited to, the net charge, surface-exposed hydrophobic amino acid residues and the size of the protein domains. [7] 7. The fusion protein consisting of the polypeptide material according to any one of claims 1 to 6, which consists of at least two protein domains, wherein at least one domain forms a double helix and at least one is the protein oligomerization domain and the domains among each other by the flexible linker from 1 to 20 However, the fusion protein may also contain a signal sequence for directing protein rejection and polypeptide tagging. [8] The polypeptide material according to any one of claims 1 to 7, wherein the double helix-forming segment is based on the sequence SEQ ID NO: 2, SEQ ID NO: 4 or SEQ ID NO: 6 or the intended peptides having functionally similar properties. [9] The polypeptide material according to any one of claims 1 to 8, wherein the protein oligomerization domain is preferably selected from the sequences SEQ ID NO: 8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 84 and the sequences having greater than 50% homology with these sequences which are given the ability to form oligomers of the same type. [10] 10. polypeptide material according to any one of claims 1 to 9, which has been prepared by the assembly of the fusion proteins, which are among the sequences SEQ ID No. 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82 were selected. * * * * II I II · »··« * * «III 3 [11] The polypeptide material according to any one of claims 1 to 10, prepared by mixing two fusion proteins according to any one of claims 1 to 10, wherein the segment of the first fusion protein forms the parallel double helix together with the segment of the second fusion protein. The oligomerization domain of one fusion protein is located at the N-terminus and the other at the C-terminus, but the oligomerization domains of both proteins have the same oligomerization stage. [12] 12. The polypeptide material according to claim 11, having the property that the double-helix-forming segments are selected from natural or the planned parallel helices or from the following pairs: SEQ ID NOs: 16, 18, 20, 22, 24, 26, 28 , 30, 32. [13] 13. polypeptide material according to any one of claims 11 to 12 having the property that the protein Oligomerisationsdomänen be selected among the tetramer proteins, preferably among the sequences SEQ ID NO: 10 and SEQ ID NO: 12 or among the trimeric proteins, preferably under SEQ ID NO: 8 and SEQ ID NO: 84. [14] 14. The DNA information fusion protein of any one of claims 1 to 13, wherein the DNA is operably linked to regulatory elements, the promoter and the terminator, which regulate the expression of the fusion proteins in the host's cell. [15] 15. The use of the polypeptide material according to any one of claims 1 to 13 for the separation and concentration of molecules, molecule complexes, viruses or Na-noteilchen according to their properties. [16] 16. The use of polypeptide material according to any one of claims 1 to 13 for chemical catalysis. Linz, April 12, 2012 Kemijski in £ titut by: VL ^ · '
类似技术:
公开号 | 公开日 | 专利标题 DE19742706B4|2013-07-25|lipocalin muteins DE69533644T2|2005-02-17|RECEPTOR STRUCTURES FROM BACTERIA DE69434520T3|2009-10-15|BIOTINYLATION OF PROTEINS EP1675623B1|2013-03-20|Ubiquitin or gamma-crystalline conjugates for use in therapy, diagnosis and chromatography DE60124678T2|2007-09-13|PROTEIN EQUIPMENT FOR ANTIBODY MIMETICS AND OTHER TIE PROTEINS DE69827507T2|2006-03-09|TRIMERIZING MODULE DE19932688B4|2009-10-08|Design of beta-sheet proteins of gamma-II-crystalline antibody-like DE69434083T2|2006-02-09|EXPRESSION OF A FUSION POLYPEPTIDE TRANSPORTED FROM THE CYTOPLASMA WITHOUT A LEADER SEQUENCE Azam et al.2016|Type III secretion as a generalizable strategy for the production of full‐length biopolymer‐forming proteins KR100925689B1|2009-11-10|Multifunctional Protein Simultaneous Delivering Antibodies and Nanoparticles DE60021188T2|2006-04-27|MODIFIED HUMAN GRANULOCYTE COLONY STIMULATING FACTOR AND METHOD FOR THE PRODUCTION THEREOF EP1227848A2|2002-08-07|Method for linking molecular substances AT511130A2|2012-09-15|Polypeptide material with flexible pore properties CN111704663B|2021-09-28|Preparation method of collagen hydrogel DE69629808T2|2004-07-15|Promoters for gene expression DE19856052A1|2000-06-08|Conjugate for enrichment in neuronal cells CN111333715A|2020-06-26|Preparation method of type I collagen fiber CN106755042B|2020-10-09|Preparation method of bioactive small peptide based on combined self-shearing and protein scaffold DE102013106462B3|2014-10-09|Detection method for the detection of bacteria, methods for the production of fusion proteins and fusion protein EP2764018B1|2017-10-11|Self-assembling polypeptide polyhedra DE102015108849A1|2016-12-08|Method for protein immobilization using modified DNase domains DE60125386T2|2007-04-12|A method of preventing a freeze-induced decrease in the activity of a protein DE102005009219A1|2006-08-31|New ribonuclease tandem enzyme, useful for treating tumors, comprising 2 ribonuclease units connected covalently through a peptide linker DE10155862A1|2003-05-28|Production of recombinant antibodies by fusion with elastin-like peptides DE60207157T2|2006-07-13|Purification method based on the binding of enzymes and labeled peptides
同族专利:
公开号 | 公开日 AT511130A5|2015-07-15| WO2011046520A1|2011-04-21|
引用文献:
公开号 | 申请日 | 公开日 | 申请人 | 专利标题 US20030219451A1|2001-10-29|2003-11-27|Whitehead Institute For Biomedical Research|Stable helical C peptides and uses therefor| GB2393959A|2002-10-08|2004-04-14|Isis Innovation|Protein lattice|WO2014126828A1|2013-02-15|2014-08-21|Merck Sharp & Dohme Corp.|Novel ph-switchable peptides for membrane insertion and pore formation| WO2018192719A1|2017-04-20|2018-10-25|Institut National De La Sante Et De La Recherche Medicale|Peptides, especially polypeptides, phage display screening method and associated means, and their uses for research and biomedical applications|
法律状态:
2016-05-15| REJ| Rejection|Effective date: 20160515 |
优先权:
[返回顶部]
申请号 | 申请日 | 专利标题 PCT/SI2009/000048|WO2011046520A1|2009-10-12|2009-10-12|Polypeptide material with adjustable pore properties| 相关专利
Sulfonates, polymers, resist compositions and patterning process
Washing machine
Washing machine
Device for fixture finishing and tension adjusting of membrane
Structure for Equipping Band in a Plane Cathode Ray Tube
Process for preparation of 7 alpha-carboxyl 9, 11-epoxy steroids and intermediates useful therein an
国家/地区
|