... our study, we used two different alphabets: a set of 20 amino acids residues, A , and a hydropathy-based alphabet, ΣH , derived from grammar complexity and syntactic structure of protein sequences ... MIV j (A) and MIVk (B), can be concatenated to form MIV j+k (C) ANALYSIS OF CORRELATION IN PROTEIN SEQUENCES In [1], Weiss states that protein sequences can be regarded as slightly edited random ... rather than to reach a specific classification accuracy We used the Pfam -A dataset to carry out this comparison The families contained in the Pfam database vary in sequence count and sequence length...