The Partitioned Global Address Space (PGAS) programming paradigm offers ease-of-use in expressing parallelism through a global shared address space while emphasizing performance by providing locality awareness through the partitioning of this address space. Therefore, the interest in PGAS programming languages is growing and many new languages have emerged and are becoming ubiquitously available on nearly all modern parallel architectures. Recently, new parallel machines with multiple cores are designed for targeting high performance applications. Most of the efforts have gone into benchmarking but there are a few examples of real high performance applications running on multicore machines. In this paper, we present and evaluate a parallelization technique for implementing a local DNA sequence alignment algorithm using a PGAS based language, UPC (Unified Parallel C) on a chip multithreading architecture, the UltraSPARC T1., {"references":["El-Ghazawi T., Carlson W., Sterling T, Yelick K.: UPC: Distributed\nShared Memory Programming. Book, John Wiley and Sons Inc.,\nNewYork. ISBN: 0-471-22048-5, 2005.","Yap T. K., FriederO., Martino R. L.: Parallel Computation in Biological\nSequence Analysis. IEEE Transactions on Parallel and Distributed\nSystems, Vol. 9, N 3, pp. 283-294, 1998.","Needleman, S. B., Wunsch C. D: A general method applicable to the\nsearch for similarities in the amino acid sequence of two proteins. Journal\nof Molecular Biology, Vol. 47, pp. 443-453, 1970.","Smith W., Waterman M.: Identification of Common Molecular Subsequences.\nJournal of Molecular Biology Vol. 147, pp. 195-197, 1981.","Gaber J.: Complexity Measure Approach for Partitioned Shared Memory\nModel, Application to UPC. Research report RR-10-04. Universite de\nTechnologie de Belfort-Montbeliard, 2004.","Lu M., Lin L.: Parallel algorithms for the Longest Common Subsequence\nProblem. IEEE Transaction on Parallel and Distributed System, Vol.5,\npp.835-848, 1994.","Valiant L.G.: A bridging model for parallel computation. Communication\nof the ACM, Vol. 33, N 8, pp. 103-111, 1990.","http://www.sun.com/servers/coolthreads/t1000/benchmarks.jsp","Garcia T., Myoupo J-F., Seme D.: A Coarse-Grained Multi-computer\nalgorithm for the longest common subsequence problem. 11-th Euromicro\nConference on Parallel Distributed and Network based Processing, 2003.\n[10] Alves C. E. R., Cceres E. N., Dehne F.: Parallel Dynamic Programming\nfor Solving the String Editing Problem on a CGM/BSP. In proceeding of\nACM SPAA-02, pp. 275-281, 2002.\n[11] Alves C. E. R. Cceres E. N., Dehne F. , Song S. W.: A Parallel Wavefront\nAlgorithm for Efficient Biological Sequence Comparison. International\nConference on Computational Science and its Applications, Montreal,\nCanada, May 18-21, Lecture Notes in Computer Science, V. 2668, pp.\n249-258, 2003.\n[12] Nicholas P. P.: Searching Biological Sequence Databases Using Distributed\nAdaptive Computing. Master thesis, Master of Science in Computer\nEngineering, Virginia Polytechnic Institute and State University,\n2003, available at http://scholar.lib.vt.edu/theses/\n[13] Chen Y., Wan A., Liu W.: A fast Parallel Algorithm for Finding\nthe Longest Common Sequence of Multiple Bio-sequences. Symposium\nof Computations in Bioinformatics and Bioscience (SCBB06). In conjunction\nwith the International Multi-Symposiums on Computer and\nComputational Sciences 2006 (IMSCCS06), 2006.\n[14] Bader D.A., Madduri K.,: A Graph-Theoretic Analysis of the Human\nProtein-Interaction Network Using Multicore Parallel Algorithms. Sixth\nIEEE International Workshop on High Performance Computational Biology\n(HiCOMB-07), 2007.\n[15] Bader D. A., Kanade V., Madduri K.: SWARM, A Parallel Programming\nFramework for Multicore Processors. First Workshop on Multithreaded\nArchitectures and Applications (MTAAP-07), 2007.\n[16] Voss G., Schrder A., Mller-Wittig W. Schmidt B.: Biological\nSequence Alignment on Graphics Processing Units. Available at\nhttp://www.ntu.edu.sg/home/asbschmidt/paper/BioGPU.pdf\n[17] Kayi A., Yao Y., El-Ghazawi T., Newby G.: Experimental Evaluation\nofEmerging Multi-core Architectures, Workshop on Performance\nModeling, Evaluation, and Optimisation of Ubiquitous Computing and\nNetworked Systems (PMEO07) IPDPS07 Proceedings, pp. 1-6, 2007.\n[18] Chen W-Y., Bonachea D., Duell J., Husbands P., Iancu C., Yelick K.:\nA Perfor- mance Analysis of the Berkley UPC Compiler. In Annual\nInternational Conference on Supercomputing (ICS), 2003.\n[19] Cantonnet F., El Ghazawi T., Lorenz P., Gaber J.: Fast Address Translation\nTech- niques for Distributed Shared Memory Compilers. International\nParallel and Dis- tributed Processing Symposium IPDPS06, 2006.\n[20] Bakhouya M., Gaber J., El-Ghazawi T.: Towards a Complexity Model\nfor Design and Analysis of PGAS-Based Algorithms, HPCC 2007 Proceedings,\nLNCS 4782 Springer, ISBN 978-3-540-75443-5, pp.672-682,\n2007."]}