Analysis of the genome sequence of the flowering plant Arabidopsis thaliana

Samir Kaul, Hean L. Koo, Jennifer Jenkins, Michael Rizzo, Timothy Rooney, Luke J. Tallon, Tamara Feldblyum, William Nierman, Maria Ines Benito, Xiaoying Lin, Christopher D. Town, J. Craig Venter, Claire M. Fraser, Satoshi Tabata, Yasukazu Nakamura, Takakazu Kaneko, Shusei Sato, Erika Asamizu, Tomohiko Kato, Hirokazu Kotani, Shigemi Sasamoto, Joseph R. Ecker, Athanasios Theologis, Nancy A. Federspiel, Curtis J. Palm, Brian I. Osborne, Paul Shinn, Ken Dewar, Christopher J. Kim, Eugen Buehler, Patrick Dunn, Qimin Chao, Huaming Chen, Athanasios Theologis, Brian I. Osborne, Valentina S. Vysotskaia, Catherine A. Lenz, Christopher J. Kim, Nancy F. Hansen, Shirley X. Liu, Eugen Buehler, Hootan Alta, Hitomi Sakano, Patrick Dunn, Bao Lam, Paul K. Pham, Qimin Chao, Michelle Nguyen, Guixia Yu, Huaming Chen, Audrey Southwick, Jeong Mi Lee, Molly Miranda, Mitsue J. Toriumi, Ronald W. Davis, Nancy A. Federspiel, Curtis J. Palm, Aaron B. Conway, Lane Conn, Nancy F. Hansen, Altafi Hootan, Bao Lam, R. Wambutt, G. Murphy, A. Düsterhöft, W. Stiekema, T. Pohl, K. D. Entian, N. Terryn, G. Volckaert, M. Salanoubat, N. Choisne, F. Artiguenave, J. Weissenbach, F. Quetier, M. Rieger, W. Ansorge, M. Unseld, B. Fartmann, G. Valle, Richard K. Wilson, M. Sekhon, K. Pepin, J. Murray, D. Johnson, L. Hillier, Melissa de la Bastide, Emily Huang, Lori Spiegel, Lidia Gnoj, Kristina Habermann, Neilay Dedhia, Larry Parnell, Raymond Preston, M. Marra, W. Richard McCombie, Ellson Chen, Robert Martienssen, Klaus Mayer, Kai Lemcke, Brian Haas, Dirk Haase, Stephen Rudd, Heiko Schoof, Dimitrij Frishman, Burkhard Morgenstern, Paulo Zaccaria, Hans Werner Mewes, Owen White, Todd H. Creasy, Cord Bielke, Rama Maiti, Jeremy Peterson, Maria Ermolaeva, Mihaela Pertea, John Quackenbush, Natalia Volfovsky, Dongying Wu, Steven L Salzberg, Michael Bevan, Todd M. Lowe, S. Rounsley, D. Bush, S. Subramaniam, I. Levin, S. Norris, R. Schmidt, A. Acarkan, I. Bancroft, A. Brennicke, J. A. Eisen, T. Bureau, B. A. Legault, Q. H. Le, N. Agrawal, Z. Yu, G. P. Copenhaver, S. Luo, D. Preuss, C. S. Pikaard, I. T. Paulsen, M. Sussman, A. B. Britt, D. A. Selinger, R. Pandey, V. L. Chandler, R. A. Jorgensen, D. W. Mount, C. Pikaard, G. Juergens, E. M. Meyerowitz, J. Dangl, J. D G Jones, M. Chen, J. Chory, C. Somerville

Research output: Contribution to journal › Article

Abstract

The flowering plant Arabidopsis thaliana is an important model system for identifying genes and determining their functions. Here we report the analysis of the genomic sequence of Arabidopsis. The sequenced regions cover 115.4 megabases of the 125-megabase genome and extend into centromeric regions. The evolution of Arabidopsis involved a whole-genome duplication, followed by subsequent gene loss and extensive local gene duplications, giving rise to a dynamic genome enriched by lateral gene transfer from a cyanobacterial-like ancestor of the plastid. The genome contains 25,498 genes encoding proteins from 11,000 families, similar to the functional diversity of Drosophila and Caenorhabditis elegans - the other sequenced multicellular eukaryotes. Arabidopsis has many families of new proteins but also lacks several common protein families, indicating that the sets of common proteins have undergone differential expansion and contraction in the three multicellular eukaryotes. This is the first complete genome sequence of a plant and provides the foundations for more comprehensive comparison of conserved processes in all eukaryotes, identifying a wide range of plant-specific gene functions and establishing rapid systematic ways to identify genes for crop improvement.
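
As a rough illustration of the figures quoted in the abstract, the short sketch below works out the fraction of the genome covered by the sequenced regions and the mean number of genes per protein family. The variable and function names are hypothetical, chosen only for this example, and the family count is treated as the rounded value the abstract gives, so the per-family mean is indicative at best.

```python
# Figures taken from the abstract; all names here are illustrative only.
SEQUENCED_MB = 115.4   # megabases covered by the sequenced regions
GENOME_MB = 125.0      # total genome size in megabases, as stated
GENE_COUNT = 25_498    # genes reported in the abstract
FAMILY_COUNT = 11_000  # protein families (a rounded figure in the abstract)


def coverage_fraction(sequenced_mb: float, genome_mb: float) -> float:
    """Return the fraction of the genome covered by the sequenced regions."""
    return sequenced_mb / genome_mb


def genes_per_family(genes: int, families: int) -> float:
    """Return the mean number of genes per protein family."""
    return genes / families


coverage = coverage_fraction(SEQUENCED_MB, GENOME_MB)
unsequenced_mb = GENOME_MB - SEQUENCED_MB
mean_family_size = genes_per_family(GENE_COUNT, FAMILY_COUNT)

print(f"coverage: {coverage:.1%}")
print(f"unsequenced: {unsequenced_mb:.1f} Mb")
print(f"mean genes per family: {mean_family_size:.2f}")
```

Under these numbers the sequenced regions account for a little over 92% of the genome, leaving roughly 9.6 megabases unsequenced, and the mean works out to a little over two genes per family.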

Original language: English (US)
Pages (from-to): 796-815
Number of pages: 20
Journal: Nature
Volume: 408
Issue number: 6814
DOIs: https://doi.org/10.1038/35048692
State: Published - Dec 14 2000
Externally published: Yes

Fingerprint

Arabidopsis
Sequence Analysis
Genome
Eukaryota
Proteins
Genes
Horizontal Gene Transfer
Plant Genes
Plastids
Gene Duplication
Caenorhabditis elegans
Drosophila

ASJC Scopus subject areas

  • General

Cite this

Kaul, S., Koo, H. L., Jenkins, J., Rizzo, M., Rooney, T., Tallon, L. J., ... Somerville, C. (2000). Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature, 408(6814), 796-815. https://doi.org/10.1038/35048692

@article{2a56fc1869594bbaab0476c0e4bff65e,
title = "Analysis of the genome sequence of the flowering plant Arabidopsis thaliana",
abstract = "The flowering plant Arabidopsis thaliana is an important model system for identifying genes and determining their functions. Here we report the analysis of the genomic sequence of Arabidopsis. The sequenced regions cover 115.4 megabases of the 125-megabase genome and extend into centromeric regions. The evolution of Arabidopsis involved a whole-genome duplication, followed by subsequent gene loss and extensive local gene duplications, giving rise to a dynamic genome enriched by lateral gene transfer from a cyanobacterial-like ancestor of the plastid. The genome contains 25,498 genes encoding proteins from 11,000 families, similar to the functional diversity of Drosophila and Caenorhabditis elegans - the other sequenced multicellular eukaryotes. Arabidopsis has many families of new proteins but also lacks several common protein families, indicating that the sets of common proteins have undergone differential expansion and contraction in the three multicellular eukaryotes. This is the first complete genome sequence of a plant and provides the foundations for more comprehensive comparison of conserved processes in all eukaryotes, identifying a wide range of plant-specific gene functions and establishing rapid systematic ways to identify genes for crop improvement.",
author = "Samir Kaul and Koo, {Hean L.} and Jennifer Jenkins and Michael Rizzo and Timothy Rooney and Tallon, {Luke J.} and Tamara Feldblyum and William Nierman and Benito, {Maria Ines} and Xiaoying Lin and Town, {Christopher D.} and Venter, {J. Craig} and Fraser, {Claire M.} and Satoshi Tabata and Yasukazu Nakamura and Takakazu Kaneko and Shusei Sato and Erika Asamizu and Tomohiko Kato and Hirokazu Kotani and Shigemi Sasamoto and Ecker, {Joseph R.} and Athanasios Theologis and Federspiel, {Nancy A.} and Palm, {Curtis J.} and Osborne, {Brian I.} and Paul Shinn and Ken Dewar and Kim, {Christopher J.} and Eugen Buehler and Patrick Dunn and Qimin Chao and Huaming Chen and Athanasios Theologis and Osborne, {Brian I.} and Vysotskaia, {Valentina S.} and Lenz, {Catherine A.} and Kim, {Christopher J.} and Hansen, {Nancy F.} and Liu, {Shirley X.} and Eugen Buehler and Hootan Alta and Hitomi Sakano and Patrick Dunn and Bao Lam and Pham, {Paul K.} and Qimin Chao and Michelle Nguyen and Guixia Yu and Huaming Chen and Audrey Southwick and Lee, {Jeong Mi} and Molly Miranda and Toriumi, {Mitsue J.} and Davis, {Ronald W.} and Federspiel, {Nancy A.} and Palm, {Curtis J.} and Conway, {Aaron B.} and Lane Conn and Hansen, {Nancy F.} and Altafi Hootan and Bao Lam and R. Wambutt and G. Murphy and A. D{\"u}sterh{\"o}ft and W. Stiekema and T. Pohl and Entian, {K. D.} and N. Terryn and G. Volckaert and M. Salanoubat and N. Choisne and F. Artiguenave and J. Weissenbach and F. Quetier and M. Rieger and W. Ansorge and M. Unseld and B. Fartmann and G. Valle and Wilson, {Richard K.} and M. Sekhon and K. Pepin and J. Murray and D. Johnson and L. Hillier and {de la Bastide}, Melissa and Emily Huang and Lori Spiegel and Lidia Gnoj and Kristina Habermann and Neilay Dedhia and Larry Parnell and Raymond Preston and M. Marra and McCombie, {W. 
Richard} and Ellson Chen and Robert Martienssen and Klaus Mayer and Kai Lemcke and Brian Haas and Dirk Haase and Stephen Rudd and Heiko Schoof and Dimitrij Frishman and Burkhard Morgenstern and Paulo Zaccaria and Mewes, {Hans Werner} and Owen White and Creasy, {Todd H.} and Cord Bielke and Rama Maiti and Jeremy Peterson and Maria Ermolaeva and Mihaela Pertea and John Quackenbush and Natalia Volfovsky and Dongying Wu and Salzberg, {Steven L} and Michael Bevan and Lowe, {Todd M.} and S. Rounsley and D. Bush and S. Subramaniam and I. Levin and S. Norris and R. Schmidt and A. Acarkan and I. Bancroft and A. Brennicke and Eisen, {J. A.} and T. Bureau and Legault, {B. A.} and Le, {Q. H.} and N. Agrawal and Z. Yu and Copenhaver, {G. P.} and S. Luo and D. Preuss and Pikaard, {C. S.} and Paulsen, {I. T.} and M. Sussman and Britt, {A. B.} and Selinger, {D. A.} and R. Pandey and Chandler, {V. L.} and Jorgensen, {R. A.} and Mount, {D. W.} and C. Pikaard and G. Juergens and Meyerowitz, {E. M.} and J. Dangl and Jones, {J. D G} and M. Chen and J. Chory and C. Somerville",
year = "2000",
month = "12",
day = "14",
doi = "10.1038/35048692",
language = "English (US)",
volume = "408",
pages = "796--815",
journal = "Nature",
issn = "0028-0836",
publisher = "Nature Publishing Group",
number = "6814",

}

TY - JOUR

T1 - Analysis of the genome sequence of the flowering plant Arabidopsis thaliana

AU - Kaul, Samir

AU - Koo, Hean L.

AU - Jenkins, Jennifer

AU - Rizzo, Michael

AU - Rooney, Timothy

AU - Tallon, Luke J.

AU - Feldblyum, Tamara

AU - Nierman, William

AU - Benito, Maria Ines

AU - Lin, Xiaoying

AU - Town, Christopher D.

AU - Venter, J. Craig

AU - Fraser, Claire M.

AU - Tabata, Satoshi

AU - Nakamura, Yasukazu

AU - Kaneko, Takakazu

AU - Sato, Shusei

AU - Asamizu, Erika

AU - Kato, Tomohiko

AU - Kotani, Hirokazu

AU - Sasamoto, Shigemi

AU - Ecker, Joseph R.

AU - Theologis, Athanasios

AU - Federspiel, Nancy A.

AU - Palm, Curtis J.

AU - Osborne, Brian I.

AU - Shinn, Paul

AU - Dewar, Ken

AU - Kim, Christopher J.

AU - Buehler, Eugen

AU - Dunn, Patrick

AU - Chao, Qimin

AU - Chen, Huaming

AU - Theologis, Athanasios

AU - Osborne, Brian I.

AU - Vysotskaia, Valentina S.

AU - Lenz, Catherine A.

AU - Kim, Christopher J.

AU - Hansen, Nancy F.

AU - Liu, Shirley X.

AU - Buehler, Eugen

AU - Alta, Hootan

AU - Sakano, Hitomi

AU - Dunn, Patrick

AU - Lam, Bao

AU - Pham, Paul K.

AU - Chao, Qimin

AU - Nguyen, Michelle

AU - Yu, Guixia

AU - Chen, Huaming

AU - Southwick, Audrey

AU - Lee, Jeong Mi

AU - Miranda, Molly

AU - Toriumi, Mitsue J.

AU - Davis, Ronald W.

AU - Federspiel, Nancy A.

AU - Palm, Curtis J.

AU - Conway, Aaron B.

AU - Conn, Lane

AU - Hansen, Nancy F.

AU - Hootan, Altafi

AU - Lam, Bao

AU - Wambutt, R.

AU - Murphy, G.

AU - Düsterhöft, A.

AU - Stiekema, W.

AU - Pohl, T.

AU - Entian, K. D.

AU - Terryn, N.

AU - Volckaert, G.

AU - Salanoubat, M.

AU - Choisne, N.

AU - Artiguenave, F.

AU - Weissenbach, J.

AU - Quetier, F.

AU - Rieger, M.

AU - Ansorge, W.

AU - Unseld, M.

AU - Fartmann, B.

AU - Valle, G.

AU - Wilson, Richard K.

AU - Sekhon, M.

AU - Pepin, K.

AU - Murray, J.

AU - Johnson, D.

AU - Hillier, L.

AU - de la Bastide, Melissa

AU - Huang, Emily

AU - Spiegel, Lori

AU - Gnoj, Lidia

AU - Habermann, Kristina

AU - Dedhia, Neilay

AU - Parnell, Larry

AU - Preston, Raymond

AU - Marra, M.

AU - McCombie, W. Richard

AU - Chen, Ellson

AU - Martienssen, Robert

AU - Mayer, Klaus

AU - Lemcke, Kai

AU - Haas, Brian

AU - Haase, Dirk

AU - Rudd, Stephen

AU - Schoof, Heiko

AU - Frishman, Dimitrij

AU - Morgenstern, Burkhard

AU - Zaccaria, Paulo

AU - Mewes, Hans Werner

AU - White, Owen

AU - Creasy, Todd H.

AU - Bielke, Cord

AU - Maiti, Rama

AU - Peterson, Jeremy

AU - Ermolaeva, Maria

AU - Pertea, Mihaela

AU - Quackenbush, John

AU - Volfovsky, Natalia

AU - Wu, Dongying

AU - Salzberg, Steven L

AU - Bevan, Michael

AU - Lowe, Todd M.

AU - Rounsley, S.

AU - Bush, D.

AU - Subramaniam, S.

AU - Levin, I.

AU - Norris, S.

AU - Schmidt, R.

AU - Acarkan, A.

AU - Bancroft, I.

AU - Brennicke, A.

AU - Eisen, J. A.

AU - Bureau, T.

AU - Legault, B. A.

AU - Le, Q. H.

AU - Agrawal, N.

AU - Yu, Z.

AU - Copenhaver, G. P.

AU - Luo, S.

AU - Preuss, D.

AU - Pikaard, C. S.

AU - Paulsen, I. T.

AU - Sussman, M.

AU - Britt, A. B.

AU - Selinger, D. A.

AU - Pandey, R.

AU - Chandler, V. L.

AU - Jorgensen, R. A.

AU - Mount, D. W.

AU - Pikaard, C.

AU - Juergens, G.

AU - Meyerowitz, E. M.

AU - Dangl, J.

AU - Jones, J. D G

AU - Chen, M.

AU - Chory, J.

AU - Somerville, C.

PY - 2000/12/14

Y1 - 2000/12/14

N2 - The flowering plant Arabidopsis thaliana is an important model system for identifying genes and determining their functions. Here we report the analysis of the genomic sequence of Arabidopsis. The sequenced regions cover 115.4 megabases of the 125-megabase genome and extend into centromeric regions. The evolution of Arabidopsis involved a whole-genome duplication, followed by subsequent gene loss and extensive local gene duplications, giving rise to a dynamic genome enriched by lateral gene transfer from a cyanobacterial-like ancestor of the plastid. The genome contains 25,498 genes encoding proteins from 11,000 families, similar to the functional diversity of Drosophila and Caenorhabditis elegans - the other sequenced multicellular eukaryotes. Arabidopsis has many families of new proteins but also lacks several common protein families, indicating that the sets of common proteins have undergone differential expansion and contraction in the three multicellular eukaryotes. This is the first complete genome sequence of a plant and provides the foundations for more comprehensive comparison of conserved processes in all eukaryotes, identifying a wide range of plant-specific gene functions and establishing rapid systematic ways to identify genes for crop improvement.

AB - The flowering plant Arabidopsis thaliana is an important model system for identifying genes and determining their functions. Here we report the analysis of the genomic sequence of Arabidopsis. The sequenced regions cover 115.4 megabases of the 125-megabase genome and extend into centromeric regions. The evolution of Arabidopsis involved a whole-genome duplication, followed by subsequent gene loss and extensive local gene duplications, giving rise to a dynamic genome enriched by lateral gene transfer from a cyanobacterial-like ancestor of the plastid. The genome contains 25,498 genes encoding proteins from 11,000 families, similar to the functional diversity of Drosophila and Caenorhabditis elegans - the other sequenced multicellular eukaryotes. Arabidopsis has many families of new proteins but also lacks several common protein families, indicating that the sets of common proteins have undergone differential expansion and contraction in the three multicellular eukaryotes. This is the first complete genome sequence of a plant and provides the foundations for more comprehensive comparison of conserved processes in all eukaryotes, identifying a wide range of plant-specific gene functions and establishing rapid systematic ways to identify genes for crop improvement.

UR - http://www.scopus.com/inward/record.url?scp=0034649566&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0034649566&partnerID=8YFLogxK

U2 - 10.1038/35048692

DO - 10.1038/35048692

M3 - Article

C2 - 11130711

AN - SCOPUS:0034649566

VL - 408

SP - 796

EP - 815

JO - Nature

JF - Nature

SN - 0028-0836

IS - 6814

ER -