SolrCloud performance test
来源:互联网 发布:世界军事软件 编辑:程序博客网 时间:2024/06/13 22:04
http://wikicentral.cisco.com/display/PROJECT/SolrCloud+Performance+Test
environment|
SolrCloud servers: X.X.X.71, X.X.X.72, X.X.X.73. 72 and 73 with Mem:16G CPU:16 core 2.4GHz; 71 with Mem:8G CPU:4 core 2.27GHz
zookeeper servers: X.X.X.22, X.X.X.23, X.X.X.24
OS: Linux x86_64 GNU/Linux
tools: jmeter2.6 , youykit11.0.5
config & start service
zookeeper run as default parameters and config(zookeeper start)
SlorCloud configeration refer to (solrloud start)
1. for X.X.X.71 add $SOLRCLOUD_HOME/example/solr/conf/schema.xmlwith fields as:
<fields> <!-- Valid attributes for fields: name: mandatory - the name for the field type: mandatory - the name of a previously defined type from the <types> section indexed: true if this field should be indexed (searchable or sortable) stored: true if this field should be retrievable multiValued: true if this field may contain multiple values per document omitNorms: (expert) set to true to omit the norms associated with this field (this disables length normalization and index-time boosting for the field, and saves some memory). Only full-text fields or fields that need an index-time boost need norms. Norms are omitted for primitive (non-analyzed) types by default. termVectors: [false] set to true to store the term vector for a given field. When using MoreLikeThis, fields used for similarity should be stored for best performance. termPositions: Store position information with the term vector. This will increase storage costs. termOffsets: Store offset information with the term vector. This will increase storage costs. required: The field is required. It will throw an error if the value does not exist default: a value that should be used if no value is specified when adding a document. --> <field name="id" type="string" indexed="true" stored="true" required="true" /> <field name="ts" type="text_general" indexed="true" stored="true"/> <field name="name" type="text_general" indexed="true" stored="true"/> <field name="age" type="text_general" indexed="true" stored="true"/> <field name="company" type="text_general" indexed="true" stored="true"/> <field name="branch" type="text_general" indexed="true" stored="true"/> <field name="mail" type="text_general" indexed="true" stored="true"/> <field name="interest" type="text_general" indexed="true" stored="true"/> <field name="address" type="text_general" indexed="true" stored="true"/> <field name="text_general" type="text_general" indexed="true" stored="false" multiValued="true" /> </fields>
change the jetty threads limit:
open $SOLR_HOME/example/etc/jetty.xml ,change maxThreads to 10000
<!-- =========================================================== --> <!-- Server Thread Pool --> <!-- =========================================================== --> <Set name="ThreadPool"> <!-- Default queued blocking threadpool --> <New class="org.eclipse.jetty.util.thread.QueuedThreadPool"> <Set name="minThreads">10</Set> <Set name="maxThreads">10000</Set> <Set name="detailedDump">false</Set> </New> </Set>
this threads num will effect the pressure test. so a big vaule is necessary
2. cd to $SOLR_HOME/example
3. start SolrCloud with command on X.X.X.71:
java -Xmx6g -Xms6g -Dbootstrap_confdir=./solr/conf -Dcollection.configName=myconf -Djetty.port=8900 -DzkHost=X.X.X.22:2181,X.X.X.23:2181,X.X.X.24:2181 -DnumShards=1 -jar start.jar
4. start SolrCloud with command on 72 and 73
java -Xmx6g -Xms6g -Djetty.port=8900 -DzkHost=X.X.X.22:2181,X.X.X.23:2181,X.X.X.24:2181 -jar start.jar
5. access solr admin tool http://X.X.X.71:8900/solr/ and will see follow graph:
so finally got one shard and tow replicas
preper jmeter script & index data
1. the index request for preper index-data to solrcloud cluster look like :
POST http://X.X.X.72:8900/solr/collection1/updatePOST data:<add><doc><field name="id">4al6q90c-8ouj-1255-Sind-201206282rmo</field><field name="ts">1340859312956</field><field name="name">byan</field><field name="age">21</field><field name="company">Ciscobyan Systemsbyan, Incbyan</field><field name="branch">Cloudbyan Applicationbyan Servicesbyan</field><field name="mail">byan@cisco.com</field><field name="interest">Have intensive interest in Internet-surfingbyan,singingbyan, writingbyan and readingbyan </field><field name="address">abyan, Gatebyan Buidingbyan Streetbyan Provincebyan Contrybyan</field></doc></add>[no cookies]Request Headers:Content-Length: 598Connection: keep-aliveContent-Type: application/xml
document for indexing:
<add><doc> <field name="id">c7${id_counter}</field> <field name="ts">1342463467567</field> <field name="name">person</field> <field name="age">10</field> <field name="company">Cisco Systems, Inc.</field> <field name="branch">Cloud Application Services</field> <field name="mail">person@cisco.com</field> <field name="interest">Have intensive interest in Internet-surfing,singing, writing and reading.</field> <field name="address">address,The Golden Gate Bridge,Wall Street.</field></doc></add>
each doc have a size about 300 bytes
the detail of this script can find here:solr_prepare_data_cluster.jmx
2. after got 15G index data with more than 20 million docs, we are ready to test the performance ofSolrCloud
Test & Result
all test with NRT availible
one or more clients to request to X.X.X.71-73 randomly. this test case with 1 shard and tow replicas
1. index performance test (without search): index jmeter script solr_index_cluster.jmx
index result:
2. search performance test(without index):search jmeter script solr_search_cluster.jmx
3. index performance test (with search)
index result:
at the same time the search result:
Summary
the performance of indexing without searching is not bad. But the performance of indexing while searching running is not good. It's look like the searching effect strongly on the performance of indexing. we are digging deeply to find why .
- SolrCloud performance test
- SolrCloud Capability Test
- SolrCloud Performance 测试(query-fetch)
- Javascript Performance Test (3)
- Javascript Performance Test (4)
- Berkeley DB Performance Test
- Berkeley DB Performance Test
- ElasticSearch(ES) performance Test
- WebGL performance Test URL
- performance &OS Test Note
- real time performance test
- 2 performance test tools
- system performance test snapshot
- real time performance test
- USKSA Performance Test Script
- performance-test-of-zhifubao
- Use ApacheBench test HTTP performance.
- Load and Performance Test Tools
- C#基础 之 string
- sql server 与excel 的表关系,导入导出
- Ubuntu更新源设置
- Linux下Vi命令详解
- C#基础 之 预处理介绍
- SolrCloud performance test
- ubuntu下firefox版本的替换
- 关于软件测试的理解比较学术的几个角度
- C#基础 之 List
- 逐步修改素数高效算法
- 建立自己的知识体系
- win32汇编提醒
- 新昊旅游网站项目总结
- Solution Manager 2.1 To 7.1